Actually, DirectX emulates vs1.1 on the CPU rather quickly, although not as fast as running it on the GPU. Pixel shaders (ps, any version), however, don't emulate well and can be quite slow. Luckily, the code you write for emulated vs1.1 is the EXACT same code you'd write for the GPU, since DirectX handles all the emulation.
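
To make that concrete, here's a rough sketch of the only decision your app really has to make: run vertex shaders on the GPU if it reports vs1.1 or better, otherwise fall back to CPU emulation. The shader itself doesn't change either way. Everything here (ShaderVersion, VertexProcessing, chooseVertexProcessing) is a made-up stand-in, not a DirectX type. I'm assuming Direct3D 9, where the real version would read D3DCAPS9::VertexShaderVersion and pass D3DCREATE_HARDWARE_VERTEXPROCESSING or D3DCREATE_SOFTWARE_VERTEXPROCESSING when creating the device.

```cpp
// Hypothetical stand-ins for illustration only; these are NOT DirectX types.
struct ShaderVersion {
    int majorVer;
    int minorVer;
};

enum class VertexProcessing { Hardware, Software };

// True if 'have' is the same version as 'need' or newer.
inline bool atLeast(ShaderVersion have, ShaderVersion need) {
    return have.majorVer > need.majorVer ||
           (have.majorVer == need.majorVer && have.minorVer >= need.minorVer);
}

// Use the GPU when it supports vs1.1; otherwise let the runtime emulate
// the vertex shader on the CPU. The shader code is identical in both cases.
inline VertexProcessing chooseVertexProcessing(ShaderVersion gpuVs) {
    const ShaderVersion needed{1, 1};
    return atLeast(gpuVs, needed) ? VertexProcessing::Hardware
                                  : VertexProcessing::Software;
}
```

Note there's no equivalent fallback worth having for pixel shaders, for the reason above: if the card doesn't support the ps version you need, you're better off with a different rendering path.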
Hope ...