Windows build without multiprocessing
Main changes
- Rebase on master
- Revert to master's multiprocessing handling and run serially on Windows. Adapting the multiprocessing code for Windows made performance much worse on Linux
- CI: build wheels and publish to PyPI