- #1
ergospherical
- 1,072
- 1,365
I've got some code in a public repo with a module containing my model and a python script which runs the model and returns a .parquet file with data. I've parallelized all of the important processes with Joblib, and the underlying code is written in pure C so I can't make it much faster (at least, to the best of my knowledge. I'm sure someone else can).
That said, it's taking about 3 days to run on my crappy laptop. Did some digging and found that Amazon AWS might be sensible:
https://docs.aws.amazon.com/systems...egration-github-python.html?tag=pfamazon01-20
There are no EC2 free tier offers, so it's a good idea to make sure it's the right option before proceeding. Are there better alternatives (I just want to run fast on a big compute-optimized cluster, and I don't need much memory or storage optimization).
Has anyone used EC2 before? Is it relatively painless to set up?
That said, it's taking about 3 days to run on my crappy laptop. Did some digging and found that Amazon AWS might be sensible:
https://docs.aws.amazon.com/systems...egration-github-python.html?tag=pfamazon01-20
There are no EC2 free tier offers, so it's a good idea to make sure it's the right option before proceeding. Are there better alternatives (I just want to run fast on a big compute-optimized cluster, and I don't need much memory or storage optimization).
Has anyone used EC2 before? Is it relatively painless to set up?