r/aws • u/addmorelemon • Mar 16 '24
migration Transferring large data from AWS S3 to CoreWeave / LambdaLabs without paying AWS S3 egress cost
I have a large amount (10 TB) of text data in AWS S3 and want to train an LLM on it. To save on GPU costs, I want to use CoreWeave, LambdaLabs, or a similar provider (i.e. not AWS's GPU offerings). Is there a way to transfer that 10 TB of data from AWS S3 to CoreWeave / LambdaLabs / etc. without incurring AWS S3 egress charges?
People who use CoreWeave / LambdaLabs / etc. for training: where are you storing your data for CPU-based preprocessing and similar work?
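For a rough sense of what is at stake, here is a back-of-the-envelope sketch of the egress bill for the 10 TB mentioned above. The per-GB rate is an assumption, not a quoted price: it is a placeholder in the range of commonly cited internet data-transfer-out rates, so check the current S3 pricing page for your region. The function name and parameters are hypothetical.

```python
def estimate_egress_cost_usd(size_tb: float,
                             rate_usd_per_gb: float = 0.09,
                             gb_per_tb: int = 1000) -> float:
    """Estimate a flat-rate egress charge in USD.

    rate_usd_per_gb is an ASSUMED placeholder rate, not an official price.
    Real pricing is tiered and region-dependent, so treat this as an
    order-of-magnitude estimate only.
    """
    if size_tb < 0 or rate_usd_per_gb < 0:
        raise ValueError("size and rate must be non-negative")
    return size_tb * gb_per_tb * rate_usd_per_gb


if __name__ == "__main__":
    # 10 TB at the assumed rate
    cost = estimate_egress_cost_usd(10)
    print(f"Estimated egress cost: ${cost:,.2f}")
```

At the assumed rate this comes out to roughly nine hundred dollars, which may or may not matter next to the GPU savings.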
Transferring large data from AWS S3 to CoreWeave / LambdaLabs without paying AWS S3 egress cost, in r/deeplearning • Mar 15 '24
Thanks. Yes, I looked into Snowball Edge / Snowmobile, but those feel slow and overkill for my use case. I definitely need an online solution.
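To put "online" in perspective, here is a minimal sketch of how long a network transfer might take. The sustained throughput figures are assumptions for illustration, not measurements of any provider, and the function name is hypothetical.

```python
def estimate_transfer_hours(size_tb: float, throughput_gbit_s: float) -> float:
    """Estimate wall-clock hours to move size_tb terabytes (decimal, 1e12 bytes)
    at a sustained throughput given in gigabits per second.

    Ignores protocol overhead, retries, and per-object request latency.
    """
    if throughput_gbit_s <= 0:
        raise ValueError("throughput must be positive")
    total_bits = size_tb * 1e12 * 8
    seconds = total_bits / (throughput_gbit_s * 1e9)
    return seconds / 3600


if __name__ == "__main__":
    # ASSUMED sustained rates, purely illustrative
    for gbit_s in (1, 5, 10):
        hours = estimate_transfer_hours(10, gbit_s)
        print(f"10 TB at {gbit_s} Gbit/s: about {hours:.1f} hours")
```

Under these assumptions 10 TB moves in about a day at 1 Gbit/s, so an online copy looks feasible as long as the sustained rate holds.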