Pandas (image post, mander.xyz)
fossilesque@mander.xyz (M) to Science Memes@mander.xyz · English · 2 months ago · 11 comments
Kausta@lemm.ee · 2 months ago
You haven't seen anything until you need to put a 4.2 GB gzipped CSV into a pandas dataframe, which, I should note, works without any issues.
QuizzaciousOtter@lemm.ee · 2 months ago
I really don’t think that’s a lot either. Nowadays we routinely process terabytes of data.
Kausta@lemm.ee · 2 months ago
Yeah, it was just a simple example. Although using just pandas (without something like dask) for loading terabytes of data at once into a single dataframe may not be the best idea, even with enough memory.
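
For anyone wondering what the middle ground looks like: a minimal sketch of reading a gzipped CSV in pieces with plain pandas, rather than pulling everything into one dataframe. This uses the `chunksize` option of `pd.read_csv`, not dask. The helper name `chunked_column_sum`, the file name `sample.csv.gz`, and the `id`/`value` columns are made up for illustration, and the tiny generated file only stands in for a real multi-gigabyte one.

```python
import gzip
import os
import tempfile

import pandas as pd


def chunked_column_sum(path, column, chunksize=100_000):
    """Sum one numeric column of a gzipped CSV without loading it all at once.

    Hypothetical helper for illustration; not part of pandas.
    """
    total = 0
    # With chunksize set, read_csv yields DataFrames of at most `chunksize`
    # rows instead of returning a single DataFrame.
    reader = pd.read_csv(
        path,
        compression="gzip",
        usecols=[column],  # only parse the column we actually need
        chunksize=chunksize,
    )
    for chunk in reader:
        total += chunk[column].sum()
    return total


# Build a small gzipped CSV as a stand-in for the large file (assumed layout).
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, "sample.csv.gz")
with gzip.open(sample_path, "wt", newline="") as handle:
    handle.write("id,value\n")
    for i in range(1000):
        handle.write(f"{i},{i * 2}\n")

result = chunked_column_sum(sample_path, "value", chunksize=128)
print(result)
```

This keeps only one chunk in memory at a time, so it works for aggregations that can be accumulated piece by piece. Anything that needs the whole table at once (a global sort, for instance) is where something like dask starts to make more sense.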