Parquet and Arrow are great columnar formats, but the abysmal performance of TextDecoder on Chrome (or maybe V8) completely kills any speed benefits when files contain lots of short, unique strings. Even #JavaScript CSV parsers are faster. It’s fine if you don’t need to access the string columns 🤷
5 months ago