<img height="1" width="1" src="https://www.facebook.com/tr?id=2072464173003314&ev=PageView &noscript=1">

AnswerIQ Technology

Damian Eads

Dr. Damian Eads is a founder of wise.io and creator of its core machine learning technology.
Find me on:

Recent Posts

ParaText: CSV parsing at 2.5 GB per second

Despite extensive use of distributed databases and filesystems in data-driven workflows, there remains a persistent need to rapidly read text files on single machines. Surprisingly, most modern text file readers fail to take advantage of multi-core architectures, leaving much of the I/O bandwidth unused on high performance storage systems. Introduced here, ParaText, reads text files in parallel on a single multi-core machine to consume more of that bandwidth. The alpha release includes a parallel Comma Separated Values (CSV) reader with Python bindings.

Read More

Topics: Machine Learning, Data Science, Software Engineering