Distributed Data Ingestion and Preprocessing