Co-create with Ideas2IT











From Bottleneck to Breakthrough: A Case Study in Research Data Modernization
SLU’s researchers needed to analyze 450 TB of anonymized cell data to study healthcare access, economic mobility, and homelessness. But legacy ingestion pipelines and fragmented storage slowed everything down. Ideas2IT re-architected their platform using AWS EMR, Spark, S3, and EC2 cutting query times from 15 minutes to 30 seconds and reducing monthly cloud costs to $28K. The result? A fast, cost-efficient, and researcher-friendly analytics engine that unlocked real-time, high-impact insights.
SLU’s research hinged on large-scale analysis of Veraset-provided mobility data—but their platform couldn’t keep up.
Their setup wasn’t built for scale, speed, or ease of access. The infrastructure bottleneck was turning cutting-edge research into a slow, expensive process.
Ideas2IT rebuilt SLU’s research data platform from the ground up using AWS-native services with performance, cost, and researcher usability as core goals.
We eliminated the fragmented intermediate S3 layer holding micro-files, redesigned the ingestion logic for batch-friendly structuring, and set up secure, role-based researcher access, removing the dependency on IT teams.
All infrastructure and pipeline tuning was done in close collaboration with SLU’s research computing team and AWS, ensuring alignment with grant requirements, data protection policies, and compliance mandates.
This modernization transformed SLU’s research capability from slow and brittle to fast, cost-efficient, and future-proof.
Big data infrastructure doesn’t have to be big-budget or big-effort. For SLU, aligning cloud-native architecture with actual research workflows was the unlock. We didn’t just modernize their stack, we turned data friction into discovery velocity.









