About the role
<div class="content-intro"><p>M9 Solutions is dedicated to providing IT services and solutions to the Federal Government by mobilizing the right people, skills, clearance levels, and technologies to help organizations that desire improved performance and modern, sustainable change. M9 has provided quality IT services and support to more than 30 Federal Agencies and multiple commercial customers nationwide. Our capabilities include IT Talent Solutions, Data Delivery &amp; Analytics, Cyber Security, Cloud Migration, Applications and Infrastructure, Software Development, and Finance &amp; Accounting.</p> <p>&nbsp;</p></div><p>M9 Solutions is seeking a <strong>Data Engineer</strong>&nbsp;to work onsite in support of a government contract for a client located in&nbsp;<strong>Springfield, VA</strong>. An active&nbsp;<strong>TS/SCI clearance</strong>&nbsp;is required.</p> <p><strong>Responsibilities</strong></p> <ul> <li>Data Ingestion &amp; Acquisition: Collect and integrate data from a wide variety of structured and unstructured sources, including APIs, RDBMS, file systems, third-party services, and real-time streams.</li> <li>Pipeline Development: Design and implement scalable ETL/ELT pipelines to clean, enrich, normalize, and semantically align data (ontology-driven transformations).</li> <li>Cloud Deployment: Build and deploy data pipelines and associated infrastructure on AWS or Azure, using managed services like Lambda, Glue, Step Functions, Azure Data Factory, etc.</li> <li>Database Architecture: Understand and optimize for different storage engines—relational (PostgreSQL, MySQL), columnar (Redshift, BigQuery), indexing engines (ElasticSearch), key-value stores (DynamoDB, Redis), Object stores (S3 or similar), and caching layers.</li> <li>Streaming Data Processing: Work with Apache Kafka (or similar platforms) to handle high-volume, low-latency data streams.</li> <li>Workflow Orchestration: Utilize Apache Airflow (or equivalent) to schedule and monitor complex data workflows.</li> <li>AI/ML Integration: Collaborate with data scientists to integrate LLMs and ML models into pipelines for inference, tagging, enrichment, or intelligent routing of data.</li> </ul> <p><strong>Required Skills and Qualifications</strong></p> <ul> <li>Active TS/SCI security clearance.</li> <li>Bachelor's or master’s degree in computer science, engineering, or related field.</li> <li>10+ years of experience in data engineering or software development roles.</li> <li>Strong proficiency in Python, including experience with libraries like pandas, PySpark, FastAPI, or similar.</li> <li>Solid experience with cloud services (AWS or Azure) and Cloud n