Data Lake Engineer
SOSi
Posted: March 17, 2026
Quick Summary
A Data Lake Engineer is responsible for developing and maintaining a scalable, federated data ecosystem to enhance interoperability, governance, and mission-driven analytics for a DoD customer.
Job Description
Founded in 1989, SOSi is among the largest private, founder-owned technology and services integrators in the defense and government services industry. We deliver tailored solutions, tested leadership, and trusted results to enable national security missions worldwide.
SOSi is seeking a Data Lake Engineer to support a structured effort to further develop, integrate, and sustain a scalable, federated data ecosystem that enhances interoperability, governance, and mission-driven analytics for a DoD customer. The program's primary objective is to bridge operational gaps among DoD, IC, interagency, and non-traditional international partners, enabling real-time information sharing, dynamic data integration, and mission-tailored analytical capabilities.
Essential Job Duties:
• The contractor shall design, implement, and maintain scalable Data Lake architectures to support structured and unstructured data ingestion, ensuring efficient data access and retrieval.
• The contractor shall configure and manage the integration interface between the Data Lake and the knowledge graph platform (Stardog), including SPARQL endpoint access, metadata federation, and catalog alignment.
• The contractor shall follow access control policies and usage scope defined by the Government and other coordinated Work Orders.
• The contractor shall confirm compliance with access policies on a quarterly basis and document the results in the Data Governance & Compliance Report.
• The contractor shall optimize ETL pipelines for high-volume data transformation, ensuring compliance with DoD IL-4/IL-5 security standards.
• The contractor shall implement storage tiering strategies and access controls, ensuring data is properly classified, retained, and accessed per DoD governance requirements.
• The contractor shall submit the Data Lake Performance & Optimization Report, detailing ingestion efficiency, access control improvements, and storage utilization metrics.
Qualifications:
• Active TS/SCI Clearance.
• Master’s degree or higher (e.g., Ph.D.) in Computer Science, Information Technology, Systems Engineering, Data Science, Business Administration, Engineering Management, or a closely related field, or a minimum of eleven (11) years of experience managing complex technical projects in enterprise data architecture, Databricks administration, and cloud-based data platforms.
• Knowledge and capability to support Data Lake platform administration and enterprise data architecture for DoD data-driven projects.
• Skilled in Data Lake platform administration, including workspace management and configuration, cluster optimization and performance tuning, cloud integration, and Unity Catalog integration for secure data governance.
• Proficient in ETL/ELT pipeline development, Delta Lake architecture and optimization, AI/ML workflow integration, and Data Lakehouse optimization for DoD analytics and mission-critical data workflows.
• Experienced in SysEngOps, DevSecOps, version control systems (Git), and CI/CD pipelines to streamline Data Lake development and deployment.
• Knowledgeable in identity and access management (IAM), role-based access control (RBAC), and cloud security best practices across AWS, Azure, and GCP.
• Hands-on expertise in Python, SQL/NoSQL, Apache Spark, Databricks SQL, Terraform, and cloud-native data services for large-scale data processing and analytics.
Work Environment
• Normal office conditions
Working at SOSi
All interested individuals will receive consideration and will not be discriminated against for any reason.