
Data Engineer - Mid

Req #: 243484
Location: Bolling AFB, DC, US
Job Category: Engineering
Minimum Clearance Required: TS/SCI

Job Description

CACI is looking for a Data Engineer to support the IC. The individual will design, implement, and operate data-driven approaches to the client’s intelligence requirements, and will design how data will be stored, accessed, used, integrated, and managed by different data regimes and digital systems as necessary or directed. The engineer will work with OSINT / Open Source analysts to determine, create, and populate optimal data architectures, structures, systems, process automation, and models to support customer intelligence requirements.

Primary Responsibilities:

  • Support the identification, prioritization, and scheduling of data modeling and processing requirements with users
  • Report the status of all data extraction, transformation, and load activities
  • Reconstruct data provided in XML, delimited text, email (e.g. eml, mbox, pst), and a variety of database systems (SQL Server, Oracle, PostgreSQL, MySQL)
  • Apply semantic data modeling techniques to classify, aggregate, and generalize data stored in hierarchical, network, or relational database management systems to define the meaning of data within the context of its interrelationships with other data
  • Validate semantic data models with users
  • Transform semantic data models into physical database designs
  • Design physical database management systems to represent semantic data models, including relational and object-relational databases (e.g. PostgreSQL, SQL Server, MySQL), key-value stores, inverted indexes (Lucene, Elasticsearch), and distributed file systems (e.g. Tachyon, HDFS)
  • Write software code and scripts, and use commercial-off-the-shelf, government-off-the-shelf, and open source software to extract objects (e.g. entities, events, documents, and relationships) from structured and unstructured data and multimedia (e.g. EXIF)
  • Write software code and scripts, and use commercial-off-the-shelf, government-off-the-shelf, and open source software to transform entities and apply general data cleansing, transformation, and augmentation methods
  • Create and maintain a repository of software code and scripts (e.g. Java and Python) for rapidly extracting, transforming, and loading a variety of structured and unstructured data sources
  • Integrate software code and scripts for the automation of repeatable extraction, transformation, and loading

Minimum Education and Experience Required:

  • MA/MS in Data Science, Data Analytics, Informatics, Statistics, or related field AND 3 years applicable experience.
  • BA/BS in Data Science, Data Analytics, Informatics, Statistics, or related field AND 5+ years applicable experience.
  • Excellent written & oral communication, research, and analytic skills
  • Expert ability to manage personnel, requirements, and coordination of projects
  • Expert capabilities to research, create, develop, and deliver professional briefings, multimedia presentations, and written reports
  • Experience using programming languages such as SAS, R, Java, C, MATLAB, Scala, or Python; experience accelerating large data transactions across industry-leading GPU architectures to answer analytic questions
  • Experience with assessments, enterprise data integration, governance, and metrics, including the application of metadata management techniques and ability to interrogate databases efficiently using SQL
  • Able to execute data science methods using common programming/scripting languages: Python, Java, Scala, R (statistics)
  • Able to execute data science methods using parallel computing frameworks (e.g. Deeplearning4j, Torch, TensorFlow, Caffe, Neon, NVIDIA CUDA Deep Neural Network library (cuDNN), and OpenCV) and distributed data processing frameworks (e.g. Hadoop (including HDFS, HBase, Hive, Impala, Giraph, Sqoop) and Spark (including MLlib, GraphX, SQL, and DataFrames))
  • Experience in processing, tagging, and indexing unstructured, semi-structured, and structured classified and unclassified data sets

Nice to have and desired qualifications:

  • PhD in Computer Science or related field
  • Demonstrated experience with the application of quantitative and qualitative analytic methods
  • Demonstrated knowledge of data transfer requirements for moving data between classified and unclassified computer systems.
  • Demonstrated ability to produce reports for senior DoD decision makers.
  • Detailed knowledge of NGIC, Army, DIA, NSA, CIA, and/or interagency operations

What We Can Offer You:

- We’ve been named a Best Place to Work by the Washington Post.

- Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives.

- We offer competitive benefits and learning and development opportunities.

- We are mission-oriented and ever vigilant in aligning our solutions with the nation’s highest priorities.

- For over 55 years, the principles of CACI’s unique, character-based culture have been the driving force behind our success.


CACI employs a diverse range of talent to create an environment that fuels innovation and fosters continuous improvement and success. At CACI, you will have the opportunity to make an immediate impact by providing information solutions and services in support of national security missions and government transformation for Intelligence, Defense, and Federal Civilian customers. CACI is proud to provide dynamic careers for employees worldwide. CACI is an Equal Opportunity Employer - Females/Minorities/Protected Veterans/Individuals with Disabilities.
