Skip to content

mrintern/data-engineering-assessment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Engineering Assessment

info

This project uses dbt + postgres to create some insightful tables from raw data (csv's). There is also an airflow component, which makes a call to the open-meteo api to integrate weather data into the final reporting table.

running the project

  • python -m venv .venv
  • source .venv/bin/activate
  • pip install -r requirements.txt
  • navigate to cat_corp dir
  • dbt init
  • dbt seed
  • dbt run
  • dbt test

optional: integrate weather data using airflow

Please note, airflow has a seperate venv because the dependencies conflict with the rest of the project.

  • navigate to airflow dir
  • python -m venv .venv
  • source .venv/bin/activate
  • pip install -r requirements.txt
  • export AIRFLOW_HOME=$(pwd)
  • airflow db init
  • airflow-standalone

api-data from the dag is stored in outlet_weather_data.csv, and an example from a previous dag run is currently stored in the seeds dir for convenience.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages