aboutsummaryrefslogtreecommitdiffstats
path: root/README.md
diff options
context:
space:
mode:
authorAlex <git@ajschof.me>2024-08-16 15:37:46 +0100
committerGitHub <noreply@github.com>2024-08-16 15:37:46 +0100
commit9d189d7c8c4fae09ece780521ab9c10935be8aaf (patch)
tree03938087ec6c91a744ea54eae056dea0eece549e /README.md
parente153f2072eafca2c83a84e2c4210c46a40dabaf4 (diff)
parenteb3353ea0df1eee2df2c6a9a9efad121e56e1790 (diff)
downloadde-project-bentley-9d189d7c8c4fae09ece780521ab9c10935be8aaf.tar.gz
de-project-bentley-9d189d7c8c4fae09ece780521ab9c10935be8aaf.zip
Merge branch 'development' into extract_lambda_refactoring
Diffstat (limited to 'README.md')
-rw-r--r--README.md44
1 files changed, 43 insertions, 1 deletions
diff --git a/README.md b/README.md
index 8ae0cb3..6bc75dc 100644
--- a/README.md
+++ b/README.md
@@ -1 +1,43 @@
-# de-project-bentley \ No newline at end of file
+# ToteSys - Data Engineering Project
+
+# Summary
+The project aims to implement a data platform that can extract data from an
+operational database, archive it in a data lake, and make it easily accessible
+within a remodelled OLAP data warehouse.
+
+The solution showcases our skills in:
+
+- Python
+- PostgreSQL
+- Database modelling
+- Amazon Web Services (AWS)
+- Agile methodologies
+
+# Main Objective
+
+Our goal is to create a reliable ETL (Extract, Transform, Load) pipeline that
+can:
+
+1. Extract the data from the `totesys` operational database
+2. Store the data in AWS S3 buckets, that will form our data lake
+3. Transform the data into a suitable schema for the data warehouse
+4. Load the transformed data into the data warehouse hosted on AWS
+
+# Key Features
+
+We aim for the project to have certain features. Some are more prioritised than
+others.
+
+- [ ] Automated data ingestion from `totesys` db
+- [ ] Data storage for ingested and processed data in S3 buckets
+- [ ] Data transformation for data warehouse schema
+- [ ] Automated data loading into the data warehouse schema
+- [ ] Logging and monitoring with CloudWatch
+- [ ] Notifications for errors and successful runs (e.g. successful ingestion)
+- [ ] Visualisation of warehouse data
+
+# Test Coverage
+TBA
+
+# Contributors
+TBA \ No newline at end of file
git.ajschof.me — hosted by ajschofield — powered by cgit