This project was created to build a real-time data architecture based on AWS services. It was realized with the following technologies:

- JavaScript
- TypeScript
- Node
- Python
- Go
- LocalStack
To run the project locally, you will need to have Docker and Docker Compose installed.
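You can verify that both are available with (use `docker-compose --version` instead if you still have the standalone v1 binary):

$ docker --version
$ docker compose version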
There are two ways to run the project locally: one is to run it with the help of `docker-compose`, and the other is to run it manually. Because this project uses LocalStack to mock AWS services, you will need a LocalStack Pro account, since this architecture uses Aurora, Glue, and API Gateway v2.
Add a `.env` file to the root of the project with the following contents:
LOCALSTACK_API_KEY=XXX
A prerequisite to running the project with `docker-compose` is that you do not have the `localstack` CLI installed.
To run the project with `docker-compose`, you can run the following command:
$ ./localstack.sh
This will start the LocalStack infrastructure and run the main file, which creates all the necessary services and connections to achieve the desired architecture. Besides that, it also builds the Lambda functions and deploys them to LocalStack.
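To confirm the infrastructure is up, you can query LocalStack's health endpoint (located at `/_localstack/health` on recent versions, `/health` on older ones):

$ curl http://localhost:4566/_localstack/health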
You can also run the project with `docker compose up --build`, but this will not remove a preexisting LocalStack container and might not work as expected.
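If a stale container is causing problems, you can tear the Compose services down first and then rerun the script:

$ docker compose down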
A prerequisite to running the project manually is that you do have the `localstack` CLI, Node, pnpm, Python, and Go installed.
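You can quickly verify that the toolchain is available with:

$ localstack --version
$ node --version && pnpm --version
$ python3 --version
$ go version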
To run the project manually, you can run the following commands:
$ localstack start
This will start the LocalStack infrastructure. After that, you can run the following commands to create the necessary services and connections to achieve the desired architecture:
$ cd services/kinesis_data_forwarder && pnpm i && pnpm build && cd ../..
$ cd services/dynamo_getter && ./build.sh && cd ../..
$ cd services/preprocessing && ./build.sh && cd ../..
$ go run main.go
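At this point, you can sanity-check that the Lambda functions were deployed (AWS CLI usage against LocalStack is covered in more detail below):

$ aws --endpoint-url=http://localhost:4566 lambda list-functions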
To check if the architecture is working as expected, you can run the simulation and test the functionality with some smaller scripts. To run the simulation, you can run the following command:
$ python3 simulation/simulate_data.py
This will start the simulation and send data to the `kinesisDataForwarder` Lambda function through the API Gateway over WebSocket, which forwards the data to Kinesis. After that, the `Preprocessing` Lambda function gets triggered by the Kinesis stream and processes the data. The processed data is then stored in DynamoDB and S3 (in a bucket named `raw-data`, as CSV files). The `DynamoGetter` Lambda function can then be triggered by a GET request through the API Gateway and returns the specific data based on the `id` querystring parameter.
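As a rough sketch, such a GET request could look like the following; the API ID, route, and id value below are placeholders that depend on your deployment (LocalStack exposes API Gateway endpoints under the `<api-id>.execute-api.localhost.localstack.cloud` host pattern):

# <api-id>, <route>, and <some-id> are placeholders, not actual values from this project
$ curl "http://<api-id>.execute-api.localhost.localstack.cloud:4566/<route>?id=<some-id>"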
For data processing for machine learning tasks, the data can be processed into Aurora and a separate S3 bucket (named `transformed-data`, stored as CSV files) by a Glue job. The Glue job is already deployed but still needs to be run manually. To interact with the Glue job, you can run the following command:
$ python3 services/glue/job.py [start|stop|logs]
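For example, to start the job and then follow its logs:

$ python3 services/glue/job.py start
$ python3 services/glue/job.py logs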
To check the correct insertion of the data, you can use the following scripts to inspect the data in DynamoDB or Aurora:
$ ./scripts/check-aurora-sql-data.sh
$ ./scripts/check-dynamodb-data.sh
If you would like to use the AWS CLI to interact with the running LocalStack instance, you can do so by running the following command:
$ aws --endpoint-url=http://localhost:4566 <command>
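For example, to list the S3 buckets created by the project (you should see `raw-data` and `transformed-data` among them):

$ aws --endpoint-url=http://localhost:4566 s3 ls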
It is also important to note that you will need to configure your region to `us-east-1` for local testing.
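If your default AWS profile uses a different region, you can set it explicitly:

$ aws configure set region us-east-1

Alternatively, pass `--region us-east-1` on each individual call.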