The goal of `final_project` is to deploy a fully functional prediction API accessible to end users.
You will:
- Utilize `Poetry` to define your application dependencies
- Package up an existing NLP model (DistilBERT) from `HuggingFace` for running efficient CPU-based sentiment analysis
- Create a `FastAPI` application to serve prediction results from user requests
- Test your application with `pytest`
- Utilize `Docker` to package your application as a logical unit of compute
- Cache results with `Redis` to protect your endpoint from abuse (a minimal caching sketch follows this list)
- Deploy your application to `Azure` with `Kubernetes`
- Use `K6` to load test your application
- Use `Grafana` to visualize and understand the dynamics of your system
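For the `Redis` item above, here is a minimal caching sketch. The connection settings, key scheme, and `run_model` helper are illustrative assumptions, not part of the assignment; adapt them to your own application.

```python
import hashlib
import json

import redis

# Connection settings are assumptions; match them to your Redis deployment.
cache = redis.Redis(host="localhost", port=6379)


def cached_predict(payload: dict) -> dict:
    """Return a cached prediction if this exact payload was seen before."""
    # Key the cache on a hash of the canonicalized request body.
    key = hashlib.sha256(json.dumps(payload, sort_keys=True).encode()).hexdigest()
    hit = cache.get(key)
    if hit is not None:
        return json.loads(hit)
    result = run_model(payload)  # hypothetical: your actual inference call
    cache.set(key, json.dumps(result), ex=3600)  # expire entries after an hour
    return result
```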
- Write pydantic models to match the specified input model (a sketch of matching models follows this list):

  ```json
  {"text": ["example 1", "example 2"]}
  ```

- Write pydantic models to match the specified output model:

  ```json
  {
    "predictions": [
      [
        {"label": "POSITIVE", "score": 0.7127904295921326},
        {"label": "NEGATIVE", "score": 0.2872096002101898}
      ],
      [
        {"label": "POSITIVE", "score": 0.7186233401298523},
        {"label": "NEGATIVE", "score": 0.2813767194747925}
      ]
    ]
  }
  ```
- Pull the following model locally to allow for loading into your application. Put it at the root of your project directory for an easier time.
- Add the model files to your `.gitignore`, since the files are large and we don't want to manage `git-lfs` and incur cost for wasted space. `HuggingFace` is hosting the model for us.
- Create and execute `pytest` tests to ensure your application is working as intended
- Build and deploy your application locally (Hint: use `kustomize`)
- Push your image to `ACR`. Use a prefix based on your namespace, and call the image `project`.
- Deploy your application to `AKS` similar to labs 4/5
- Run `k6` against your application with the provided `load.js`
- Capture screenshots of your `grafana` dashboard for your service/workload during the execution of your `k6` script
- Feel extremely proud of all the learning you have done across this code and these files, and of how it will help you develop professionally and deploy an API effectively at work. There is much to learn, but getting the fundamentals right is key.
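As a starting point for the pydantic items in the list above, here is one possible sketch of models matching the input and output shapes shown. The class names are placeholders of our own; `test_mlapi.py` remains the source of truth for what is expected.

```python
from pydantic import BaseModel

# Note: list[str] annotations require Python 3.9+; use typing.List on older versions.


class SentimentRequest(BaseModel):
    """Matches {"text": ["example 1", "example 2"]}. Name is a placeholder."""

    text: list[str]


class Prediction(BaseModel):
    """One {"label": ..., "score": ...} entry. Name is a placeholder."""

    label: str
    score: float


class SentimentResponse(BaseModel):
    """Matches {"predictions": [[...], [...]]}. Name is a placeholder."""

    predictions: list[list[Prediction]]
```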
Please review `train.py` to see how the model was trained and pushed to `HuggingFace`, which serves as an artifact store for models and their associated configuration. This model took 5 minutes to transfer learn on 2x A4000 GPUs with a 256 batch size, taking 15 GB of memory on each GPU. Training on CPUs would likely have taken several days. The given implementation allows for a maximum text sequence length of 512 tokens per input. Do not try to run the training script.
Model loading examples are provided in `example.py`. In that file we load the model directly from `HuggingFace`; however, this is extremely inefficient for a production environment given the size of the underlying model (256 MB). We will pull the model down locally as part of our build process.
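As one sketch of what the local-loading half looks like, assuming you have already pulled the model down (the directory name below is a placeholder for wherever you put it):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Placeholder path: point this at the model directory you pulled down
# to the root of your project.
MODEL_PATH = "./distilbert-model"

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_PATH)
```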
Model prediction pipelines are included in the `transformers` API provided by `HuggingFace`, which dramatically reduces the complexity of the inferencing application. An example is provided in `mlapi/example.py` and is already instrumented in your `main.py` application.
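Here is a hedged sketch of that pipeline usage, reusing the placeholder `MODEL_PATH` from above. Note that whether you pass `top_k=None` or the older `return_all_scores=True` to get scores for both labels depends on your `transformers` version.

```python
from transformers import pipeline

# top_k=None returns scores for every label, matching the output model above;
# older transformers releases use return_all_scores=True instead.
classifier = pipeline(
    "text-classification",
    model=MODEL_PATH,  # placeholder path from the loading sketch above
    top_k=None,
    device=-1,  # -1 pins the pipeline to CPU
)

predictions = classifier(["example 1", "example 2"])
# -> [[{'label': 'POSITIVE', 'score': ...}, {'label': 'NEGATIVE', 'score': ...}], ...]
```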
We provide you a pytest file, `test_mlapi.py`, which encodes the structure your pydantic models should follow. You will have to do a bit of reverse engineering so that your models match our expectations.
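For a feel of the shape such tests can take before you open `test_mlapi.py`, here is a sketch using FastAPI's `TestClient`; the `/predict` path and the `mlapi.main` module layout are assumptions you should adjust to your project.

```python
from fastapi.testclient import TestClient

from mlapi.main import app  # assumption: your FastAPI app object lives here

client = TestClient(app)


def test_predict_scores_every_input():
    # /predict is an assumed route name; use whatever your app exposes.
    response = client.post("/predict", json={"text": ["I love this!", "I hate this."]})
    assert response.status_code == 200
    body = response.json()
    # Expect one list of {label, score} entries per input text.
    assert len(body["predictions"]) == 2
    for scores in body["predictions"]:
        assert {entry["label"] for entry in scores} == {"POSITIVE", "NEGATIVE"}
```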
Do not run `poetry update`; it will take a long time due to the handling of `torch` dependencies. Do a `poetry install` instead.
You might need to install `git lfs`: https://git-lfs.github.com/