AWS ML Blog

Streamline external access to Amazon SageMaker MLflow using a REST API proxy

May 28, 2026•10 min read•

Level:Intermediate

For:ML Engineers, Cloud Architects

✦TL;DR

We present a Flask-based REST API proxy that securely enables HTTPS access to Amazon SageMaker MLflow without requiring the MLflow SDK, allowing organizations to preserve existing ML workflows during cloud transformation. This proxy service utilizes the Amazon SageMaker API and can be easily integrated into existing infrastructure. The solution provides a flexible and scalable way to access MLflow, enabling seamless collaboration between teams and preserving existing ML pipelines. This approach also reduces the complexity associated with managing multiple SDKs and dependencies.

⚡ Key Takeaways

The proxy service uses Flask as the web framework and can be deployed on Amazon Elastic Container Service (ECS) or Amazon Elastic Beanstalk.
The solution utilizes the Amazon SageMaker API to interact with MLflow, eliminating the need for the MLflow SDK.
The proxy service supports HTTPS access, ensuring secure communication between clients and MLflow.
The proxy service can be easily integrated into existing infrastructure using a simple REST API.
WhyItMatters: This solution enables organizations undergoing cloud transformation to preserve their existing ML workflows while adopting a more cloud-agnostic approach to ML development, reducing the complexity and overhead associated with managing multiple SDKs and dependencies.
TechnicalLevel: Intermediate
TargetAudience: ML Engineers, Cloud Architects
PracticalSteps:
Create a new Flask application using the `flask` library and install the required dependencies.
Configure the Amazon SageMaker API credentials and endpoint URL in the Flask application.
Define the REST API endpoints for interacting with MLflow using the Amazon SageMaker API.
Deploy the Flask application on Amazon ECS or Amazon Elastic Beanstalk.
ToolsMentioned: Flask, Amazon SageMaker API, Amazon ECS, Amazon Elastic Beanstalk
Tags: DEPLOYMENT, AMAZON, INFERENCE

🔧 Tools & Libraries

FlaskAmazon SageMaker APIAmazon ECSAmazon Elastic Beanstalk

💡 Why It Matters

This solution enables organizations undergoing cloud transformation to preserve their existing ML workflows while adopting a more cloud-agnostic approach to ML development, reducing the complexity and overhead associated with managing multiple SDKs and dependencies.

✅ Practical Steps

Create a new Flask application using the `flask` library and install the required dependencies.
Configure the Amazon SageMaker API credentials and endpoint URL in the Flask application.
Define the REST API endpoints for interacting with MLflow using the Amazon SageMaker API.
Deploy the Flask application on Amazon ECS or Amazon Elastic Beanstalk.

Want the full story? Read the original article.

Read on AWS ML Blog ↗

Streamline external access to Amazon SageMaker MLflow using a REST API proxy

⚡ Key Takeaways

🔧 Tools & Libraries

✅ Practical Steps

More like this

Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality

RAG Is Burning Money — I Built a Cost Control Layer to Fix It

Pinterest cut AI costs 90% by gutting a frontier model's vision layer

Training Azerbaijani language models on Amazon SageMaker AI