SageMaker Hugging Face Inference Toolkit is an open-source library for serving Transformers models on Amazon SageMaker. This library provides default pre-processing, predict and postprocessing for certain Transformers models and tasks. It utilizes the SageMaker Inference Toolkit for starting up the model server, which is responsible for handling inference requests. For the Dockerfiles used for building SageMaker Hugging Face Containers, see AWS Deep Learning Containers. The SageMaker Hugging Face Inference Toolkit implements various additional environment variables to simplify your deployment experience. The Hugging Face Inference Toolkit allows user to override the default methods of the HuggingFaceHandlerService. SageMaker Hugging Face Inference Toolkit is licensed under the Apache 2.0 License.

Features

  • Create a Amazon SageMaker endpoint with a model from the Hub
  • Create a Amazon SageMaker endpoint with a trained model
  • The HF_TASK environment variable defines the task for the used Transformers pipeline
  • The HF_MODEL_ID environment variable defines the model id, which will be automatically loaded
  • The HF_API_TOKEN environment variable defines the your Hugging Face authorization token
  • User defined code/modules

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow SageMaker Hugging Face Inference Toolkit

SageMaker Hugging Face Inference Toolkit Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of SageMaker Hugging Face Inference Toolkit!

Additional Project Details

Programming Language

Python

Related Categories

Python UML Tool, Python Libraries, Python Deep Learning Frameworks, Python LLM Inference Tool

Registered

2022-07-07