+
+

Related Products

  • Vertex AI
    961 Ratings
    Visit Website
  • LM-Kit.NET
    26 Ratings
    Visit Website
  • RunPod
    205 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • Google Cloud Speech-to-Text
    355 Ratings
    Visit Website
  • Google Cloud Platform
    60,586 Ratings
    Visit Website
  • PBRS Power BI Reports Distribution
    12 Ratings
    Visit Website
  • Synchredible
    13 Ratings
    Visit Website
  • ManageEngine OpManager
    1,660 Ratings
    Visit Website
  • Azore CFD
    24 Ratings
    Visit Website

About

This is a model quantization tool for convolution neural networks(CNN). This tool could quantize both weights/biases and activations from 32-bit floating-point (FP32) format to 8-bit integer(INT8) format or any other bit depths. With this tool, you can boost the inference performance and efficiency significantly, while maintaining the accuracy. This tool supports common layer types in neural networks, including convolution, pooling, fully-connected, batch normalization and so on. The quantization tool does not need the retraining of the network or labeled datasets, only one batch of pictures are needed. The process time ranges from a few seconds to several minutes depending on the size of neural network, which makes rapid model update possible. This tool is collaborative optimized for DeePhi DPU and could generate INT8 format model files required by DNNC.

About

Luminal is a machine-learning framework built for speed, simplicity, and composability, focusing on static graphs and compiler-based optimization to deliver high performance even for complex neural networks. It compiles models into minimal “primops” (only 12 primitive operations) and then applies compiler passes to replace those with device-specific optimized kernels, enabling efficient execution on GPU or other backends. It supports modules (building blocks of networks with a standard forward API) and the GraphTensor interface (typed tensors and graphs at compile time) for model definition and execution. Luminal’s core remains intentionally small and hackable, with extensibility via external compilers for datatypes, devices, training, quantization, and more. Quick-start guidance shows how to clone the repo, build a “Hello World” example, or run a larger model like LLaMA 3 using GPU features.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Anyone searching for a neural network solution

Audience

ML infrastructure engineers and researchers seeking a tool offering a deployment framework for GPUs or heterogeneous devices

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

$0.90 per hour
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

DeePhi Quantization Tool
aws.amazon.com/marketplace/pp/prodview-bwtx6kzwg3gva

Company Information

Luminal
United States
luminalai.com

Alternatives

Alternatives

Deci

Deci

Deci AI
Deci

Deci

Deci AI

Categories

Categories

Integrations

Hugging Face
Llama 3

Integrations

Hugging Face
Llama 3
Claim DeePhi Quantization Tool and update features and information
Claim DeePhi Quantization Tool and update features and information
Claim Luminal and update features and information
Claim Luminal and update features and information