Optimize TensorFlow Models For Deployment with TensorRT

4.6 stars (59 ratings)

3,859 already enrolled

In this Free Guided Project, you will:
1.5 hours
Intermediate
No download needed
Split-screen video
English
Desktop only

This is a hands-on guided project on optimizing your TensorFlow models for inference with NVIDIA's TensorRT. By the end of this 1.5-hour project, you will be able to optimize TensorFlow models using the TensorFlow integration of NVIDIA's TensorRT (TF-TRT), use TF-TRT to optimize several deep learning models at FP32, FP16, and INT8 precision, and observe how tuning TF-TRT parameters affects performance and inference throughput.

Prerequisites: To complete this project successfully, you should be competent in Python programming, understand deep learning and what inference is, and have experience building deep learning models in TensorFlow and its Keras API.

Note: This course works best for learners based in North America. We’re currently working on providing the same experience in other regions.
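For orientation, the TF-TRT workflow named in the description (convert a model at a chosen precision, then compare inference throughput) might look roughly like the sketch below. It is not taken from the course material. It assumes a TensorFlow 2.x GPU build that ships TF-TRT and uses the `conversion_params` style of `TrtGraphConverterV2`; newer releases also accept `precision_mode` directly. The helper names `convert_with_tf_trt` and `throughput`, and all paths, are hypothetical.

```python
import time

PRECISION_MODES = ("FP32", "FP16", "INT8")


def convert_with_tf_trt(saved_model_dir, output_dir, precision="FP16",
                        calibration_input_fn=None):
    """Convert a SavedModel with TF-TRT at the requested precision (sketch)."""
    precision = precision.upper()
    if precision not in PRECISION_MODES:
        raise ValueError(f"precision must be one of {PRECISION_MODES}")
    if precision == "INT8" and calibration_input_fn is None:
        # INT8 needs representative input batches to calibrate value ranges.
        raise ValueError("INT8 conversion needs a calibration_input_fn")

    # Imported lazily: requires a GPU build of TensorFlow with TensorRT support.
    from tensorflow.python.compiler.tensorrt import trt_convert as trt

    params = trt.DEFAULT_TRT_CONVERSION_PARAMS._replace(precision_mode=precision)
    converter = trt.TrtGraphConverterV2(
        input_saved_model_dir=saved_model_dir,
        conversion_params=params,
    )
    if precision == "INT8":
        converter.convert(calibration_input_fn=calibration_input_fn)
    else:
        converter.convert()
    converter.save(output_dir)
    return output_dir


def throughput(infer, batches, batch_size, warmup=1):
    """Return items per second for `infer` called once per batch."""
    batches = list(batches)
    for batch in batches[:warmup]:
        infer(batch)  # warm-up calls are excluded from the timing
    start = time.perf_counter()
    for batch in batches:
        infer(batch)
    elapsed = max(time.perf_counter() - start, 1e-9)
    return len(batches) * batch_size / elapsed
```

Running `throughput` on the original model and on each converted model, with the same batches, gives a simple like-for-like comparison across FP32, FP16, and INT8.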

Skills you will develop

  • Deep Learning

  • NVIDIA TensorRT (TF-TRT)

  • Python Programming

  • TensorFlow

  • Keras

Learn step-by-step

In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:

How Guided Projects work

Your workspace is a cloud desktop right in your browser, no download required

In a split-screen video, your instructor guides you step-by-step
