Model Serving in PyTorch | PyTorch
May 08, 2019

When people talk about taking a model “to production,” they usually mean performing inference, sometimes called model evaluation, prediction, or serving. At the level of a function call, inference in PyTorch looks something like this:

In Python: module(input)
In traced modules: module(input)
In C++
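
To make the Python and traced-module calls concrete, here is a minimal sketch. The `TinyNet` model, its layer sizes, and the random input batch are hypothetical stand-ins, not taken from the original post; the only point is that the eager module and the traced module are invoked the same way. The C++ call is left out because the excerpt does not show it.

```python
import torch
import torch.nn as nn


# Hypothetical toy model standing in for whatever model is being served.
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 2)

    def forward(self, x):
        return torch.relu(self.linear(x))


torch.manual_seed(0)
module = TinyNet()
module.eval()  # disable training-only behavior such as dropout

inputs = torch.randn(3, 4)  # assumed batch of 3 examples with 4 features each

# In Python: call the module directly. no_grad() skips autograd bookkeeping,
# which inference does not need.
with torch.no_grad():
    eager_output = module(inputs)

# In traced modules: torch.jit.trace records the operations executed on an
# example input and returns a module that is called the same way.
traced_module = torch.jit.trace(module, inputs)
with torch.no_grad():
    traced_output = traced_module(inputs)

print(eager_output.shape)
print(torch.allclose(eager_output, traced_output))
```

Tracing only records the operations that ran for the example input, so a model with data-dependent control flow may need a different export path; this sketch assumes a model without such branches.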