torchtext — torchtext 0.11.0 documentation
https://pytorch.org/texttorchtext. This library is part of the PyTorch project. PyTorch is an open source machine learning framework. Features described in this documentation are classified by release status: Stable: These features will be maintained long-term and there should generally be no major performance limitations or gaps in documentation.
Basic NLP with PyTorch Text
dzlab.github.io › pytorch › basic-nlp-pytorch-textJan 06, 2020 · PyTorch Text is a PyTorch package with a collection of text data processing utilities, it enables to do basic NLP tasks within PyTorch. It provides the following capabilities: Defining a text preprocessing pipeline: tokenization, lowecasting, etc. Building Batches and Datasets, and spliting them into (train, validation, test)
torchtext — torchtext 0.11.0 documentation
pytorch.org › texttorchtext. This library is part of the PyTorch project. PyTorch is an open source machine learning framework. Features described in this documentation are classified by release status: Stable: These features will be maintained long-term and there should generally be no major performance limitations or gaps in documentation.
torchtext.data.utils — torchtext 0.11.0 documentation
pytorch.org › text › stabletorchtext.data.utils.get_tokenizer(tokenizer, language='en') [source] Generate tokenizer function for a string sentence. Parameters. tokenizer – the name of tokenizer function. If None, it returns split () function, which splits the string sentence by space. If basic_english, it returns _basic_english_normalize () function, which normalize ...