* HuggingFace: https://github.com/huggingface/transformers, this is more for fine-tuning pre-trained transformers for NLP or training them from scratch.
They also have a tokenization library https://github.com/huggingface/tokenizers, which implements Byte-Pair encoding (see above) among other methods.