13.11.2024 Deploying Open-Source Large Language Models: A Performance Analysis « Since the release of ChatGPT in November 2022, large language models (LLMs) have seen considerable success, including in the open-source… hal.science Continuer la lecture
03.10.2024 BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training « Language models can largely benefit from efficient tokenization. However, they still mostly utilize the classical BPE algorithm, a simple and… arxiv.org Continuer la lecture