Benjamin Marie – TechToday

Fine-tune Google Gemma with Unsloth and Distilled DPO on Your Computer

By Benjamin Marie March 18, 2024AINo Comments

[ad_1] Following Hugging Face’s Zephyr recipeGenerated with DALL-EFinding good training hyperparameters for new LLMs is always difficult and time-consuming. With Zephyr Gemma 7B, Hugging Face seems to have found a…

Mixtral-8x7B: Understanding and Running the Sparse Mixture of Experts | by Benjamin Marie | Dec, 2023

By Benjamin Marie December 15, 2023AINo Comments

[ad_1] How to efficiently outperform GPT-3.5 and Llama 2 70BImage by 8385 from PixabayMost of the recent large language models (LLMs) use very similar neural architectures. For instance, the Falcon,…

Fine-tune Better Chat Models with Distilled Identity Preference Optimization (IPO)

By Benjamin Marie December 13, 2023AINo Comments

[ad_1] Mistral 7B aligned with IPOPhoto by Rishabh Dharmani on UnsplashTo become chat models, pre-trained large language models (LLMs) are fine-tuned on large datasets of instructions/questions paired with expected answers.…

QA-LoRA: Fine-Tune a Quantized Large Language Model on Your GPU

By Benjamin Marie October 14, 2023AINo Comments

[ad_1] Quantization-aware fine-tuningIllustration by the author — Made with images from Pixabay (1,2)State-of-the-art large language models (LLMs) are pre-trained with billions of parameters. While pre-trained LLMs can perform many tasks,…

Falcon 180B: Can It Run on Your Computer?

By Benjamin Marie September 12, 2023AINo Comments

[ad_1] There is also a chat version. The models are available on the Hugging Face hub:Falcon 180B is completely free and state-of-the-art. But it’s also a huge model.Can it run…

vLLM: PagedAttention for 24x Faster LLM Inference | by Benjamin Marie | Jun, 2023

By Benjamin Marie June 24, 2023AINo Comments

[ad_1] Almost all the large language models (LLM) rely on the Transformer neural architecture. While this architecture is praised for its efficiency, it has some well-known computational bottlenecks.During decoding, one…

GPT-3.5 Translates Paragraphs Better | by Benjamin Marie | May, 2023

By Benjamin Marie May 25, 2023AINo Comments

[ad_1] And outperforms Google Translate for the translation of literary worksImage from PixabayAccording to previous studies, GPT models perform as well as standard machine translation systems, e.g., Google Translate.These studies…

AI Won’t Replace Translators. But it can help them.

By Benjamin Marie March 31, 2023AINo Comments

[ad_1] OpinionIt’s 1960 all over againImage from PixabayIn a recent study, the University of Pennsylvania and OpenAI investigated the potential impact of large language models (LLM), such as GPT models,…

Traditional vs. Neural Metrics for Machine Translation Evaluation

By Benjamin Marie March 9, 2023AINo Comments

[ad_1] 100+ new metrics since 2010Image from PixabayAn evaluation with automatic metrics has the advantages to be faster, more reproducible, and cheaper than an evaluation conducted by humans.This is especially…

Data Preprocessing for Machine Translation

By Benjamin Marie February 25, 2023AINo Comments

[ad_1] Clean, normalize, and tokenizeImage from Pixabay.Data preprocessing is a critical step for any machine learning tasks. The data must be correct, clean, and in the expected format.In this blog…