.Page
About
Blog
Pills
Software
Trainings
Seminar
⚲
Search results
Large Language Models
Research feed
Blog
AutoDev: Exploring Custom LLM-Based Coding Assistance Functions
We explore the potential of custom code assistant functions based on large language models (LLMs). With our open-source software package …
Software
AutoDev: LLM-Based Coding Assistance Functions
AutoDev is a software package for the realisation of coding assistance functions using large language models (LLMs). It covers fine-tuning, …
Pill
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
The process of pre-training large language models can incur significant expenses. As these models continue to grow in size, there is an …
Pill
Reasoning Traces as Learning Signal
An important feature of large language models is their ability to provide detailed responses that resemble “thinking step by …
Pill
Direct Preference Optimization
With direct preference optimization (DPO), a language model can be aligned with human preferences without using reinforcement learning, …
Pill
Augmented Language Models: a survey
A survey of recent advances in augmenting (large) language models with new capabilities such as reasoning, tool use, and more. While the …
Pill
DictBERT: Dictionary Description Knowledge Enhanced Language Model Pre-training via Contrastive Learning
External knowledge can be injected into pre-trained language models (here specifically BERT) by training an additional language model on …
Other series in
Advances and fundamentals in ML
Simulation and AI
AI techniques are fundamentally transforming the field of simulation by combining physics-based modeling with data-driven machine learning.
Diffusion Models
Diffusion models (DM) have become the state of the art for sample quality in generative modelling. They work by sequentially corrupting …
Geometric deep learning
Specialized deep learning architectures exploit the intrinsic regularities arising from the underlying structure of the physical world. …
Check all
of our work