[ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and vision-language capabilities
Time-series prediction using a decoder-only Transformer, including SwiGLU and RoPE (Rotary Positional Embedding).
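As a rough illustration of those two components, here is a minimal PyTorch sketch (not taken from this repository; the dimensions, names, and the rotate-half RoPE variant are assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwiGLU(nn.Module):
    """Gated feed-forward block: w_down(SiLU(w_gate(x)) * w_up(x))."""
    def __init__(self, dim: int, hidden: int):
        super().__init__()
        self.w_gate = nn.Linear(dim, hidden, bias=False)
        self.w_up = nn.Linear(dim, hidden, bias=False)
        self.w_down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.w_down(F.silu(self.w_gate(x)) * self.w_up(x))

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotate channel pairs by position-dependent angles (rotate-half RoPE)."""
    seq_len, dim = x.shape[-2], x.shape[-1]
    half = dim // 2
    freqs = base ** (-torch.arange(half, dtype=torch.float32) / half)  # base^(-2i/dim)
    angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * freqs[None, :]
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)
```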
🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
ViAG: A novel framework for fine-tuning answer-generation models utilizing encoder-decoder and decoder-only Transformer architectures.
A from-scratch implementation of a scaled-down GPT-2 model in PyTorch, trained on the Snappfood dataset for sentiment-controlled Persian text generation.
Clean-room GPT-2/GPT-3 implementation: tokenizers, architecture blocks, training loop with AdamW + cosine decay, CLI scripts, inference tools, and pytest suite. Covers OpenWebText-10k & WikiText-103 workflows. Designed as an academic reference for understanding and scaling decoder-only transformers.
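For readers unfamiliar with that optimizer setup, here is a minimal sketch of AdamW with linear warmup and cosine decay in PyTorch (the model, step counts, and hyperparameters are placeholder assumptions, not this repo's configuration):

```python
import math
import torch

model = torch.nn.Linear(128, 128)  # stand-in for the transformer
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4, weight_decay=0.1)
warmup_steps, total_steps = 100, 1_000

def lr_lambda(step: int) -> float:
    # Linear warmup, then cosine decay toward zero.
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * (1.0 + math.cos(math.pi * progress))

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)

for step in range(total_steps):
    loss = model(torch.randn(8, 128)).pow(2).mean()  # stand-in for the LM loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    scheduler.step()
```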
This study examines the effectiveness of transformer-based models for financial time series forecasting, focusing on log returns derived from daily closing prices of the DAX40 index. We propose the PatternDecoder, a decoder-only transformer model designed for immediate-term financial time series forecasting.
This repository contains the implementation and experiments for comparing gradual growth methods, specifically the G_stack approach, with naive models trained from scratch. The project focuses on addressing catastrophic forgetting and improving model performance in continuous learning scenarios.
Building a Transformer model from scratch, with variations such as Multi-Head Attention and Grouped Query Attention, on books by Machado de Assis.
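Grouped Query Attention lets several query heads share each key/value head, shrinking the KV cache; a minimal PyTorch sketch (the shapes and head counts are illustrative assumptions, not this project's code):

```python
import torch
import torch.nn.functional as F

def grouped_query_attention(q, k, v):
    # q: (batch, n_q_heads, seq, head_dim); k, v: (batch, n_kv_heads, seq, head_dim)
    group = q.shape[1] // k.shape[1]
    k = k.repeat_interleave(group, dim=1)  # each kv head serves `group` query heads
    v = v.repeat_interleave(group, dim=1)
    return F.scaled_dot_product_attention(q, k, v, is_causal=True)

q = torch.randn(1, 8, 16, 32)  # 8 query heads
k = torch.randn(1, 2, 16, 32)  # 2 shared key/value heads
out = grouped_query_attention(q, k, torch.randn(1, 2, 16, 32))  # (1, 8, 16, 32)
```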
Decoder-only transformer, simplest character-level tokenization, training and text generation.
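The "simplest character-level tokenization" amounts to mapping each distinct character to an integer id; a minimal sketch (the toy corpus is an assumption):

```python
text = "hello decoder"                          # toy corpus
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}    # char -> id
itos = {i: ch for ch, i in stoi.items()}        # id -> char

encode = lambda s: [stoi[c] for c in s]
decode = lambda ids: "".join(itos[i] for i in ids)

assert decode(encode("hello")) == "hello"
```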
Autoregressive text-generation application using a decoder-only transformer.
in dev ...
A compact, readable GPT-style decoder-only Transformer implemented in pure PyTorch. The goal is to expose the essential architectural pieces with minimal scaffolding so you can train and tinker quickly.
A decoder-only approach to image reconstruction, inspired by adversarial machine learning, implemented in Keras/TensorFlow 2.
A compilation of exercises for learning how to implement a transformer model.
A mini version of GPT trained on the Shakespeare corpus using BPE tokenization.
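BPE builds a subword vocabulary by repeatedly merging the most frequent adjacent pair of symbols; a minimal sketch of the merge loop (the toy corpus and merge count are assumptions):

```python
from collections import Counter

def bpe_merges(tokens: list[str], num_merges: int) -> list[tuple[str, str]]:
    """Return the learned merge rules, most frequent pair first."""
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]
        merges.append((a, b))
        out, i = [], 0
        while i < len(tokens):  # apply the new merge across the corpus
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == (a, b):
                out.append(a + b)
                i += 2
            else:
                out.append(tokens[i])
                i += 1
        tokens = out
    return merges

print(bpe_merges(list("to be or not to be"), num_merges=3))
```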
Decoder-only transformer model for answering short questions using causal self-attention.
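Causal self-attention masks out future positions so each token attends only to itself and earlier tokens; a minimal single-head sketch (the shapes and the fused qkv projection are illustrative assumptions):

```python
import math
import torch

def causal_self_attention(x: torch.Tensor, w_qkv: torch.Tensor) -> torch.Tensor:
    # x: (seq, dim); w_qkv: (dim, 3 * dim), one fused projection for q, k, v
    q, k, v = (x @ w_qkv).chunk(3, dim=-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.shape[-1])
    mask = torch.tril(torch.ones(x.shape[0], x.shape[0], dtype=torch.bool))
    scores = scores.masked_fill(~mask, float("-inf"))  # hide future tokens
    return scores.softmax(dim=-1) @ v

out = causal_self_attention(torch.randn(16, 64), torch.randn(64, 192))  # (16, 64)
```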