Convert HuggingFace model code and pretrained checkpoints to a PaddlePaddle-supported format. The converted model families are listed below:
ID | Family | Converted Checkpoints | Article
---|---|---|---
1 | GPT2 | gpt2 | Language Models are Unsupervised Multitask Learners |
2 | GPT-Neo | EleutherAI/gpt-neo-125m | GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow
3 | OPT | facebook/opt-125m | OPT: Open Pre-trained Transformer Language Models |
4 | BLOOM | YeungNLP/bloom-396m-zh | BLOOM: A 176B-Parameter Open-Access Multilingual Language Model |
5 | LLaMA | TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T | LLaMA: Open and Efficient Foundation Language Models
6 | DITTO | Finetuned | Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation |
7 | ScaleGrad | Finetuned | Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation |
8 | SimCTG | Finetuned | A Contrastive Framework for Neural Text Generation |
9 | Unlikelihood-Token | Finetuned | Neural Text Generation with Unlikelihood Training |
10 | Unlikelihood-Seq | Finetuned | Neural Text Generation with Unlikelihood Training |
11 | Qwen-1.5 | Qwen/Qwen1.5-0.5B | Qwen Technical Report |
12 | GPT-SW3 | AI-Sweden-Models/gpt-sw3-126m | GPT-SW3: An Autoregressive Language Model for the Nordic Languages |
13 | Galactica | facebook/galactica-125m | Galactica: A Large Language Model for Science |
14 | DeepSeek LLM | deepseek-ai/deepseek-coder-1.3b-base | DeepSeek LLM: Scaling Open-Source Language Models with Longtermism |
15 | InternLM2 | internlm/internlm2-1_8b | InternLM - GitHub Repo |
16 | Pythia | EleutherAI/pythia-70m | Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling |
17 | Phi-1.5 | microsoft/phi-1_5 | Textbooks Are All You Need II: phi-1.5 technical report |
To convert a checkpoint, first download it from the HuggingFace Hub:

```bash
huggingface-cli download --resume-download PRETRAINED_MODEL_NAME --cache-dir CACHE_DIR
```
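For example, to fetch the GPT-2 weights from row 1 of the table into a local cache directory of your choice:

```bash
huggingface-cli download --resume-download gpt2 --cache-dir ./hf_cache
```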
Next, run the transform script that matches the model family (one transform_xxx.py per family):

```bash
python transform_checkpoint/transform_xxx.py --hf-repo CACHE_DIR --pd-repo TARGET_DIR
```
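The transform scripts all follow the same pattern: load the PyTorch state dict, rename keys to the PaddlePaddle layout, and transpose `nn.Linear` weights (PyTorch stores them as `[out_features, in_features]`, Paddle as `[in_features, out_features]`). Below is a minimal sketch of that pattern, assuming a `pytorch_model.bin`-style checkpoint; the key map and layer names are hypothetical placeholders, not the repo's actual tables:

```python
import paddle
import torch

# Hypothetical key map; each real transform_xxx.py defines one per model family.
KEY_MAP = {
    "transformer.wte.weight": "embeddings.word_embeddings.weight",
}
# Hypothetical names of torch.nn.Linear weights, which need a transpose:
# torch stores [out_features, in_features], paddle.nn.Linear [in, out].
LINEAR_WEIGHT_SUFFIXES = ("attn.out_proj.weight", "mlp.fc_in.weight")


def transform(hf_ckpt: str, pd_ckpt: str) -> None:
    hf_state = torch.load(hf_ckpt, map_location="cpu")
    pd_state = {}
    for name, tensor in hf_state.items():
        array = tensor.to(torch.float32).numpy()
        if name.endswith(LINEAR_WEIGHT_SUFFIXES):
            array = array.T  # transpose into paddle's Linear layout
        pd_state[KEY_MAP.get(name, name)] = paddle.to_tensor(array)
    paddle.save(pd_state, pd_ckpt)
```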
Then check that the converted weights match the originals:

```bash
python check_correctness.py --hf-repo CACHE_DIR --pd-repo TARGET_DIR
```
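The bundled script has the authoritative check (it may compare forward logits); a runnable parameter-level sanity check, assuming key names were preserved and only Linear weights changed shape, could look like:

```python
import numpy as np
import paddle
import torch


def check(hf_ckpt: str, pd_ckpt: str, atol: float = 1e-5) -> None:
    hf_state = torch.load(hf_ckpt, map_location="cpu")
    pd_state = paddle.load(pd_ckpt)
    assert len(hf_state) == len(pd_state), "parameter count differs"
    for name, tensor in hf_state.items():
        hf_arr = tensor.to(torch.float32).numpy()
        pd_arr = pd_state[name].numpy()  # assumes key names were preserved
        if hf_arr.shape != pd_arr.shape:
            pd_arr = pd_arr.T  # undo the Linear-weight transpose before comparing
        np.testing.assert_allclose(hf_arr, pd_arr, atol=atol, err_msg=name)
    print("all parameters match")
```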
Generate the training-side configuration from the source HuggingFace config:

```bash
python generate_config.py --src-config PATH_TO_SRC_CONFIG --model-name MODEL_NAME --tgt-dir TARGET_DIR
```
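Conceptually this step just translates `config.json` field names between the two frameworks. A minimal sketch, with a hypothetical field map (the real generate_config.py holds each family's actual mapping):

```python
import json
from pathlib import Path

# Hypothetical field map; generate_config.py knows the real per-family names.
FIELD_MAP = {
    "n_embd": "hidden_size",
    "n_head": "num_attention_heads",
    "n_layer": "num_hidden_layers",
}


def generate_config(src_config: str, model_name: str, tgt_dir: str) -> None:
    src = json.loads(Path(src_config).read_text())
    tgt = {FIELD_MAP.get(key, key): value for key, value in src.items()}
    tgt["model_name"] = model_name
    (Path(tgt_dir) / "config.json").write_text(json.dumps(tgt, indent=2))
```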
Finally, train on top of the converted weights:

```bash
CUDA_VISIBLE_DEVICES={x} python train.py \
    --model-config MODEL_CONFIG \
    --model-name MODEL_NAME \
    --tokenizer TOKENIZER \
    --dataset DATASET \
    --criterion CRITERION \
    --pretrained-model-path TARGET_DIR \
    --save-dir SAVE_DIR
```
and evaluate the saved checkpoint:

```bash
CUDA_VISIBLE_DEVICES={x} python test.py --dataset DATASET --pretrained-model-path SAVE_DIR
```
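A concrete invocation might look like the following; the dataset and criterion identifiers here are illustrative placeholders, so substitute whatever names the repo actually registers:

```bash
CUDA_VISIBLE_DEVICES=0 python train.py \
    --model-config TARGET_DIR/config.json \
    --model-name gpt2 \
    --tokenizer gpt2 \
    --dataset wikitext-103 \
    --criterion mle \
    --pretrained-model-path TARGET_DIR \
    --save-dir ./checkpoints/gpt2-mle

CUDA_VISIBLE_DEVICES=0 python test.py --dataset wikitext-103 --pretrained-model-path ./checkpoints/gpt2-mle
```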