Junseong’s AI Blog
/
LLM Ecosystem: Open-Source Model/Data/Code (since ChatGPT)
/
Dolly 1.0
Search
Dolly 1.0
Affiliation
Databricks
Commercial
Fine-tuning Method
SFT
Note
-
blog
- HuggingFace model card (dolly-v1-6b) :
https://huggingface.co/databricks/dolly-v1-6b
-
Databricks ML Platform & Deepspeed ZeRO 3
- 초기 version : 1 epoch 학습 in 30 mins - dolly-v1-6b :
10 epochs 학습
데이터
-
alpaca
dataset (
link
) : text-davinci-003으로 생성한 52,000개 instructions and demonstrations dataset
모델 크기
6B (
GPT-J
Fine-tuning)
새롭게 제공된 Resource
Model
출시일
2023-03