awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

738

GitHub Stars

228

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me huggingfaceh4/stack-exchange-preferences resources from awesome-instruction-datasets"

A General Language Assistant as a Laboratory for AlignmentHuggingFaceH4/stack-exchange-preferences
akoksal/LongFormGeneral SFT
akoksal/LongForm
AlpacaDataCleanedGeneral SFT
yahma/alpaca-cleaned
Alpaca-GPT4 ChineseDAPO-Math-17k
Chinese Alpaca instruction data generated by GPT-4.
Alternative SourceOrca-DPO-Pairs
Auto CoTGeneral SFT
kojima-takeshi188/zero_shot_cot/dataset | kojima-takeshi188/zero_shot_cot/log

Showing a sample of 228 resources. View the full list on GitHub →