Bactrian-X is a comprehensive multilingual parallel dataset of 3.4 million instruction-response pairs across 52 languages. The instructions were obtained from alpaca-52k, and dolly-15k, and tranlated into 52 languages (52 languages x 67k instances = 3.4M instances).
Source: Bactrian-X : A Multilingual Replicable Instruction-Following Model with Low-Rank AdaptationPaper | Code | Results | Date | Stars |
---|