This repository was archived by the owner on Nov 1, 2024. It is now read-only.

Description
After preprocessing, I got files train.{lang}.[0..7].pth. But when I try to pretrain a model with MLM, I got an error saying 'no file called train.{lang}.pth exists'.
I think that might because I have to regroup my dataset into a single file, but which part of the code should I modify or extra parameters should I put in to achieve that?