etc/corpus/README.md (+10 -4)
@@ -49,19 +49,25 @@ your respective `repo_metadata.jsonl` and the computed amount of repos to includ
#### Add one large (100GB+) repository and one repository with a lot of commits by hand
-Invoke `git clone --bare https://github.com/NagatoDEV/PlayStation-Home-Master-Archive <corpus>/github.com/NagatoDEV/PlayStation-Home-Master-Archive` (after replacing `<corpus>` with your base path)
+Invoke `git clone --bare https://github.com/NagatoDEV/PlayStation-Home-Master-Archive <corpus>/github.com/NagatoDEV/PlayStation-Home-Master-Archive.git` (after replacing `<corpus>` with your base path)
to obtain one sample of a huge repository with a lot of assets and other binary data whose tree spans more than 440k files.
That way, we also get to see what happens when we have to handle huge binary files in massive trees.
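
The clone step above can be sketched as a small shell snippet. This is only an illustration, not part of the README: `CORPUS` and `bare_dest` are hypothetical names, `/tmp/corpus` is an assumed stand-in for `<corpus>`, and the command is printed rather than executed because the real clone exceeds 100GB.

```shell
#!/bin/sh
# Sketch only: CORPUS and bare_dest are hypothetical names, not taken from the README.
CORPUS="${CORPUS:-/tmp/corpus}"   # assumed base path standing in for <corpus>

# Map a host/owner/name path to its bare-clone destination under the corpus,
# including the `.git` suffix.
bare_dest() {
    printf '%s/%s.git\n' "$1" "$2"
}

repo="github.com/NagatoDEV/PlayStation-Home-Master-Archive"
dest="$(bare_dest "$CORPUS" "$repo")"

# Print the command instead of running it; remove `echo` to actually clone.
echo git clone --bare "https://$repo" "$dest"
```

Deriving the destination from the repository path keeps the on-disk layout aligned with the `host/owner/name.git` form used in the later loop.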
Another massive tree with more than 1.3m commits comes in with this invocation:
This repo has 100MB+ files with a lot of append-only changes to it, giving it a very imbalanced delta-tree that triggers worst-case behaviour that needed
-for d in github.com/archlinux/svntogit-community github.com/NagatoDEV/PlayStation-Home-Master-Archive; do
+for d in github.com/archlinux/svntogit-community.git github.com/NagatoDEV/PlayStation-Home-Master-Archive.git github.com/fz139/vigruzki.git; do