Commit 4a4b35c

midx-write: only load initialized packs
The fill_packs_from_midx() method was refactored in fcb2205 (midx: implement support for writing incremental MIDX chains, 2024-08-06) to allow for preferred packfiles and incremental multi-pack-indexes. However, this led to some conditions that can cause improperly initialized memory in the context's list of packfiles.

The conditions caring about the preferred pack name or the incremental flag are currently necessary to load a packfile. But the context is still being populated with pack_info structs based on the packfile array for the existing multi-pack-index even if prepare_midx_pack() isn't called.

Add a new test that breaks under --stress when compiled with SANITIZE=address. The chosen number of 100 packfiles was selected to get the --stress output to fail about 50% of the time, while 50 packfiles could not get a failure in most --stress runs.

This test has a very minor check at the end confirming only one packfile remaining. The failing nature of this test actually relies on auto-GC cleaning up some packfiles during the creation of the commits, as tests setting gc.auto to zero make the packfile count match the number of added commits but also avoid hitting the memory issue.

The test case is marked as EXPENSIVE not only because of the number of packfiles it creates, but because some CI environments were reporting errors during the test that I could not reproduce, specifically around being unable to open the packfiles or their pack-indexes.

When it fails under SANITIZE=address, it provides the following error:

    AddressSanitizer:DEADLYSIGNAL
    =================================================================
    ==3263517==ERROR: AddressSanitizer: SEGV on unknown address 0x000000000027
    ==3263517==The signal is caused by a READ memory access.
    ==3263517==Hint: address points to the zero page.
        #0 0x562d5d82d1fb in close_pack_windows packfile.c:299
        #1 0x562d5d82d3ab in close_pack packfile.c:354
        #2 0x562d5d7bfdb4 in write_midx_internal midx-write.c:1490
        #3 0x562d5d7c7aec in midx_repack midx-write.c:1795
        #4 0x562d5d46fff6 in cmd_multi_pack_index builtin/multi-pack-index.c:305
        ...

This failure stack trace is disconnected from the real fix because the bad pointers are accessed later, when closing the packfiles from the context.

There are a few different aspects to this fix that are worth noting:

 1. We return to the previous behavior of fill_packs_from_midx to not rely on the incremental flag or existence of a preferred pack.

 2. The behavior to scan all layers of an incremental midx is kept, so this is not a full revert of the change.

 3. We skip allocating more room in the pack_info array if the pack fails prepare_midx_pack().

 4. The method has always returned 0 for success and 1 for failure, but the condition checking for error added a check for a negative result for failure, so that is now updated.

 5. The call to open_pack_index() is removed, but this is needed later in the case of a preferred pack. That call is moved to immediately before its result is needed (checking for the object count).

Signed-off-by: Derrick Stolee <[email protected]>
1 parent c44beea commit 4a4b35c
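
The ordering described in point 3 of the message (load the pack first, grow and append to the array only afterwards) can be illustrated with a minimal sketch. Everything here is hypothetical: `demo_pack`, `demo_info`, `demo_ctx`, `demo_prepare_pack()` and `demo_fill_packs()` are stand-ins invented for illustration and are not Git's types or functions. The sketch assumes the same return convention the message describes (0 for success, 1 for failure).

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-ins; these are NOT Git's real types. */
struct demo_pack {
	int loaded;
};

struct demo_info {
	struct demo_pack *p;
	size_t orig_pos;
};

struct demo_ctx {
	struct demo_info *info;
	size_t nr, alloc;
};

/*
 * Hypothetical loader: initializes slots[i] and returns 0, or returns 1
 * on failure. fail_at simulates a pack that cannot be loaded (-1: none).
 */
static int demo_prepare_pack(struct demo_pack **slots, size_t i, int fail_at)
{
	if ((int)i == fail_at)
		return 1;
	slots[i] = calloc(1, sizeof(**slots));
	if (!slots[i])
		return 1;
	slots[i]->loaded = 1;
	return 0;
}

/* Returns 0 on success and 1 on failure. */
static int demo_fill_packs(struct demo_ctx *ctx, struct demo_pack **slots,
			   size_t n, int fail_at)
{
	size_t i;

	for (i = 0; i < n; i++) {
		/* Load first, so an uninitialized slot is never recorded. */
		if (demo_prepare_pack(slots, i, fail_at))
			return 1;

		/* Grow only once there is a valid entry to append. */
		if (ctx->nr + 1 > ctx->alloc) {
			size_t alloc = ctx->alloc ? 2 * ctx->alloc : 4;
			struct demo_info *tmp =
				realloc(ctx->info, alloc * sizeof(*tmp));
			if (!tmp)
				return 1;
			ctx->info = tmp;
			ctx->alloc = alloc;
		}

		ctx->info[ctx->nr].p = slots[i];
		ctx->info[ctx->nr].orig_pos = i;
		ctx->nr++;
	}
	return 0;
}
```

With this ordering, every entry counted by `nr` refers to a pack that was actually loaded, so later cleanup code that walks the array should not encounter a pointer that was never set.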

2 files changed: +29 -26 lines changed

midx-write.c

Lines changed: 12 additions & 26 deletions
@@ -920,39 +920,21 @@ static struct multi_pack_index *lookup_multi_pack_index(struct repository *r,
 	return get_multi_pack_index(source);
 }

-static int fill_packs_from_midx(struct write_midx_context *ctx,
-				const char *preferred_pack_name, uint32_t flags)
+static int fill_packs_from_midx(struct write_midx_context *ctx)
 {
 	struct multi_pack_index *m;

 	for (m = ctx->m; m; m = m->base_midx) {
 		uint32_t i;

 		for (i = 0; i < m->num_packs; i++) {
-			ALLOC_GROW(ctx->info, ctx->nr + 1, ctx->alloc);
-
-			/*
-			 * If generating a reverse index, need to have
-			 * packed_git's loaded to compare their
-			 * mtimes and object count.
-			 *
-			 * If a preferred pack is specified, need to
-			 * have packed_git's loaded to ensure the chosen
-			 * preferred pack has a non-zero object count.
-			 */
-			if (flags & MIDX_WRITE_REV_INDEX ||
-			    preferred_pack_name) {
-				if (prepare_midx_pack(ctx->repo, m,
-						      m->num_packs_in_base + i)) {
-					error(_("could not load pack"));
-					return 1;
-				}
-
-				if (open_pack_index(m->packs[i]))
-					die(_("could not open index for %s"),
-					    m->packs[i]->pack_name);
+			if (prepare_midx_pack(ctx->repo, m,
+					      m->num_packs_in_base + i)) {
+				error(_("could not load pack"));
+				return 1;
 			}

+			ALLOC_GROW(ctx->info, ctx->nr + 1, ctx->alloc);
 			fill_pack_info(&ctx->info[ctx->nr++], m->packs[i],
 				       m->pack_names[i],
 				       m->num_packs_in_base + i);
@@ -1123,8 +1105,7 @@ static int write_midx_internal(struct repository *r, const char *object_dir,
 			ctx.num_multi_pack_indexes_before++;
 			m = m->base_midx;
 		}
-	} else if (ctx.m && fill_packs_from_midx(&ctx, preferred_pack_name,
-						 flags) < 0) {
+	} else if (ctx.m && fill_packs_from_midx(&ctx)) {
 		goto cleanup;
 	}

@@ -1223,6 +1204,11 @@ static int write_midx_internal(struct repository *r, const char *object_dir,

 	if (ctx.preferred_pack_idx > -1) {
 		struct packed_git *preferred = ctx.info[ctx.preferred_pack_idx].p;
+
+		if (open_pack_index(preferred))
+			die(_("failed to open preferred pack %s"),
+			    ctx.info[ctx.preferred_pack_idx].pack_name);
+
 		if (!preferred->num_objects) {
 			error(_("cannot select preferred pack %s with no objects"),
 			      preferred->pack_name);
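
The return-value mismatch fixed in the second hunk above (a helper that reports failure as 1, checked by a caller with `< 0`) can be shown in isolation. This is only a sketch with hypothetical names (`demo_fill()`, `demo_caller_negative_check()`, `demo_caller_nonzero_check()`); it is not the code from midx-write.c.

```c
#include <assert.h>

/* Hypothetical helper using the "0 on success, 1 on failure" convention. */
static int demo_fill(int should_fail)
{
	return should_fail ? 1 : 0;
}

/*
 * Mismatched check: a return value of 1 is not negative, so the
 * failure goes unnoticed and the caller carries on as if it succeeded.
 */
static int demo_caller_negative_check(int should_fail)
{
	if (demo_fill(should_fail) < 0)
		return -1; /* never reached with this helper */
	return 0;
}

/* Matching check: any nonzero result is treated as a failure. */
static int demo_caller_nonzero_check(int should_fail)
{
	if (demo_fill(should_fail))
		return -1;
	return 0;
}
```

Testing for any nonzero value keeps the caller correct whichever positive or negative error code the helper uses, which is presumably why the updated call site drops the comparison altogether.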

t/t5319-multi-pack-index.sh

Lines changed: 17 additions & 0 deletions
@@ -989,6 +989,23 @@ test_expect_success 'repack --batch-size=0 repacks everything' '
 	)
 '

+test_expect_success EXPENSIVE 'repack/expire with many packs' '
+	cp -r dup many &&
+	(
+		cd many &&
+
+		for i in $(test_seq 1 100)
+		do
+			test_commit extra$i &&
+			git maintenance run --task=loose-objects || return 1
+		done &&
+
+		git multi-pack-index write &&
+		git multi-pack-index repack &&
+		git multi-pack-index expire
+	)
+'
+
 test_expect_success 'repack --batch-size=<large> repacks everything' '
 	(
 		cd dup2 &&
