global/git

mirror of https://github.com/git/git.git synced 2026-03-05 06:57:37 +01:00

Author	SHA1	Message	Date
Taylor Blau	5f3e7f7279	midx-write.c: extract `fill_pack_from_midx()` When filling packs from an existing MIDX, `fill_packs_from_midx()` handles preparing a MIDX'd pack, and reading out its pack name from the existing MIDX. MIDX compaction will want to perform an identical operation, though the caller will look quite different than `fill_packs_from_midx()`. To reduce any future code duplication, extract `fill_pack_from_midx()` from `fill_packs_from_midx()` to prepare to call our new helper function in a future change. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:34 -08:00
Taylor Blau	4f8543255e	midx-write.c: introduce `midx_pack_perm()` helper The `ctx->pack_perm` array can be considered as a permutation between the original `pack_int_id` of some given pack to its position in the `ctx->info` array containing all packs. Today we can always index into this array with any known `pack_int_id`, since there is never a `pack_int_id` which is greater than or equal to the value `ctx->nr`. That is not necessarily the case with MIDX compaction. For example, suppose we have a MIDX chain with three layers, each containing three packs. The base of the MIDX chain will have packs with IDs 0, 1, and 2, the next layer 3, 4, and 5, and so on. If we are compacting the topmost two layers, we'll have input `pack_int_id` values between [3, 8], but `ctx->nr` will only be 6. In that example, if we want to know where the pack whose original `pack_int_id` value was, say, 7, we would compute `ctx->pack_perm[7]`, leading to an uninitialized read, since there are only 6 entries allocated in that array. To address this, there are a couple of options: - We could allocate enough entries in `ctx->pack_perm` to accommodate the largest `orig_pack_int_id` value. - Or, we could internally shift the input values by the number of packs in the base layer of the lower end of the MIDX compaction range. This patch prepare us to take the latter approach, since it does not allocate more memory than strictly necessary. (In our above example, the base of the lower end of the compaction range is the first MIDX layer (having three packs), so we would end up indexing `ctx->pack_perm[7-3]`, which is a valid read.) Note that this patch does not actually implement that approach yet, but merely performs a behavior-preserving refactoring which will make the change easier to carry out in the future. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:33 -08:00
Taylor Blau	b2ec8e90c2	midx: do not require packs to be sorted in lexicographic order The MIDX file format currently requires that pack files be identified by the lexicographic ordering of their names (that is, a pack having a checksum beginning with "abc" would have a numeric pack_int_id which is smaller than the same value for a pack beginning with "bcd"). As a result, it is impossible to combine adjacent MIDX layers together without permuting bits from bitmaps that are in more recent layer(s). To see why, consider the following example: \| packs \| preferred pack --------+-------------+--------------- MIDX #0 \| { X, Y, Z } \| Y MIDX #1 \| { A, B, C } \| B MIDX #2 \| { D, E, F } \| D , where MIDX #2's base MIDX is MIDX #1, and so on. Suppose that we want to combine MIDX layers #0 and #1, to create a new layer #0' containing the packs from both layers. With the original three MIDX layers, objects are laid out in the bitmap in the order they appear in their source pack, and the packs themselves are arranged according to the pseudo-pack order. In this case, that ordering is Y, X, Z, B, A, C. But recall that the pseudo-pack ordering is defined by the order that packs appear in the MIDX, with the exception of the preferred pack, which sorts ahead of all other packs regardless of its position within the MIDX. In the above example, that means that pack 'Y' could be placed anywhere (so long as it is designated as preferred), however, all other packs must be placed in the location listed above. Because that ordering isn't sorted lexicographically, it is impossible to compact MIDX layers in the above configuration without permuting the object-to-bit-position mapping. Changing this mapping would affect all bitmaps belonging to newer layers, rendering the bitmaps associated with MIDX #2 unreadable. One of the goals of MIDX compaction is that we are able to shrink the length of the MIDX chain without invalidating bitmaps that belong to newer layers, and the lexicographic ordering constraint is at odds with this goal. However, packs do not need to be lexicographically ordered within the MIDX. As far as I can gather, the only reason they are sorted lexically is to make it possible to perform a binary search over the pack names in a MIDX, necessary to make `midx_contains_pack()`'s performance logarithmic in the number of packs rather than linear. Relax this constraint by allowing MIDX writes to proceed with packs that are not arranged in lexicographic order. `midx_contains_pack()` will lazily instantiate a `pack_names_sorted` array on the MIDX, which will be used to implement the binary search over pack names. This change produces MIDXs which may not be correctly read with external tools or older versions of Git. Though older versions of Git know how to gracefully degrade and ignore any MIDX(s) they consider corrupt, external tools may not be as robust. To avoid unintentionally breaking any such tools, guard this change behind a version bump in the MIDX's on-disk format. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:33 -08:00
Taylor Blau	82c905ea6b	midx-write.c: introduce `struct write_midx_opts` In the MIDX writing code, there are four functions which perform some sort of MIDX write operation. They are: - write_midx_file() - write_midx_file_only() - expire_midx_packs() - midx_repack() All of these functions are thin wrappers over `write_midx_internal()`, which implements the bulk of these routines. As a result, the `write_midx_internal()` function takes six arguments. Future commits in this series will want to add additional arguments, and in general this function's signature will be the union of parameters among all possible ways to write a MIDX. Instead of adding yet more arguments to this function to support MIDX compaction, introduce a `struct write_midx_opts`, which has the same struct members as `write_midx_internal()`'s arguments. Adding additional fields to the `write_midx_opts` struct is preferable to adding additional arguments to `write_midx_internal()`. This is because the callers below all zero-initialize the struct, so each time we add a new piece of information, we do not have to pass the zero value for it in all other call-sites that do not care about it. For now, no functional changes are included in this patch. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:33 -08:00
Taylor Blau	ac10f6aa40	midx-write.c: don't use `pack_perm` when assigning `bitmap_pos` In midx_pack_order(), we compute for each bitmapped pack the first bit to correspond to an object in that pack, along with how many bits were assigned to object(s) in that pack. Initially, each bitmap_nr value is set to zero, and each bitmap_pos value is set to the sentinel BITMAP_POS_UNKNOWN. This is done to ensure that there are no packs who have an unknown bit position but a somehow non-zero number of objects (cf. `write_midx_bitmapped_packs()` in midx-write.c). Once the pack order is fully determined, midx_pack_order() sets the bitmap_pos field for any bitmapped packs to zero if they are still listed as BITMAP_POS_UNKNOWN. However, we enumerate the bitmapped packs in order of `ctx->pack_perm`. This is fine for existing cases, since the only time the `ctx->pack_perm` array holds a value outside of the addressable range of `ctx->info` is when there are expired packs, which only occurs via 'git multi-pack-index expire', which does not support writing MIDX bitmaps. As a result, the range of ctx->pack_perm covers all values in [0, `ctx->nr`), so enumerating in this order isn't an issue. A future change necessary for compaction will complicate this further by introducing a wrapper around the `ctx->pack_perm` array, which turns the given `pack_int_id` into one that is relative to the lower end of the compaction range. As a result, indexing into `ctx->pack_perm` through this helper, say, with "0" will produce a crash when the lower end of the compaction range has >0 pack(s) in its base layer, since the subtraction will wrap around the 32-bit unsigned range, resulting in an uninitialized read. But the process is completely unnecessary in the first place: we are enumerating all values of `ctx->info`, and there is no reason to process them in a different order than they appear in memory. Index `ctx->info` directly to reflect that. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:33 -08:00
Taylor Blau	8043782112	t/t5319-multi-pack-index.sh: fix copy-and-paste error in t5319.39 Commit `d4bf1d88b9` (multi-pack-index: verify missing pack, 2018-09-13) adds a new test to the MIDX test script to test how we handle missing packs. While the commit itself describes the test as "verify missing pack[s]", the test itself is actually called "verify packnames out of order", despite that not being what it tests. Likely this was a copy-and-paste of the test immediately above it of the same name. Correct this by renaming the test to match the commit message. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:33 -08:00
Taylor Blau	d0e91c128b	git-multi-pack-index(1): align SYNOPSIS with 'git multi-pack-index -h' Since `c39fffc1c9` (tests: start asserting that *.txt SYNOPSIS matches -h output, 2022-10-13), the manual page for 'git multi-pack-index' has a SYNOPSIS section which differs from 'git multi-pack-index -h'. Correct this while also documenting additional options accepted by the 'write' sub-command. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:32 -08:00
Taylor Blau	f775d5b1cf	git-multi-pack-index(1): remove non-existent incompatibility Since `fcb2205b77` (midx: implement support for writing incremental MIDX chains, 2024-08-06), the command-line options '--incremental' and '--bitmap' were declared to be incompatible with one another when running 'git multi-pack-index write'. However, since `27afc272c4` (midx: implement writing incremental MIDX bitmaps, 2025-03-20), that incompatibility no longer exists, despite the documentation saying so. Correct this by removing the stale reference to their incompatibility. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:32 -08:00
Taylor Blau	6b8fb17490	builtin/multi-pack-index.c: make '--progress' a common option All multi-pack-index sub-commands (write, verify, repack, and expire) support a '--progress' command-line option, despite not listing it as one of the common options in `common_opts`. As a result each sub-command declares its own `OPT_BIT()` for a "--progress" command-line option. Centralize this within the `common_opts` to avoid re-declaring it in each sub-command. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:32 -08:00
Taylor Blau	6e86f67924	midx: introduce `midx_get_checksum_hex()` When trying to print out, say, the hexadecimal representation of a MIDX's hash, our code will do something like: hash_to_hex_algop(midx_get_checksum_hash(m), m->source->odb->repo->hash_algo); , which is both cumbersome and repetitive. In fact, all but a handful of callers to `midx_get_checksum_hash()` do exactly the above. Reduce the repetitive nature of calling `midx_get_checksum_hash()` by having it return a pointer into a static buffer containing the above result. For the handful of callers that do need to compare the raw bytes and don't want to deal with an encoded copy (e.g., because they are passing it to hasheq() or similar), they may still rely on `midx_get_checksum_hash()` which returns the raw bytes. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:32 -08:00
Taylor Blau	de811c26bb	midx: rename `get_midx_checksum()` to `midx_get_checksum_hash()` Since `541204aabe` (Documentation: document naming schema for structs and their functions, 2024-07-30), we have adopted a naming convention for functions that would prefer a name like, say, `midx_get_checksum()` over `get_midx_checksum()`. Adopt this convention throughout the midx.h API. Since this function returns a raw (that is, non-hex encoded) hash, let's suffix the function with "_hash()" to make this clear. As a side effect, this prepares us for the subsequent change which will introduce a "_hex()" variant that encodes the checksum itself. Suggested-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:32 -08:00
Taylor Blau	00c0d84e6f	midx: mark `get_midx_checksum()` arguments as const To make clear that the function `get_midx_checksum()` does not do anything to modify its argument, mark the MIDX pointer as const. The following commit will rename this function altogether to make clear that it returns the raw bytes of the checksum, not a hex-encoded copy of it. Signed-off-by: Taylor Blau <me@ttaylorr.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-24 11:16:31 -08:00
Junio C Hamano	7c02d39fc2	The 6th batch Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-20 11:36:18 -08:00
Junio C Hamano	c61120cf65	Merge branch 'ak/t9812-test-path-is-helpers' Test update. * ak/t9812-test-path-is-helpers: t9812: modernize test path helpers	2026-02-20 11:36:18 -08:00
Junio C Hamano	8ceb69f85d	Merge branch 'pw/diff-anchored-optim' "git diff --anchored=<text>" has been optimized. * pw/diff-anchored-optim: diff --anchored: avoid checking unmatched lines	2026-02-20 11:36:18 -08:00
Junio C Hamano	2a0a64b9d4	Merge branch 'jc/doc-cg-c-comment' A CodingGuidelines update. * jc/doc-cg-c-comment: CodingGuidelines: document // comments	2026-02-20 11:36:18 -08:00
Junio C Hamano	5465d3683a	Merge branch 'pw/xdiff-cleanups' Small clean-up of xdiff library to remove unnecessary data duplication. * pw/xdiff-cleanups: xdiff: remove unused data from xdlclass_t xdiff: remove "line_hash" field from xrecord_t	2026-02-20 11:36:17 -08:00
Junio C Hamano	73fd77805f	The 5th batch Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-17 13:30:43 -08:00
Junio C Hamano	05e54d2d60	Merge branch 'jc/doc-rerere-update' Doc update. * jc/doc-rerere-update: rerere: minor documantation update	2026-02-17 13:30:43 -08:00
Junio C Hamano	11294bb0fa	Merge branch 'sd/t7003-test-path-is-helpers' Test updates. * sd/t7003-test-path-is-helpers: t7003: modernize path existence checks using test helpers	2026-02-17 13:30:43 -08:00
Junio C Hamano	dba8cfa9c0	Merge branch 'kh/doc-rerere-options-xref' Doc update. * kh/doc-rerere-options-xref: doc: rerere-options.adoc: link to git-rerere(1)	2026-02-17 13:30:43 -08:00
Junio C Hamano	70d3916a7d	Merge branch 'bk/t2003-modernise' Test update. * bk/t2003-modernise: t2003: modernize path existence checks using test helpers	2026-02-17 13:30:43 -08:00
Junio C Hamano	83037cb357	Merge branch 'rs/commit-commit-stack' Code clean-up to use the commit_stack API. * rs/commit-commit-stack: commit: use commit_stack	2026-02-17 13:30:42 -08:00
Junio C Hamano	354b8d89ac	Merge branch 'rs/clean-includes' Clean up redundant includes of header files. * rs/clean-includes: remove duplicate includes	2026-02-17 13:30:42 -08:00
Junio C Hamano	d445421a54	Merge branch 'rs/xdiff-wo-the-repository' Reduce dependency on the_repository of xdiff-interface layer. * rs/xdiff-wo-the-repository: xdiff-interface: stop using the_repository	2026-02-17 13:30:42 -08:00
Junio C Hamano	e8b7e16e7b	Merge branch 'rs/version-wo-the-repository' Code clean-up. * rs/version-wo-the-repository: version: stop using the_repository	2026-02-17 13:30:42 -08:00
Junio C Hamano	73f29c8ca9	Merge branch 'yt/merge-file-outside-a-repository' "git merge-file" can be run outside a repository, but it ignored all configuration, even the per-user ones. The command now uses available configuration files to find its customization. * yt/merge-file-outside-a-repository: merge-file: honor merge.conflictStyle outside of a repository	2026-02-17 13:30:41 -08:00
Junio C Hamano	6de2d1493a	Merge branch 'ja/doc-synopsis-style-even-more' A handful of documentation pages have been modernized to use the "synopsis" style. * ja/doc-synopsis-style-even-more: doc: convert git-show to synopsis style doc: fix some style issues in git-clone and for-each-ref-options doc: finalize git-clone documentation conversion to synopsis style doc: convert git-submodule to synopsis style	2026-02-17 13:30:41 -08:00
Junio C Hamano	5779c47fa0	Merge branch 'pc/lockfile-pid' Allow recording process ID of the process that holds the lock next to a lockfile for diagnosis. * pc/lockfile-pid: lockfile: add PID file for debugging stale locks	2026-02-17 13:30:41 -08:00
Junio C Hamano	852829b3dd	The 4th batch Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-13 13:39:26 -08:00
Junio C Hamano	03dfe4e1af	Merge branch 'sb/merge-ours-sparse' "git merge-ours" is taught to work better in a sparse checkout. * sb/merge-ours-sparse: merge-ours: integrate with sparse-index merge-ours: drop USE_THE_REPOSITORY_VARIABLE	2026-02-13 13:39:26 -08:00
Junio C Hamano	94336d77bc	Merge branch 'sd/doc-my1c-api-config-reference-fix' Docfix. * sd/doc-my1c-api-config-reference-fix: doc: fix repo_config documentation reference	2026-02-13 13:39:26 -08:00
Junio C Hamano	e10d5fcad0	Merge branch 'jc/ci-test-contrib-too' Test contrib/ things in CI to catch breakages before they enter the "next" branch. * jc/ci-test-contrib-too: : Some of our downstream folks run more tests than we do and catch : breakages in them, namely, where contrib/*/Makefile has "test" target. : Let's make sure we fail upon accepting a new topic that break them in : 'seen'. ci: ubuntu: use GNU coreutils for dirname test: optionally test contrib in CI	2026-02-13 13:39:26 -08:00
Junio C Hamano	29722ee3a3	Merge branch 'jt/odb-transaction-per-source' Transaction to create objects (or not) is currently tied to the repository, but in the future a repository can have multiple object sources, which may have different transaction mechanisms. Make the odb transaction API per object source. * jt/odb-transaction-per-source: odb: transparently handle common transaction behavior odb: prepare `struct odb_transaction` to become generic object-file: rename transaction functions odb: store ODB source in `struct odb_transaction`	2026-02-13 13:39:26 -08:00
Junio C Hamano	5288202433	Merge branch 'ps/commit-list-functions-renamed' Rename three functions around the commit_list data structure. * ps/commit-list-functions-renamed: commit: rename `free_commit_list()` to conform to coding guidelines commit: rename `reverse_commit_list()` to conform to coding guidelines commit: rename `copy_commit_list()` to conform to coding guidelines	2026-02-13 13:39:25 -08:00
Junio C Hamano	70cc3bca87	Merge branch 'tc/last-modified-not-a-tree' Giving "git last-modified" a tree (not a commit-ish) died an uncontrolled death, which has been corrected. * tc/last-modified-not-a-tree: last-modified: verify revision argument is a commit-ish last-modified: remove double error message last-modified: fix memory leak when more than one commit is given last-modified: rewrite error message when more than one commit given	2026-02-13 13:39:25 -08:00
Junio C Hamano	f036245819	Merge branch 'mc/doc-send-email-signed-off-by-cc' Docfix. * mc/doc-send-email-signed-off-by-cc: doc: send-email: correct --no-signed-off-by-cc misspelling	2026-02-13 13:39:25 -08:00
Junio C Hamano	7855effc95	Merge branch 'cf/c23-const-preserving-strchr-updates-0' ISO C23 redefines strchr and friends that tradiotionally took a const pointer and returned a non-const pointer derived from it to preserve constness (i.e., if you ask for a substring in a const string, you get a const pointer to the substring). Update code paths that used non-const pointer to receive their results that did not have to be non-const to adjust. * cf/c23-const-preserving-strchr-updates-0: gpg-interface: remove an unnecessary NULL initialization global: constify some pointers that are not written to	2026-02-13 13:39:25 -08:00
Junio C Hamano	a91de2172d	Merge branch 'jc/diff-highlight-main-master-testfix' Test fix (in contrib/) * jc/diff-highlight-main-master-testfix: diff-highlight: allow testing with Git 3.0 breaking changes	2026-02-13 13:39:24 -08:00
Junio C Hamano	448a65c93b	Merge branch 'cs/subtree-reftable-testfix' Test fix (in contrib/) * cs/subtree-reftable-testfix: contrib/subtree: fix tests with reftable backend	2026-02-13 13:39:24 -08:00
Junio C Hamano	b852412523	Merge branch 'tc/memzero-array' Coccinelle rules update. * tc/memzero-array: cocci: extend MEMZERO_ARRAY() rules	2026-02-13 13:39:24 -08:00
Ashwani Kumar Kamal	580d86576a	t9812: modernize test path helpers Replace assertion-style 'test -f' checks with Git's test_path_is_file() helper for clearer failures and consistency. Signed-off-by: Ashwani Kumar Kamal <ashwanikamal.im421@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-12 11:08:29 -08:00
Phillip Wood	dd2a4c0c7a	diff --anchored: avoid checking unmatched lines For a line to be an anchor it has to appear in each of the files being diffed exactly once. With that in mind lets delay checking whether a line is an anchor until we know there is exactly one instance of the line in each file. As each line is checked at most once, there is no need to cache the result of is_anchor() and we can drop that field from the hashmap entries. When diffing 5000 recent commits in git.git this gives a modest speedup of ~2%. In the (rather extreme) example below that consists largely of deletions the speedup is ~16%. seq 0 10000000 >old printf '%s\n' 300000 100000 200000 >new git diff --no-index --anchored=300000 old new Signed-off-by: Phillip Wood <phillip.wood@dunelm.org.uk> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-12 09:28:49 -08:00
Junio C Hamano	453e7b744a	Merge branch 'master' of https://github.com/j6t/gitk * 'master' of https://github.com/j6t/gitk: gitk: fix msgfmt being required gitk: fix highlighted remote prefix of branches with directories	2026-02-11 14:49:53 -08:00
Junio C Hamano	2f99f50f2d	CodingGuidelines: document // comments We do not use // comments in our C code, which is implied by the description of multi-line comment rule and its examples, but is not explicitly spelled out. Spell it out. Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-11 13:29:36 -08:00
Junio C Hamano	6fcee47852	The 3rd batch Signed-off-by: Junio C Hamano <gitster@pobox.com>	2026-02-11 12:29:08 -08:00
Junio C Hamano	06cef761b1	Merge branch 'rs/blame-ignore-colors-fix' "git blame --ignore-revs=... --color-lines" did not account for ignored revisions passing blame to the same commit an adjacent line gets blamed for. * rs/blame-ignore-colors-fix: blame: fix coloring for repeated suspects	2026-02-11 12:29:08 -08:00
Junio C Hamano	ea03b35bb5	Merge branch 'hs/t9160-test-paths' Test update. * hs/t9160-test-paths: t9160:modernize test path checking	2026-02-11 12:29:07 -08:00
Junio C Hamano	18df0fa3ca	Merge branch 'am/doc-github-contributiong-link-to-submittingpatches' GitHub repository banner update. * am/doc-github-contributiong-link-to-submittingpatches: .github/CONTRIBUTING.md: link to SubmittingPatches on git-scm.com	2026-02-11 12:29:07 -08:00
Junio C Hamano	40c59964b3	Merge branch 'kh/doc-shortlog-fix' Doc fix. * kh/doc-shortlog-fix: doc: shortlog: put back trailer paragraphs	2026-02-11 12:29:07 -08:00

1 2 3 4 5 ...

79777 Commits