global/git

mirror of https://github.com/git/git.git synced 2026-03-04 06:27:36 +01:00

Go to file

Elijah Newren 8b09a900a1 merge-ort: restart merge with cached renames to reduce process entry cost

The merge algorithm mostly consists of the following three functions:
   collect_merge_info()
   detect_and_process_renames()
   process_entries()
Prior to the trivial directory resolution optimization of the last half
dozen commits, process_entries() was consistently the slowest, followed
by collect_merge_info(), then detect_and_process_renames().  When the
trivial directory resolution applies, it often dramatically decreases
the amount of time spent in the two slower functions.

Looking at the performance results in the previous commit, the trivial
directory resolution optimization helps amazingly well when there are no
relevant renames.  It also helps really well when reapplying a long
series of linear commits (such as in a rebase or cherry-pick), since the
relevant renames may well be cached from the first reapplied commit.
But when there are any relevant renames that are not cached (represented
by the just-one-mega testcase), then the optimization does not help at
all.

Often, I noticed that when the optimization does not apply, it is
because there are a handful of relevant sources -- maybe even only one.
It felt frustrating to need to recurse into potentially hundreds or even
thousands of directories just for a single rename, but it was needed for
correctness.

However, staring at this list of functions and noticing that
process_entries() is the most expensive and knowing I could avoid it if
I had cached renames suggested a simple idea: change
   collect_merge_info()
   detect_and_process_renames()
   process_entries()
into
   collect_merge_info()
   detect_and_process_renames()
   <cache all the renames, and restart>
   collect_merge_info()
   detect_and_process_renames()
   process_entries()

This may seem odd and look like more work.  However, note that although
we run collect_merge_info() twice, the second time we get to employ
trivial directory resolves, which makes it much faster, so the increased
time in collect_merge_info() is small.  While we run
detect_and_process_renames() again, all renames are cached so it's
nearly a no-op (we don't call into diffcore_rename_extended() but we do
have a little bit of data structure checking and fixing up).  And the
big payoff comes from the fact that process_entries(), will be much
faster due to having far fewer entries to process.

This restarting only makes sense if we can save recursing into enough
directories to make it worth our while.  Introduce a simple heuristic to
guide this.  Note that this heuristic uses a "wanted_factor" that I have
virtually no actual real world data for, just some back-of-the-envelope
quasi-scientific calculations that I included in some comments and then
plucked a simple round number out of thin air.  It could be that
tweaking this number to make it either higher or lower improves the
optimization.  (There's slightly more here; when I first introduced this
optimization, I used a factor of 10, because I was completely confident
it was big enough to not cause slowdowns in special cases.  I was
certain it was higher than needed.  Several months later, I added the
rough calculations which make me think the optimal number is close to 2;
but instead of pushing to the limit, I just bumped it to 3 to reduce the
risk that there are special cases where this optimization can result in
slowing down the code a little.  If the ratio of path counts is below 3,
we probably will only see minor performance improvements at best
anyway.)

Also, note that while the diffstat looks kind of long (nearly 100
lines), more than half of it is in two comments explaining how things
work.

For the testcases mentioned in commit 557ac0350d ("merge-ort: begin
performance work; instrument with trace2_region_* calls", 2020-10-28),
this change improves the performance as follows:

                            Before                  After
    no-renames:      205.1  ms ±  3.8  ms   204.2  ms ±  3.0  ms
    mega-renames:      1.564 s ±  0.010 s     1.076 s ±  0.015 s
    just-one-mega:   479.5  ms ±  3.9  ms   364.1  ms ±  7.0  ms

Signed-off-by: Elijah Newren <newren@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>

2021-07-20 14:47:40 -07:00

.github

cmake(install): include vcpkg dlls

2021-03-29 13:49:04 -07:00

block-sha1

block-sha1: drop trailing semicolon from macro definition

2021-03-17 10:20:01 -07:00

builtin

Merge branch 'ab/struct-init'

2021-07-16 17:42:53 -07:00

Merge branch 'dd/svn-test-wo-locale-a'

2021-07-08 13:14:58 -07:00

compat

Merge branch 'jh/simple-ipc-sans-pthread'

2021-05-22 18:29:01 +09:00

contrib

Merge branch 'js/gfw-system-config-loc-fix'

2021-07-16 17:42:46 -07:00

Documentation

The fifth batch

2021-07-16 17:42:53 -07:00

ewah

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

git-gui

Merge https://github.com/prati0100/git-gui

2021-03-04 12:38:50 -08:00

gitk-git

Merge remote-tracking branch 'paulus/master' into pm/gitk-update

2020-10-03 10:06:27 -07:00

gitweb

gitweb: add "e-mail privacy" feature to redact e-mail addresses

2021-04-08 15:54:26 -07:00

mergetools

mergetools/kdiff3: make kdiff3 work on Windows too

2021-06-08 10:26:16 +09:00

negotiator

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

perl

Merge branch 'so/log-m-implies-p'

2021-06-14 13:33:27 +09:00

l10n: es: 2.32.0 round 1

2021-06-05 20:06:23 -05:00

ppc

…

refs

Merge branch 'ab/struct-init'

2021-07-16 17:42:53 -07:00

sha1collisiondetection @ 855827c583

…

sha1dc

…

sha256

…

merge-ort: restart merge with cached renames to reduce process entry cost

2021-07-20 14:47:40 -07:00

templates

hook: add sample template for push-to-checkout

2020-10-16 08:47:02 -07:00

trace2

Merge branch 'ab/trace2-squelch-gcc-warning'

2021-06-14 13:33:28 +09:00

xdiff

Merge branch 'ab/pickaxe-pcre2'

2021-07-13 16:52:50 -07:00

.cirrus.yml

…

.clang-format

…

.editorconfig

…

.gitattributes

CoC: explicitly take any whitespace breakage

2021-01-04 09:44:49 -08:00

.gitignore

Merge branch 'ls/subtree'

2021-05-10 16:59:47 +09:00

.gitmodules

…

.mailmap

Add entry for Ramkumar Ramachandra

2021-03-08 09:56:34 -08:00

.travis.yml

ci: remove GETTEXT_POISON jobs

2021-01-21 15:50:00 -08:00

.tsan-suppressions

…

abspath.c

abspath: add a function to resolve paths with missing components

2020-12-12 23:35:47 -08:00

aclocal.m4

…

add-interactive.c

Merge branch 'rs/xcalloc-takes-nelem-first'

2021-03-19 15:25:39 -07:00

add-interactive.h

…

add-patch.c

Merge branch 'js/add-i-color-fix'

2020-12-08 15:11:17 -08:00

advice.c

add: warn when asked to update SKIP_WORKTREE entries

2021-04-08 14:18:03 -07:00

advice.h

add: warn when asked to update SKIP_WORKTREE entries

2021-04-08 14:18:03 -07:00

alias.c

…

alias.h

…

alloc.c

…

alloc.h

…

apply.c

string-list.h users: change to use *_{nodup,dup}()

2021-07-01 12:32:22 -07:00

apply.h

…

archive-tar.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

archive-zip.c

archive: read short blobs in archive.c::write_archive_entry()

2020-09-19 15:56:05 -07:00

archive.c

string-list.h users: change to use *_{nodup,dup}()

2021-07-01 12:32:22 -07:00

archive.h

archive: expand only a single %(describe) per archive

2021-03-11 13:22:44 -08:00

attr.c

*: fix typos which duplicate a word

2021-06-14 10:16:06 +09:00

attr.h

*: remove 'const' qualifier for struct index_state

2021-04-14 13:46:00 -07:00

banned.h

banned.h: mark ctime_r() and asctime_r() as banned

2020-12-02 14:30:39 -08:00

base85.c

…

bisect.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

bisect.h

bisect: combine args passed to find_bisection()

2020-08-07 15:13:03 -07:00

blame.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

blame.h

blame: simplify 'setup_blame_bloom_data' interface

2020-11-01 15:54:15 -08:00

blob.c

…

blob.h

…

bloom.c

Merge branch 'ah/plugleaks'

2021-05-07 12:47:41 +09:00

bloom.h

bloom: encode out-of-bounds filters as non-empty

2020-09-17 21:55:50 -07:00

branch.c

Merge branch 'bc/hash-transition-interop-part-1'

2021-05-10 16:59:46 +09:00

branch.h

…

builtin.h

parallel-checkout: make it truly parallel

2021-04-19 11:57:05 -07:00

bulk-checkin.c

bulk-checkin: make buffer reuse more obvious and safer

2021-06-11 13:01:22 +09:00

bulk-checkin.h

…

bundle.c

bundle: remove "ref_list" in favor of string-list.c API

2021-07-06 12:10:17 -07:00

bundle.h

bundle: remove "ref_list" in favor of string-list.c API

2021-07-06 12:10:17 -07:00

cache-tree.c

Merge branch 'ds/sparse-index-protections'

2021-04-30 13:50:26 +09:00

cache-tree.h

cache-tree: extract subtree_pos()

2021-01-23 17:14:07 -08:00

cache.h

Merge branch 'mt/parallel-checkout-part-3'

2021-05-16 21:05:23 +09:00

chdir-notify.c

…

chdir-notify.h

…

check_bindir

…

check-builtins.sh

…

checkout.c

config: drop git_config_get_string_const()

2020-08-17 15:35:47 -07:00

checkout.h

…

chunk-format.c

csum-file.h: increase hashfile buffer size

2021-05-19 16:41:21 +09:00

chunk-format.h

chunk-format: create read chunk API

2021-02-18 13:38:16 -08:00

CODE_OF_CONDUCT.md

CoC: update to version 2.0 + local changes

2021-01-13 17:45:04 -08:00

color.c

…

color.h

…

column.c

Merge branch 'jk/strvec'

2020-08-10 10:23:57 -07:00

column.h

…

combine-diff.c

Merge branch 'ab/pickaxe-pcre2'

2021-07-13 16:52:50 -07:00

command-list.txt

mailmap doc: create a new "gitmailmap(5)" man page

2021-01-12 14:04:39 -08:00

commit-graph.c

commit-graph: rewrite to use checksum_valid()

2021-06-28 20:36:17 -07:00

commit-graph.h

commit-graph: use config to specify generation type

2021-02-25 15:10:41 -08:00

commit-reach.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

commit-reach.h

commit-graph: return 64-bit generation number

2021-01-18 16:21:18 -08:00

commit-slab-decl.h

…

commit-slab-impl.h

…

commit-slab.h

…

commit.c

Merge branch 'ds/gender-neutral-doc'

2021-07-16 17:42:53 -07:00

commit.h

Merge branch 'cm/rebase-i-fixup-amend-reword'

2021-03-26 14:59:03 -07:00

common-main.c

…

config.c

Merge branch 'ds/gender-neutral-doc'

2021-07-16 17:42:53 -07:00

config.h

Merge branch 'ds/gender-neutral-doc'

2021-07-16 17:42:53 -07:00

config.mak.dev

Merge branch 'jc/sparse-error-for-developer-build'

2020-11-18 13:32:54 -08:00

config.mak.in

…

config.mak.uname

Merge branch 'js/config-mak-windows-pcre-fix'

2021-07-16 17:42:47 -07:00

configure.ac

Remove support for v1 of the PCRE library

2021-01-23 21:15:43 -08:00

connect.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

connect.h

…

connected.c

Merge branch 'rs/more-buffered-io'

2020-08-24 14:54:31 -07:00

connected.h

…

convert.c

Merge branch 'ds/sparse-index-protections'

2021-04-30 13:50:26 +09:00

convert.h

Merge branch 'ds/sparse-index-protections'

2021-04-30 13:50:26 +09:00

copy.c

…

COPYING

…

credential.c

*.c *_init(): define in terms of corresponding *_INIT macro

2021-07-01 12:32:22 -07:00

credential.h

*.h: move some *_INIT to designated initializers

2021-07-01 12:31:45 -07:00

csum-file.c

Merge branch 'tb/midx-use-checksum'

2021-07-16 17:42:46 -07:00

csum-file.h

Merge branch 'tb/midx-use-checksum'

2021-07-16 17:42:46 -07:00

ctype.c

…

daemon.c

Merge branch 'rs/daemon-sanitize-dir-sep'

2021-04-08 13:23:26 -07:00

date.c

comments: avoid using the gender of our users

2021-06-16 11:25:11 +09:00

decorate.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

decorate.h

…

delta-islands.c

…

delta-islands.h

…

delta.h

…

detect-compiler

…

diff-delta.c

…

diff-lib.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

diff-merges.c

diff-merges: let "-m" imply "-p"

2021-05-21 09:24:14 +09:00

diff-merges.h

diff-merges: move specific diff-index "-m" handling to diff-index

2021-05-21 09:24:14 +09:00

diff-no-index.c

dir: introduce readdir_skip_dot_and_dotdot() helper

2021-05-27 14:02:37 +09:00

diff.c

Merge branch 'ab/pickaxe-pcre2'

2021-07-13 16:52:50 -07:00

diff.h

diff.h: move pickaxe fields together again

2021-05-11 12:47:31 +09:00

diffcore-break.c

…

diffcore-delta.c

…

diffcore-order.c

…

diffcore-pickaxe.c

xdiff-interface: replace discard_hunk_line() with a flag

2021-05-11 12:47:31 +09:00

diffcore-rename.c

Merge branch 'en/ort-perf-batch-13'

2021-07-16 17:42:45 -07:00

diffcore-rotate.c

diff: --{rotate,skip}-to=<path>

2021-02-16 09:30:42 -08:00

diffcore.h

Merge branch 'en/ort-perf-batch-11'

2021-06-14 13:33:27 +09:00

dir-iterator.c

…

dir-iterator.h

…

dir.c

dir.[ch]: replace dir_init() with DIR_INIT

2021-07-01 12:32:22 -07:00

dir.h

dir.[ch]: replace dir_init() with DIR_INIT

2021-07-01 12:32:22 -07:00

editor.c

config: fix leaks from git_config_get_string_const()

2020-08-14 10:52:04 -07:00

entry.c

string-list.h users: change to use *_{nodup,dup}()

2021-07-01 12:32:22 -07:00

entry.h

entry: add checkout_entry_ca() taking preloaded conv_attrs

2021-03-23 10:34:05 -07:00

environment.c

config: allow specifying config entries via envvar pairs

2021-01-15 13:03:45 -08:00

environment.h

environment: make getenv_safe() a public function

2021-01-15 13:03:45 -08:00

exec-cmd.c

strvec: rename struct fields

2020-07-30 19:18:06 -07:00

exec-cmd.h

…

fetch-negotiator.c

negotiator/noop: add noop fetch negotiator

2020-08-18 13:25:05 -07:00

fetch-negotiator.h

…

fetch-pack.c

fetch-pack: signal v2 server that we are done making requests

2021-05-20 07:38:40 +09:00

fetch-pack.h

fetch: teach independent negotiation (no packfile)

2021-05-05 10:41:29 +09:00

fmt-merge-msg.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

fmt-merge-msg.h

…

fsck.c

fsck: warn about symlinked dotfiles we'll open with O_NOFOLLOW

2021-05-04 11:52:02 +09:00

fsck.h

fsck: warn about symlinked dotfiles we'll open with O_NOFOLLOW

2021-05-04 11:52:02 +09:00

fsmonitor.c

Merge branch 'jh/fsmonitor-prework'

2021-03-19 15:25:37 -07:00

fsmonitor.h

fsmonitor: add assertion that fsmonitor is valid to check_removed

2021-03-18 13:31:13 -07:00

fuzz-commit-graph.c

commit-graph: pass a 'struct repository *' in more places

2020-09-09 12:51:48 -07:00

fuzz-pack-headers.c

…

fuzz-pack-idx.c

…

generate-cmdlist.sh

Fit to Plan 9's ANSI/POSIX compatibility layer

2020-09-09 22:31:31 -07:00

generate-configlist.sh

…

gettext.c

Merge branch 'ab/detox-gettext-tests'

2021-02-10 14:48:33 -08:00

gettext.h

tests: remove support for GIT_TEST_GETTEXT_POISON

2021-01-21 15:50:01 -08:00

git-add--interactive.perl

Merge branch 'js/add-i-color-fix'

2020-12-08 15:11:17 -08:00

git-archimport.perl

…

git-bisect.sh

bisect--helper: reimplement bisect_skip shell function in C

2021-02-03 14:52:09 -08:00

git-compat-util.h

Merge branch 'ew/mmap-failures'

2021-07-16 17:42:47 -07:00

git-cvsexportcommit.perl

cvsexportcommit: do not run git programs in dashed form

2020-08-26 14:49:52 -07:00

git-cvsimport.perl

…

git-cvsserver.perl

*: fix typos which duplicate a word

2021-06-14 10:16:06 +09:00

git-difftool--helper.sh

mergetool: break setup_tool out into separate initialization function

2021-02-09 14:09:16 -08:00

git-filter-branch.sh

filter-branch: drop $_x40 glob

2021-03-10 14:16:58 -08:00

git-instaweb.sh

…

git-merge-octopus.sh

…

git-merge-one-file.sh

…

git-merge-resolve.sh

…

git-mergetool--lib.sh

Merge branch 'sh/mergetool-hideresolved'

2021-02-17 17:21:41 -08:00

git-mergetool.sh

mergetool: do not enable hideResolved by default

2021-03-13 15:30:29 -08:00

git-p4.py

git-p4: fix failed submit by skip non-text data files

2021-06-28 09:49:30 -07:00

git-quiltimport.sh

…

git-rebase--preserve-merges.sh

rebase: remove unused function reschedule_last_action

2020-08-12 12:25:42 -07:00

git-request-pull.sh

…

git-send-email.perl

Merge branch 'ga/send-email-sendmail-cmd'

2021-06-14 13:33:26 +09:00

git-sh-i18n.sh

tests: remove support for GIT_TEST_GETTEXT_POISON

2021-01-21 15:50:01 -08:00

git-sh-setup.sh

…

git-submodule.sh

submodule: remove unnecessary prefix based option logic

2021-06-28 20:28:19 -07:00

git-svn.perl

perl: check for perl warnings while running tests

2020-10-21 23:11:48 -07:00

GIT-VERSION-GEN

Git 2.32

2021-06-06 15:40:01 +09:00

git-web--browse.sh

…

git.c

Merge branch 'ps/config-env-option-with-separate-value'

2021-05-07 12:47:37 +09:00

git.rc

…

gpg-interface.c

gpg-interface: remove other signature headers before verifying

2021-02-10 23:35:42 -08:00

gpg-interface.h

gpg-interface: improve interface for parsing tags

2021-02-10 23:35:42 -08:00

graph.c

graph: improve grammar of "invalid color" error message

2021-06-15 12:54:26 +09:00

graph.h

…

grep.c

Merge branch 'rs/grep-parser-fix'

2021-07-13 16:52:53 -07:00

grep.h

grep/pcre2: move back to thread-only PCREv2 structures

2021-02-17 16:32:19 -08:00

hash-lookup.c

oid_pos(): access table through const pointers

2021-01-28 12:03:26 -08:00

hash-lookup.h

oid_pos(): access table through const pointers

2021-01-28 12:03:26 -08:00

hash.h

parallel-checkout: send the new object_id algo field to the workers

2021-05-18 05:38:54 +09:00

hashmap.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

hashmap.h

hashmap: provide deallocation function names

2020-11-02 12:15:50 -08:00

help.c

help.c: help.autocorrect=never means "do not compute suggestions"

2020-11-25 13:02:15 -08:00

help.h

help: do not expect built-in commands to be hardlinked

2020-10-07 15:25:10 -07:00

hex.c

hex: print objects using the hash algorithm member

2021-04-27 16:31:39 +09:00

http-backend.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

http-fetch.c

http-fetch: allow custom index-pack args

2021-02-22 12:07:40 -08:00

http-push.c

Merge branch 'bc/hash-transition-interop-part-1'

2021-05-10 16:59:46 +09:00

http-walker.c

Always use oidread to read into struct object_id

2021-04-27 16:31:38 +09:00

http.c

Merge branch 'cs/http-use-basic-after-failed-negotiate'

2021-05-21 05:49:41 +09:00

http.h

http: allow custom index-pack args

2021-02-22 12:07:40 -08:00

ident.c

Merge branch 'pw/rebase-i-more-options'

2020-09-03 12:37:01 -07:00

imap-send.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

INSTALL

INSTALL: note on using Asciidoctor to build doc

2021-03-19 10:49:20 -07:00

iterator.h

…

json-writer.c

*.c *_init(): define in terms of corresponding *_INIT macro

2021-07-01 12:32:22 -07:00

json-writer.h

*.h: move some *_INIT to designated initializers

2021-07-01 12:31:45 -07:00

khash.h

khash: clarify that allocations never fail

2021-07-06 13:07:50 -07:00

kwset.c

…

kwset.h

…

levenshtein.c

…

levenshtein.h

…

LGPL-2.1

…

line-log.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

line-log.h

…

line-range.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

line-range.h

…

linear-assignment.c

…

linear-assignment.h

…

list-objects-filter-options.c

help: fix small typo in error message

2021-05-21 07:52:10 +09:00

list-objects-filter-options.h

list-objects: implement object type filter

2021-04-19 14:09:11 -07:00

list-objects-filter.c

list-objects: implement object type filter

2021-04-19 14:09:11 -07:00

list-objects-filter.h

list-objects: support filtering by tag and commit

2021-04-12 09:35:50 -07:00

list-objects.c

bitmaps: don't recurse into trees already in the bitmap

2021-06-15 11:13:11 +09:00

list-objects.h

…

list.h

…

ll-merge.c

ll_union_merge(): rename path_unused parameter

2021-06-11 12:37:33 +09:00

ll-merge.h

…

lockfile.c

…

lockfile.h

…

log-tree.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

log-tree.h

format-patch: make output filename configurable

2020-11-09 17:44:41 -08:00

ls-refs.c

Merge branch 'jt/clone-unborn-head'

2021-02-17 17:21:40 -08:00

ls-refs.h

ls-refs: report unborn targets of symrefs

2021-02-05 13:49:53 -08:00

mailinfo.c

Merge branch 'ar/mailinfo-memcmp-to-skip-prefix'

2021-07-08 13:15:03 -07:00

mailinfo.h

am: learn to process quoted lines that ends with CRLF

2021-05-10 15:06:22 +09:00

mailmap.c

Merge branch 'jk/open-dotgitx-with-nofollow'

2021-03-22 14:00:22 -07:00

mailmap.h

shortlog: remove unused(?) "repo-abbrev" feature

2021-01-12 14:04:42 -08:00

Makefile

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

match-trees.c

Always use oidread to read into struct object_id

2021-04-27 16:31:38 +09:00

mem-pool.c

mem-pool: drop trailing semicolon from macro definition

2021-03-17 10:20:16 -07:00

mem-pool.h

mem-pool: use consistent pool variable name

2020-08-18 12:16:08 -07:00

merge-blobs.c

…

merge-blobs.h

…

merge-ort-wrappers.c

merge(s): apply consistent punctuation to "up to date" messages

2021-05-03 14:14:56 +09:00

merge-ort-wrappers.h

merge-ort-wrappers: new convience wrappers to mimic the old merge API

2020-10-26 22:36:14 -07:00

merge-ort.c

merge-ort: restart merge with cached renames to reduce process entry cost

2021-07-20 14:47:40 -07:00

merge-ort.h

merge-ort: avoid accidental API mis-use

2021-05-20 15:40:39 +09:00

merge-recursive.c

Merge branch 'ab/struct-init'

2021-07-16 17:42:53 -07:00

merge-recursive.h

merge-recursive: fix unclear and outright wrong comments

2020-08-02 11:03:57 -07:00

merge.c

dir.[ch]: replace dir_init() with DIR_INIT

2021-07-01 12:32:22 -07:00

mergesort.c

…

mergesort.h

…

midx.c

midx: report checksum mismatches during 'verify'

2021-06-28 20:36:17 -07:00

midx.h

pack-revindex: write multi-pack reverse indexes

2021-04-01 13:07:37 -07:00

name-hash.c

Merge branch 'ds/sparse-index-protections'

2021-04-30 13:50:26 +09:00

notes-cache.c

…

notes-cache.h

…

notes-merge.c

dir: introduce readdir_skip_dot_and_dotdot() helper

2021-05-27 14:02:37 +09:00

notes-merge.h

…

notes-utils.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

notes-utils.h

…

notes.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

notes.h

…

object-file.c

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

object-name.c

object-name.c: rename from sha1-name.c

2021-01-04 13:01:55 -08:00

object-store.h

packfile: add kept-pack cache for find_kept_pack_entry()

2021-02-22 23:30:52 -08:00

object.c

lookup_unknown_object(): take a repository argument

2021-04-13 13:18:46 -07:00

object.h

Merge branch 'jt/push-negotiation'

2021-05-16 21:05:22 +09:00

oid-array.c

oid_pos(): access table through const pointers

2021-01-28 12:03:26 -08:00

oid-array.h

oid-array: provide a for-loop iterator

2020-12-07 12:32:04 -08:00

oidmap.c

hashmap: provide deallocation function names

2020-11-02 12:15:50 -08:00

oidmap.h

…

oidset.c

blame: silently ignore invalid ignore file objects

2020-11-10 13:05:06 -08:00

oidset.h

blame: validate and peel the object names on the ignore list

2020-09-24 22:20:58 -07:00

pack-bitmap-write.c

oid_pos(): access table through const pointers

2021-01-28 12:03:26 -08:00

pack-bitmap.c

bitmaps: don't recurse into trees already in the bitmap

2021-06-15 11:13:11 +09:00

pack-bitmap.h

Merge branch 'ps/rev-list-object-type-filter'

2021-05-07 12:47:41 +09:00

pack-check.c

csum-file: introduce checksum_valid()

2021-06-28 20:36:17 -07:00

pack-objects.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

pack-objects.h

pack-objects: move static inline from a header to the sole consumer

2021-05-27 12:14:41 +09:00

pack-revindex.c

Merge branch 'tb/reverse-midx'

2021-04-08 13:23:25 -07:00

pack-revindex.h

pack-revindex: read multi-pack reverse indexes

2021-04-01 13:07:37 -07:00

pack-write.c

Merge branch 'tb/reverse-midx'

2021-04-08 13:23:25 -07:00

pack.h

Merge branch 'tb/reverse-midx'

2021-04-08 13:23:25 -07:00

packfile.c

xmmap: inform Linux users of tuning knobs on ENOMEM

2021-06-29 23:14:25 -07:00

packfile.h

packfile: introduce 'find_kept_pack_entry()'

2021-02-22 23:30:52 -08:00

pager.c

pager: avoid setting COLUMNS when we're guessing its value

2021-06-28 10:42:10 -07:00

parallel-checkout.c

parallel-checkout: send the new object_id algo field to the workers

2021-05-18 05:38:54 +09:00

parallel-checkout.h

parallel-checkout: support progress displaying

2021-04-19 11:57:05 -07:00

parse-options-cb.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

parse-options.c

parse-options: don't leak alias help messages

2021-03-21 14:39:10 -07:00

parse-options.h

parse-options: don't leak alias help messages

2021-03-21 14:39:10 -07:00

patch-delta.c

…

patch-ids.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

patch-ids.h

patch-ids: handle duplicate hashmap entries

2021-01-12 11:13:32 -08:00

path.c

t0060: test ntfs/hfs-obscured dotfiles

2021-05-04 11:52:02 +09:00

path.h

merge-ort: write $GIT_DIR/AUTO_MERGE whenever we hit a conflict

2021-03-20 12:35:40 -07:00

pathspec.c

Merge branch 'mt/add-rm-in-sparse-checkout'

2021-05-07 12:47:40 +09:00

pathspec.h

Merge branch 'ds/gender-neutral-doc'

2021-07-16 17:42:53 -07:00

pkt-line.c

pkt-line: do not report packet write errors twice

2021-04-15 15:05:31 -07:00

pkt-line.h

pkt-line: add options argument to read_packetized_to_strbuf()

2021-03-15 14:32:50 -07:00

preload-index.c

preload-index: log the number of lstat calls to trace2

2021-02-16 17:14:34 -08:00

pretty.c

pretty: provide human date format

2021-04-27 16:09:32 +09:00

pretty.h

Merge branch 'rs/pretty-describe'

2021-03-22 14:00:24 -07:00

prio-queue.c

…

prio-queue.h

…

progress.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

progress.h

…

promisor-remote.c

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

promisor-remote.h

promisor-remote: support per-repository config

2021-06-28 09:57:42 -07:00

prompt.c

…

prompt.h

…

protocol-caps.c

object-info: support for retrieving object info

2021-04-20 17:41:13 -07:00

protocol-caps.h

protocol-caps.h: add newline at end of file

2021-06-08 10:08:41 +09:00

protocol.c

protocol: re-enable v2 protocol by default

2020-09-25 11:40:42 -07:00

protocol.h

…

prune-packed.c

…

prune-packed.h

…

quote.c

quote: make sq_dequote_step() a public function

2021-01-12 12:03:18 -08:00

quote.h

quote: make sq_dequote_step() a public function

2021-01-12 12:03:18 -08:00

range-diff.c

Merge branch 'ab/pickaxe-pcre2'

2021-07-13 16:52:50 -07:00

range-diff.h

Merge branch 'js/range-diff-one-side-only'

2021-02-17 17:21:41 -08:00

reachable.c

Merge branch 'ps/rev-list-object-type-filter'

2021-05-07 12:47:41 +09:00

reachable.h

…

read-cache.c

Merge branch 'ew/mmap-failures'

2021-07-16 17:42:47 -07:00

README.md

…

rebase-interactive.c

rebase -i: clarify and fix 'fixup -c' rebase-todo help

2021-02-10 13:58:19 -08:00

rebase-interactive.h

…

rebase.c

…

rebase.h

…

ref-filter.c

Merge branch 'zh/ref-filter-push-remote-fix'

2021-05-20 08:55:00 +09:00

ref-filter.h

ref-filter: get rid of show_ref_array_item

2021-04-19 15:08:00 -07:00

reflog-walk.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

reflog-walk.h

…

refs.c

Merge branch 'hn/refs-iterator-peel-returns-boolean'

2021-07-16 17:42:49 -07:00

refs.h

Merge branch 'tb/ls-refs-optim'

2021-02-05 16:40:45 -08:00

refspec.c

Merge branch 'fc/atmark-in-refspec'

2020-12-14 10:21:36 -08:00

refspec.h

Merge branch 'sb/clone-origin'

2020-10-27 15:09:50 -07:00

RelNotes

The first batch post Git 2.32

2021-06-10 12:04:27 +09:00

remote-curl.c

remote-curl: fix clone on sha256 repos

2021-05-12 12:14:44 +09:00

remote.c

*: fix typos which duplicate a word

2021-06-14 10:16:06 +09:00

remote.h

Merge branch 'jt/clone-unborn-head'

2021-02-17 17:21:40 -08:00

replace-object.c

…

replace-object.h

…

repo-settings.c

sparse-index: add index.sparse config option

2021-03-30 12:57:47 -07:00

repository.c

promisor-remote: support per-repository config

2021-06-28 09:57:42 -07:00

repository.h

promisor-remote: support per-repository config

2021-06-28 09:57:42 -07:00

rerere.c

dir: introduce readdir_skip_dot_and_dotdot() helper

2021-05-27 14:02:37 +09:00

rerere.h

…

reset.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

reset.h

…

resolve-undo.c

Merge branch 'ds/sparse-index-protections'

2021-04-30 13:50:26 +09:00

resolve-undo.h

…

revision.c

add_pending_object_with_path(): work around "gcc -O3" complaint

2021-06-11 12:45:37 +09:00

revision.h

Merge branch 'jk/bitmap-tree-optim'

2021-07-08 13:15:00 -07:00

run-command.c

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

run-command.h

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

SECURITY.md

SECURITY: describe how to report vulnerabilities

2021-03-27 15:13:02 -07:00

send-pack.c

send-pack.c: move "no refs in common" abort earlier

2021-06-30 14:57:22 -07:00

send-pack.h

…

sequencer.c

Merge branch 'bc/hash-transition-interop-part-1'

2021-05-10 16:59:46 +09:00

sequencer.h

sequencer: fix edit handling for cherry-pick and revert messages

2021-03-31 14:10:50 -07:00

serve.c

object-info: support for retrieving object info

2021-04-20 17:41:13 -07:00

serve.h

…

server-info.c

use CALLOC_ARRAY

2021-03-13 16:00:09 -08:00

setup.c

Merge branch 'jt/partial-clone-submodule-1'

2021-07-16 17:42:53 -07:00

sh-i18n--envsubst.c

builtins + test helpers: use return instead of exit() in cmd_*

2021-06-09 09:15:58 +09:00

sha1dc_git.c

…

sha1dc_git.h

…

shallow.c

commit_graft_pos(): take an oid instead of a bare hash

2021-01-28 11:21:07 -08:00

shallow.h

…

shell.c

builtins + test helpers: use return instead of exit() in cmd_*

2021-06-09 09:15:58 +09:00

shortlog.h

shortlog: remove unused(?) "repo-abbrev" feature

2021-01-12 14:04:42 -08:00

sideband.c

sideband: don't lose clear-to-eol at packet boundary

2021-06-17 14:11:36 +09:00

sideband.h

sideband: diagnose more sideband anomalies

2020-10-29 09:23:29 -07:00

sigchain.c

…

sigchain.h

…

simple-ipc.h

simple-ipc: correct ifdefs when NO_PTHREADS is defined

2021-05-21 07:55:00 +09:00

sparse-index.c

Merge branch 'ds/sparse-index-protections'

2021-05-21 05:50:38 +09:00

sparse-index.h

sparse-index: expand_to_path()

2021-04-14 13:47:54 -07:00

split-index.c

split-index: use oideq instead of memcmp to compare object_id's

2021-06-15 12:07:56 +09:00

split-index.h

…

stable-qsort.c

…

strbuf.c

*.c *_init(): define in terms of corresponding *_INIT macro

2021-07-01 12:32:22 -07:00

strbuf.h

comments: avoid using the gender of our users

2021-06-16 11:25:11 +09:00

streaming.c

streaming.c: move {open,close,read} from vtable to "struct git_istream"

2021-05-06 12:56:10 +09:00

streaming.h

…

string-list.c

string-list.[ch]: add a string_list_init_{nodup,dup}()

2021-07-01 12:32:22 -07:00

string-list.h

string-list.[ch]: add a string_list_init_{nodup,dup}()

2021-07-01 12:32:22 -07:00

strmap.c

*.c *_init(): define in terms of corresponding *_INIT macro

2021-07-01 12:32:22 -07:00

strmap.h

strmap: make callers of strmap_remove() to call it in void context

2020-12-15 15:30:44 -08:00

strvec.c

*.c *_init(): define in terms of corresponding *_INIT macro

2021-07-01 12:32:22 -07:00

strvec.h

strvec: rename struct fields

2020-07-30 19:18:06 -07:00

sub-process.c

…

sub-process.h

…

submodule-config.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

submodule-config.h

…

submodule.c

run-command: refactor subprocess env preparation

2021-06-28 09:58:01 -07:00

submodule.h

*: remove 'const' qualifier for struct index_state

2021-04-14 13:46:00 -07:00

symlinks.c

checkout: don't follow symlinks when removing entries

2021-03-18 12:58:10 -07:00

tag.c

gpg-interface: improve interface for parsing tags

2021-02-10 23:35:42 -08:00

tag.h

…

tar.h

…

tempfile.c

…

tempfile.h

…

thread-utils.c

…

thread-utils.h

…

tmp-objdir.c

packfile: prepare for the existence of '*.rev' files

2021-01-25 18:32:43 -08:00

tmp-objdir.h

…

trace2.c

trace2: add a public function for getting the SID

2020-11-11 18:26:52 -08:00

trace2.h

trace2: add a public function for getting the SID

2020-11-11 18:26:52 -08:00

trace.c

…

trace.h

…

trailer.c

trailer: add new .cmd config option

2021-05-04 12:09:43 +09:00

trailer.h

pretty format %(trailers): add a "key_value_separator"

2020-12-09 14:16:42 -08:00

transport-helper.c

fetch: teach independent negotiation (no packfile)

2021-05-05 10:41:29 +09:00

transport-internal.h

connect, transport: encapsulate arg in struct

2021-02-05 13:49:54 -08:00

transport.c

Merge branch 'ab/struct-init'

2021-07-16 17:42:53 -07:00

transport.h

fetch: teach independent negotiation (no packfile)

2021-05-05 10:41:29 +09:00

tree-diff.c

hash: provide per-algorithm null OIDs

2021-04-27 16:31:39 +09:00

tree-walk.c

Always use oidread to read into struct object_id

2021-04-27 16:31:38 +09:00

tree-walk.h

…

tree.c

tree.h API: simplify read_tree_recursive() signature

2021-03-20 16:09:26 -07:00

tree.h

tree.h API: simplify read_tree_recursive() signature

2021-03-20 16:09:26 -07:00

unicode-width.h

…

unimplemented.sh

…

unix-socket.c

unix-socket: disallow chdir() when creating unix domain sockets

2021-03-15 14:32:51 -07:00

unix-socket.h

unix-socket: disallow chdir() when creating unix domain sockets

2021-03-15 14:32:51 -07:00

unix-stream-server.c

unix-stream-server: create unix domain socket under lock

2021-03-15 14:32:51 -07:00

unix-stream-server.h

unix-stream-server: create unix domain socket under lock

2021-03-15 14:32:51 -07:00

unpack-trees.c

Merge branch 'mt/parallel-checkout-part-3'

2021-05-16 21:05:23 +09:00

unpack-trees.h

stash show: teach --include-untracked and --only-untracked

2021-03-05 14:31:26 -08:00

upload-pack.c

Merge branch 'jt/push-negotiation'

2021-05-16 21:05:22 +09:00

upload-pack.h

…

url.c

…

url.h

…

urlmatch.c

…

urlmatch.h

…

usage.c

usage.c: don't copy/paste the same comment three times

2021-04-13 14:56:28 -07:00

userdiff.c

userdiff: add support for C# record types

2021-06-16 17:06:20 +09:00

userdiff.h

userdiff: add and use for_each_userdiff_driver()

2021-04-08 12:19:10 -07:00

utf8.c

t0060: test ntfs/hfs-obscured dotfiles

2021-05-04 11:52:02 +09:00

utf8.h

t0060: test ntfs/hfs-obscured dotfiles

2021-05-04 11:52:02 +09:00

varint.c

…

varint.h

…

version.c

…

version.h

…

versioncmp.c

…

walker.c

lookup_unknown_object(): take a repository argument

2021-04-13 13:18:46 -07:00

walker.h

…

wildmatch.c

…

wildmatch.h

…

worktree.c

dir: introduce readdir_skip_dot_and_dotdot() helper

2021-05-27 14:02:37 +09:00

worktree.h

worktree: teach worktree to lazy-load "prunable" reason

2021-01-30 09:57:16 -08:00

wrap-for-bin.sh

…

wrapper.c

add open_nofollow() helper

2021-02-16 09:41:32 -08:00

write-or-die.c

fsync(): be prepared to see EINTR

2021-06-05 22:13:40 +09:00

ws.c

…

wt-status.c

Merge branch 'ds/gender-neutral-doc'

2021-07-16 17:42:53 -07:00

wt-status.h

branch: sort detached HEAD based on a flag

2021-01-07 15:13:21 -08:00

xdiff-interface.c

Merge branch 'ab/pickaxe-pcre2'

2021-07-13 16:52:50 -07:00

xdiff-interface.h

xdiff-interface: replace discard_hunk_line() with a flag

2021-05-11 12:47:31 +09:00

zlib.c

…

README.md

Git - fast, scalable, distributed revision control system

Git is a fast, scalable, distributed revision control system with an unusually rich command set that provides both high-level operations and full access to internals.

Git is an Open Source project covered by the GNU General Public License version 2 (some parts of it are under different licenses, compatible with the GPLv2). It was originally written by Linus Torvalds with help of a group of hackers around the net.

Please read the file INSTALL for installation instructions.

Many Git online resources are accessible from https://git-scm.com/ including full documentation and Git related tools.

See Documentation/gittutorial.txt to get started, then see Documentation/giteveryday.txt for a useful minimum set of commands, and Documentation/git-<commandname>.txt for documentation of each command. If git has been correctly installed, then the tutorial can also be read with man gittutorial or git help tutorial, and the documentation of each command with man git-<commandname> or git help <commandname>.

CVS users may also want to read Documentation/gitcvs-migration.txt (man gitcvs-migration or git help cvs-migration if git is installed).

The user discussion and development of Git take place on the Git mailing list -- everyone is welcome to post bug reports, feature requests, comments and patches to git@vger.kernel.org (read Documentation/SubmittingPatches for instructions on patch submission). To subscribe to the list, send an email with just "subscribe git" in the body to majordomo@vger.kernel.org. The mailing list archives are available at https://lore.kernel.org/git/, http://marc.info/?l=git and other archival sites.

Issues which are security relevant should be disclosed privately to the Git Security mailing list git-security@googlegroups.com.

The maintainer frequently sends the "What's cooking" reports that list the current status of various development topics to the mailing list. The discussion following them give a good reference for project status, development direction and remaining tasks.

The name "git" was given by Linus Torvalds when he wrote the very first version. He described the tool as "the stupid content tracker" and the name as (depending on your mood):

random three-letter combination that is pronounceable, and not actually used by any common UNIX command. The fact that it is a mispronunciation of "get" may or may not be relevant.
stupid. contemptible and despicable. simple. Take your pick from the dictionary of slang.
"global information tracker": you're in a good mood, and it actually works for you. Angels sing, and a light suddenly fills the room.
"goddamn idiotic truckload of sh*t": when it breaks

Languages

C 50.5%

Shell 38.8%

Perl 4.4%

Tcl 3.2%

Python 0.8%

Other 2.1%