global/git

mirror of https://github.com/git/git.git synced 2026-01-24 09:30:22 +00:00

Author	SHA1	Message	Date
Jeff Hostetler	d05c046536	add: use preload-index and fscache for performance Teach "add" to use preload-index and fscache features to improve performance on very large repositories. During an "add", a call is made to run_diff_files() which calls check_remove() for each index-entry. This calls lstat(). On Windows, the fscache code intercepts the lstat() calls and builds a private cache using the FindFirst/FindNext routines, which are much faster. Somewhat independent of this, is the preload-index code which distributes some of the start-up costs across multiple threads. We need to keep the call to read_cache() before parsing the pathspecs (and hence cannot use the pathspecs to limit any preload) because parse_pathspec() is using the index to determine whether a pathspec is, in fact, in a submodule. If we would not read the index first, parse_pathspec() would not error out on a path that is inside a submodule, and t7400-submodule-basic.sh would fail with not ok 47 - do not add files from a submodule We still want the nice preload performance boost, though, so we simply call read_cache_preload(&pathspecs) after parsing the pathspecs. Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com> Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:37 +01:00
Johannes Schindelin	7bf7d35303	Export the preload_index() function The purpose of this function is to stat() the files listed in the index in a multi-threaded fashion. It is called directly after reading the index in the read_index_preloaded() function. However, in some cases we may want to separate the index reading from the preloading step, e.g. in builtin/add.c, where we need to load the index before we parse the pathspecs (which needs to error out if one of the pathspecs refers to a path within a submodule, for which the index must have been read already), and only then will we want to preload, possibly limited by the just-parsed pathspecs. So let's just export that function to allow calling it separately. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:37 +01:00
Karsten Blees	697fe2fbe0	fscache: load directories only once If multiple threads access a directory that is not yet in the cache, the directory will be loaded by each thread. Only one of the results is added to the cache, all others are leaked. This wastes performance and memory. On cache miss, add a future object to the cache to indicate that the directory is currently being loaded. Subsequent threads register themselves with the future object and wait. When the first thread has loaded the directory, it replaces the future object with the result and notifies waiting threads. Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	c12404dbd0	Win32: add a cache below mingw's lstat and dirent implementations Checking the work tree status is quite slow on Windows, due to slow lstat emulation (git calls lstat once for each file in the index). Windows operating system APIs seem to be much better at scanning the status of entire directories than checking single files. Add an lstat implementation that uses a cache for lstat data. Cache misses read the entire parent directory and add it to the cache. Subsequent lstat calls for the same directory are served directly from the cache. Also implement opendir / readdir / closedir so that they create and use directory listings in the cache. The cache doesn't track file system changes and doesn't plug into any modifying file APIs, so it has to be explicitly enabled for git functions that don't modify the working copy. Note: in an earlier version of this patch, the cache was always active and tracked file system changes via ReadDirectoryChangesW. However, this was much more complex and had negative impact on the performance of modifying git commands such as 'git checkout'. Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	e0a4995c97	add infrastructure for read-only file system level caches Add a macro to mark code sections that only read from the file system, along with a config option and documentation. This facilitates implementation of relatively simple file system level caches without the need to synchronize with the file system. Enable read-only sections for 'git status' and preload_index. Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	7e6a5a6d26	Win32: make the lstat implementation pluggable Emulating the POSIX lstat API on Windows via GetFileAttributes[Ex] is quite slow. Windows operating system APIs seem to be much better at scanning the status of entire directories than checking single files. A caching implementation may improve performance by bulk-reading entire directories or reusing data obtained via opendir / readdir. Make the lstat implementation pluggable so that it can be switched at runtime, e.g. based on a config option. Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	d353360b3c	Win32: Make the dirent implementation pluggable Emulating the POSIX dirent API on Windows via FindFirstFile/FindNextFile is pretty straightforward, however, most of the information provided in the WIN32_FIND_DATA structure is thrown away in the process. A more sophisticated implementation may cache this data, e.g. for later reuse in calls to lstat. Make the dirent implementation pluggable so that it can be switched at runtime, e.g. based on a config option. Define a base DIR structure with pointers to readdir/closedir that match the opendir implementation (i.e. similar to vtable pointers in OOP). Define readdir/closedir so that they call the function pointers in the DIR structure. This allows choosing the opendir implementation on a call-by-call basis. Move the fixed sized dirent.d_name buffer to the dirent-specific DIR structure, as d_name may be implementation specific (e.g. a caching implementation may just set d_name to point into the cache instead of copying the entire file name string). Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	7d701621ad	Win32: dirent.c: Move opendir down Move opendir down in preparation for the next patch. Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:20 +01:00
Karsten Blees	0faa5f07b3	Win32: make FILETIME conversion functions public Signed-off-by: Karsten Blees <blees@dcon.de>	2018-01-17 21:38:19 +01:00
Johannes Schindelin	3a5f02ea24	mingw: unset PERL5LIB by default Git for Windows ships with its own Perl interpreter, and insists on using it, so it will most likely wreak havoc if PERL5LIB is set before launching Git. Let's just unset that environment variable when spawning processes. To make this feature extensible (and overrideable), there is a new config setting `core.unsetenvvars` that allows specifying a comma-separated list of names to unset before spawning processes. Reported by Gabriel Fuhrmann. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:19 +01:00
Johannes Schindelin	30aa7911e4	Move Windows-specific config settings into compat/mingw.c Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:19 +01:00
Johannes Schindelin	07b463e5cc	Allow for platform-specific core.* config settings In the Git for Windows project, we have ample precedent for config settings that apply to Windows, and to Windows only. Let's formalize this concept by introducing a platform_core_config() function that can be #define'd in a platform-specific manner. This will allow us to contain platform-specific code better, as the corresponding variables no longer need to be exported so that they can be defined in environment.c and be set in config.c. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:19 +01:00
Johannes Schindelin	cf94acdf1a	config: rename `dummy` parameter to `cb` in git_default_config() This is the convention elsewhere (and prepares for the case where we may need to pass callback data). Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:38:19 +01:00
Johannes Schindelin	9940e9b0c6	Start the merging-rebase of b9451fb46a1..4584fb7323e onto v2.16.0 Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-17 21:37:53 +01:00
Junio C Hamano	2512f15446	Git 2.16 Signed-off-by: Junio C Hamano <gitster@pobox.com> v2.16.0	2018-01-17 13:06:51 -08:00
Johannes Schindelin	4584fb7323	Merge pull request #1427 from atetubou/reset_fscache reset.c: enable fscache	2018-01-17 19:43:01 +01:00
Junio C Hamano	e0d575025a	Merge tag 'l10n-2.16.0-rnd2' of git://github.com/git-l10n/git-po l10n for Git 2.16.0 round 2 * tag 'l10n-2.16.0-rnd2' of git://github.com/git-l10n/git-po: (24 commits) l10n: de.po: translate 72 new messages l10n: de.po: improve messages when a branch starts to track another ref l10n: bg.po: Updated Bulgarian translation (3288t) l10n: TEAMS: add zh_CN team members l10n: zh_CN: for git v2.16.0 l10n round 2 l10n: sv.po: Update Swedish translation (3288t0f0u) l10n: ru.po: update Russian translation l10n: TEAMS: Add ko team members l10n: ko.po: Update Korean translation l10n: fr.po 2.16 round 2 l10n: es.po: Spanish translation 2.16.0 round 2 l10n: vi.po(3288t): Updated Vietnamese translation for v2.16.0 round 2 l10n: git.pot: v2.16.0 round 2 (8 new, 4 removed) l10n: es.po: Update Spanish Translation v2.16.0 l10n: fr.po v2.16.0 round 1 l10n: bg.po: Updated Bulgarian translation (3284t) l10n: sv.po: Update Swedish translation (3284t0f0u) l10n: fr.po: "worktree list" mistranslated as prune l10n: git.pot: v2.16.0 round 1 (64 new, 25 removed) l10n: fixes to German translation ...	2018-01-16 14:49:58 -08:00
Takuto Ikuta	ce7699f6ef	reset.c: enable fscache In git reset --hard, unpack-trees() is called with oneway_merge(). oneway_merge calls lstat for each file in a repository. This is a bottleneck of git reset --hard, especially in a large repository. This patch improves the time by using fscache. In the chromium repository, the time of git reset --hard changed as shown below. I took stats 3 times in the repository. master: TotalSeconds: 21.0337971 TotalSeconds: 20.0046612 TotalSeconds: 20.6501752 Avg: 20.5628778333333 this patch: TotalSeconds: 4.8552376 TotalSeconds: 4.8722343 TotalSeconds: 4.9268245 Avg: 4.88476546666667 Signed-off-by: Takuto Ikuta <tikuta@chromium.org>	2018-01-16 21:33:47 +09:00
Ralf Thielow	c9741bb98e	l10n: de.po: translate 72 new messages Translate 72 new messages came from git.pot update in `18a907225` (l10n: git.pot: v2.16.0 round 1 (64 new, 25 removed)) and `005c62fe4` (l10n: git.pot: v2.16.0 round 2 (8 new, 4 removed)). Signed-off-by: Ralf Thielow <ralf.thielow@gmail.com> Acked-by: Matthias Rüster <matthias.ruester@gmail.com>	2018-01-15 07:47:30 +01:00
Ralf Thielow	31eaa14e81	l10n: de.po: improve messages when a branch starts to track another ref Signed-off-by: Ralf Thielow <ralf.thielow@gmail.com>	2018-01-15 07:47:30 +01:00
SZEDER Gábor	0c37383f2e	RelNotes: minor typofix Signed-off-by: SZEDER Gábor <szeder.dev@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2018-01-12 10:40:42 -08:00
Johannes Schindelin	d0192a7c4e	Merge pull request #1426 from atetubou/fetch_pack fetch-pack.c: enable fscache for stats under .git/objects	2018-01-12 14:48:19 +01:00
Takuto Ikuta	0323774fe2	fetch-pack.c: enable fscache for stats under .git/objects When I do git fetch, git calls file stats under .git/objects for each ref. This takes time when there are many refs. By enabling fscache, git takes file stats by directory traversal, which improved the speed of fetch-pack for repositories having a large number of refs. On my Windows workstation, this improves the time of `git fetch` for the chromium repository as shown below. I took stats 3 times. * With this patch TotalSeconds: 9.9825165 TotalSeconds: 9.1862075 TotalSeconds: 10.1956256 Avg: 9.78811653333333 * Without this patch TotalSeconds: 15.8406702 TotalSeconds: 15.6248053 TotalSeconds: 15.2085938 Avg: 15.5580231 Signed-off-by: Takuto Ikuta <tikuta@chromium.org>	2018-01-12 17:04:11 +09:00
Junio C Hamano	c6c75c93aa	Git 2.16-rc2 Signed-off-by: Junio C Hamano <gitster@pobox.com> v2.16.0-rc2	2018-01-11 13:20:41 -08:00
Junio C Hamano	ba82fdaea3	Merge branch 'jh/object-filtering' Hotfix for a topic already in 'master'. * jh/object-filtering: oidset: don't return value from oidset_init	2018-01-11 13:16:37 -08:00
Junio C Hamano	453f3fec59	Merge branch 'tg/worktree-create-tracking' Doc hotfix. * tg/worktree-create-tracking: Documentation/git-worktree.txt: add missing `	2018-01-11 13:16:36 -08:00
Junio C Hamano	91ec08a078	Merge branch 'js/test-with-ws-in-path' Hot fix to a test. * js/test-with-ws-in-path: t3900: add some more quotes	2018-01-11 13:16:36 -08:00
Alexander Shopov	1b6d5e83b6	l10n: bg.po: Updated Bulgarian translation (3288t) Signed-off-by: Alexander Shopov <ash@kambanaria.org>	2018-01-11 22:02:02 +01:00
Ralf Thielow	50fdf7b1b1	Documentation/git-worktree.txt: add missing ` Signed-off-by: Ralf Thielow <ralf.thielow@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2018-01-11 12:19:40 -08:00
Beat Bolli	36a6f49cc3	t3900: add some more quotes In `89a70b80` ("t0302 & t3900: add forgotten quotes", 2018-01-03), quotes were added to protect against spaces in $HOME. In the test_when_finished command, two files are deleted which must be quoted individually. [jc: with \$HOME in the test_when_finished command quoted, as pointed out by j6t]. Signed-off-by: Beat Bolli <dev+git@drbeat.li> Helped-by: Johannes Sixt <j6t@kdbg.org> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2018-01-10 15:07:26 -08:00
Junio C Hamano	650b103706	RelNotes update before -rc2 Signed-off-by: Junio C Hamano <gitster@pobox.com>	2018-01-10 14:01:50 -08:00
Junio C Hamano	fac910641a	Merge branch 'js/perl-path-workaround-in-tests' * js/perl-path-workaround-in-tests: mingw: handle GITPERLLIB in t0021 in a Windows-compatible way	2018-01-10 14:01:31 -08:00
Junio C Hamano	a466ef018e	Merge branch 'ew/empty-merge-with-dirty-index' "git merge -s recursive" did not correctly abort when the index is dirty, if the merged tree happened to be the same as the current HEAD, which has been fixed. * ew/empty-merge-with-dirty-index: merge-recursive: do not look at the index during recursive merge	2018-01-10 14:01:25 -08:00
Junio C Hamano	4cc676c46c	Merge branch 'ma/bisect-leakfix' A hotfix for a recent update that broke 'git bisect'. * ma/bisect-leakfix: bisect: fix a regression causing a segfault	2018-01-10 14:01:25 -08:00
Junio C Hamano	bc4efaf103	Merge branch 'js/fix-merge-arg-quoting-in-rebase-p' "git rebase -p -X<option>" did not propagate the option properly down to underlying merge strategy backend. * js/fix-merge-arg-quoting-in-rebase-p: rebase -p: fix quoting when calling `git merge`	2018-01-10 14:01:24 -08:00
Johannes Schindelin	3306f6524d	mingw: handle GITPERLLIB in t0021 in a Windows-compatible way Git's assumption that all path lists are colon-separated is not only wrong on Windows, it is not even an assumption that is compatible with POSIX. In the interest of time, let's not try to fix this properly but simply work around the obvious breakage on Windows, where the MSYS2 Bash used by Git for Windows to interpret the Git's Unix shell scripts will automagically convert path lists in the environment to semicolon-separated lists of Windows paths (with drive letter and the corresponding colon and all that jazz). In other words, we simply look whether there is a semicolon in GITPERLLIB and split by semicolons if found instead of colons. This is not fool-proof, of course, as the path list could consist of a single path. But that is not the case in Git for Windows' test suite, there are always two paths in GITPERLLIB. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2018-01-10 14:00:54 -08:00
Johannes Schindelin	f80bd10f48	Merge branch 'no-ahead-behind-v5' Especially in huge code bases with fast-moving `master`, it can be prohibitively expensive to calculate whether an upstream branch of a local branch is ahead, behind or diverged. This topic branch introduces a set of flags to avoid that computation when we're not even interested in it to begin with. This merge commit takes the feature early, therefore it is marked experimental. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-10 20:53:35 +01:00
Jeff Hostetler	348108d883	status: support --no-ahead-behind in long format Teach long (normal) status format to respect the --no-ahead-behind parameter and skip the possibly expensive ahead/behind computation between the branch and the upstream. Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com> Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-10 11:48:29 +01:00
Jeff Hostetler	b474398b9c	status: update short status to respect --no-ahead-behind Teach "git status --short --branch" to respect "--no-ahead-behind" parameter to skip computing ahead/behind counts for the branch and its upstream and just report '[different]'. Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com> Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-10 11:48:29 +01:00
Jeff Hostetler	7ab11fd214	status: add --[no-]ahead-behind to status and commit for V2 format. Teach "git status" and "git commit" to accept "--no-ahead-behind" and "--ahead-behind" arguments to request quick or full ahead/behind reporting. When "--no-ahead-behind" is given, the existing porcelain V2 line "branch.ab +x -y" is replaced with a new "branch.ab +? -?" line. This indicates that the branch and its upstream are or are not equal without the expense of computing the full ahead/behind values. Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com> Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-10 11:48:29 +01:00
Jeff Hostetler	c2efcf5805	stat_tracking_info: return +1 when branches not equal Extend stat_tracking_info() to return +1 when branches are not equal and to take a new "enum ahead_behind_flags" argument to allow skipping the (possibly expensive) ahead/behind computation. This will be used in the next commit to allow "git status" to avoid full ahead/behind calculations for performance reasons. Signed-off-by: Jeff Hostetler <jeffhost@microsoft.com> Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2018-01-10 11:48:28 +01:00
Jiang Xin	0d08328dd8	l10n: TEAMS: add zh_CN team members Add Fangyi Zhou to zh_CN l10n team members. Signed-off-by: Jiang Xin <worldhello.net@gmail.com>	2018-01-10 11:31:55 +08:00
Jiang Xin	5809aa05f7	l10n: zh_CN: for git v2.16.0 l10n round 2 Translate 72 messages (3288t0f0u) for git v2.16.0-rc1. Signed-off-by: Jiang Xin <worldhello.net@gmail.com> Reviewed-by: 依云 <lilydjwg@gmail.com> Reviewed-by: Fangyi Zhou <fangyi.zhou@yuriko.moe>	2018-01-10 11:31:32 +08:00
Jiang Xin	dfb5c4c15b	Merge branch 'master' of git://github.com/nafmo/git-l10n-sv * 'master' of git://github.com/nafmo/git-l10n-sv: l10n: sv.po: Update Swedish translation (3288t0f0u)	2018-01-10 11:30:04 +08:00
Jiang Xin	45498f08b6	Merge branch 'russian-l10n' of https://github.com/DJm00n/git-po-ru * 'russian-l10n' of https://github.com/DJm00n/git-po-ru: l10n: ru.po: update Russian translation	2018-01-10 11:28:56 +08:00
Junio C Hamano	6366dd9000	Merge branch 'jk/doc-diff-options' Doc update. * jk/doc-diff-options: docs/diff-options: clarify scope of diff-filter types	2018-01-09 14:32:57 -08:00
Junio C Hamano	4e51984e82	Merge branch 'bw/protocol-v1' Test fix for a topic already in 'master'. * bw/protocol-v1: http: fix v1 protocol tests with apache httpd < 2.4	2018-01-09 14:32:56 -08:00
Junio C Hamano	14c84cd55b	Merge branch 'sg/travis-check-untracked' * sg/travis-check-untracked: travis-ci: check that all build artifacts are .gitignore-d travis-ci: don't store P4 and Git LFS in the working tree	2018-01-09 14:32:55 -08:00
Junio C Hamano	d702d5c5bd	Merge branch 'js/test-with-ws-in-path' Test fixes. * js/test-with-ws-in-path: t0302 & t3900: add forgotten quotes Allow the test suite to pass in a directory whose name contains spaces	2018-01-09 14:32:55 -08:00
Junio C Hamano	e6932248fc	Merge branch 'bc/submitting-patches-in-asciidoc' Doc readability update. * bc/submitting-patches-in-asciidoc: doc/SubmittingPatches: improve text formatting	2018-01-09 14:32:54 -08:00
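
The fscache commits in this log (c12404dbd0 in particular) describe an lstat cache in which a cache miss reads the entire parent directory, and later lstat calls for the same directory are served from the cache. The following is a minimal Python sketch of that idea, not Git's actual C implementation. The class name `DirStatCache` and its `scans` counter are hypothetical, introduced only for illustration. The sketch ignores case-insensitive file names and never invalidates entries, so, like the cache described in the commit message, it is only appropriate for code paths that do not modify the working tree.

```python
import os
import tempfile


class DirStatCache:
    """Toy read-only lstat cache: one directory scan serves many lookups."""

    def __init__(self):
        self._dirs = {}  # directory path -> {entry name: os.stat_result}
        self.scans = 0   # number of directory scans done (illustration only)

    def lstat(self, path):
        parent, name = os.path.split(os.path.abspath(path))
        entries = self._dirs.get(parent)
        if entries is None:
            # Cache miss: read the whole parent directory in one pass.
            with os.scandir(parent) as it:
                entries = {e.name: e.stat(follow_symlinks=False) for e in it}
            self._dirs[parent] = entries
            self.scans += 1
        try:
            # Cache hit (or freshly loaded directory): no further system call.
            return entries[name]
        except KeyError:
            raise FileNotFoundError(path) from None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        for name in ("a.txt", "b.txt", "c.txt"):
            with open(os.path.join(workdir, name), "w") as fh:
                fh.write(name)
        cache = DirStatCache()
        for name in ("a.txt", "b.txt", "c.txt"):
            cache.lstat(os.path.join(workdir, name))
        print("directory scans:", cache.scans)
```

Three lookups in the same directory trigger a single scan. `os.scandir` is used here because it returns per-entry metadata gathered during the directory read, which loosely mirrors the bulk FindFirst/FindNext approach the commit messages mention.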

80298 Commits