Junio C Hamano [Thu, 16 Feb 2006 09:57:39 +0000 (01:57 -0800)]
Merge branch 'lt/merge-tree'
* lt/merge-tree:
git-merge-tree: generalize the "traverse <n> trees in sync" functionality
Handling large files with GIT
Handling large files with GIT
Junio C Hamano [Thu, 16 Feb 2006 09:57:33 +0000 (01:57 -0800)]
Merge branch 'jc/topo'
* jc/topo:
topo-order: make --date-order optional.
Linus Torvalds [Thu, 16 Feb 2006 03:25:32 +0000 (19:25 -0800)]
git-merge-tree: generalize the "traverse <n> trees in sync" functionality
It's actually very useful for other things too. Notably, we could do the
combined diff a lot more efficiently with this.
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Linus Torvalds [Wed, 15 Feb 2006 02:33:02 +0000 (18:33 -0800)]
Handling large files with GIT
On Tue, 14 Feb 2006, Linus Torvalds wrote:
>
> Here, btw, is the trivial diff to turn my previous "tree-resolve" into a
> "resolve tree relative to the current branch".
Gaah. It was trivial, and it happened to work fine for my test-case, but
when I started looking at not doing that extremely aggressive subdirectory
merging, that showed a few other issues...
So in case people want to try, here's a third patch. Oh, and it's against
my _original_ patch, not incremental to the middle one (ie both patches two
and three are against patch #1, it's not a nice series).
Now I'm really done, and won't be sending out any more patches today.
Sorry for the noise.
Linus
Signed-off-by: Junio C Hamano <junkio@cox.net>
Linus Torvalds [Wed, 15 Feb 2006 02:05:30 +0000 (18:05 -0800)]
Handling large files with GIT
On Tue, 14 Feb 2006, Junio C Hamano wrote:
> Linus Torvalds <torvalds@osdl.org> writes:
>
> > If somebody is interested in making the "lots of filename changes" case go
> > fast, I'd be more than happy to walk them through what they'd need to
> > change. I'm just not horribly motivated to do it myself. Hint, hint.
>
> In case anybody is wondering, I share the same feeling. I
> cannot say I'd be "more than happy to" clean up potential
> breakages during the development of such changes, but if the
> change eventually would help certain use cases, I can be
> persuaded to help debugging such a mess ;-).
Actually, I got interested in seeing how hard this is, and wrote a simple
first cut at doing a tree-optimized merger.
Let me shout a bit first:
THIS IS WORKING CODE, BUT BE CAREFUL: IT'S A TECHNOLOGY DEMONSTRATION
RATHER THAN THE FINAL PRODUCT!
With that out of the way, let me describe what this does (and then describe
the missing parts).
This is basically a three-way merge that works entirely on the "tree"
level, rather than on the index. A lot of the _concepts_ are the same,
though, and if you're familiar with the results of an index merge, some of
the output will make more sense.
You give it three trees: the base tree (tree 0), and the two branches to
be merged (tree 1 and tree 2 respectively). It will then walk these three
trees, and resolve them as it goes along.
The interesting part is:
- it can resolve whole sub-directories in one go, without actually even
looking recursively at them. A whole subdirectory will resolve the same
way as any individual files will (although that may need some
modification, see later).
- if it has a "content conflict", for subdirectories that means "try to
do a recursive tree merge", while for non-subdirectories it's just a
content conflict and we'll output the stage 1/2/3 information.
- a successful merge will output a single stage 0 ("merged") entry,
potentially for a whole subdirectory.
- it outputs all the resolve information on stdout, so something like the
recursive resolver can pretty easily parse it all.
Now, the caveats:
- we probably need to be more careful about subdirectory resolves. The
trivial case (both branches have the exact same subdirectory) is a
trivial resolve, but the other cases ("branch1 matches base, branch2 is
different") probably can't be silently just resolved to the "branch2"
subdirectory state, since it might involve renames into - or out of -
that subdirectory.
- we do not track the current index file at all, so this does not do the
"check that index matches branch1" logic that the three-way merge in
git-read-tree does. The theory is that we'd do a full three-way merge
(ignoring the index and working directory), and then to update the
working tree, we'd do a two-way "git-read-tree branch1->result"
- I didn't actually make it do all the trivial resolve cases that
git-read-tree does. It's a technology demonstration.
Finally (a more serious caveat):
- doing things through stdout may end up being so expensive that we'd
need to do something else. In particular, it's likely that I should
not actually output the "merge results", but instead output a "merge
results as they _differ_ from branch1"
However, I think this patch is already interesting enough that people who
are interested in merging trees might want to look at it. Please keep in
mind that tech _demo_ part, and in particular, keep in mind the final
"serious caveat" part.
In many ways, the really _interesting_ part of a merge is not the result,
but how it _changes_ the branch we're merging into. That's particularly
important as it should hopefully also mean that the output size for any
reasonable case is minimal (and tracks what we actually need to do to the
current state to create the final result).
The code very much is organized so that doing the result as a "diff
against branch1" should be quite easy/possible. I was actually going to do
it, but I decided that it probably makes the output harder to read. I
dunno.
Anyway, let's think about this kind of approach.. Note how the code itself
is actually quite small and short, although it's probably pretty "dense".
As an interesting test-case, I'd suggest this merge in the kernel:
    git-merge-tree $(git-merge-base 4cbf876 7d2babc) 4cbf876 7d2babc
which resolves beautifully (there are no actual file-level conflicts), and
you can look at the output of that command to start thinking about what
it does.
The interesting part (perhaps) is that timing that command for me shows
that it takes all of 0.004 seconds.. (the git-merge-base thing takes
considerably more ;)
The point is, we _can_ do the actual merge part really really quickly.
Linus
PS. Final note: when I say that it is "WORKING CODE", that is obviously by
my standards. IOW, I tested it once and it gave reasonable results - so it
must be perfect.
Whether it works for anybody else, or indeed for any other test-case, is
not my problem ;)
Signed-off-by: Junio C Hamano <junkio@cox.net>
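The tree-level three-way resolve described in the message above can be sketched roughly as follows. This is an illustrative toy in Python, not the code from the patch: the tree representation, the `store` lookup table and the object names are all made up for the example. It also resolves "only one side changed" subdirectories aggressively, which the message itself lists as a caveat.

```python
# A tree maps a name to (kind, object_id), where kind is "blob" or "tree".
# Object ids are invented strings standing in for SHA-1 names; subtrees are
# looked up in `store` only when a recursive merge is actually needed.

def merge_trees(store, base, ours, theirs, prefix=""):
    """Return (stage, path, entry) records; stage 0 means resolved."""
    out = []
    for name in sorted(set(base) | set(ours) | set(theirs)):
        b, o, t = base.get(name), ours.get(name), theirs.get(name)
        path = prefix + name
        if o == t:
            resolved = o                      # both branches agree
        elif o == b:
            resolved = t                      # only branch 2 changed it
        elif t == b:
            resolved = o                      # only branch 1 changed it
        elif all(e is not None and e[0] == "tree" for e in (b, o, t)):
            # "Content conflict" on a subdirectory: merge it recursively.
            out += merge_trees(store, store[b[1]], store[o[1]],
                               store[t[1]], path + "/")
            continue
        else:
            # Real conflict: report stages 1/2/3 for the versions that exist.
            out += [(stage, path, e)
                    for stage, e in ((1, b), (2, o), (3, t)) if e is not None]
            continue
        if resolved is not None:              # None means the path is gone
            out.append((0, path, resolved))
    return out


store = {
    "lib0": {"a.c": ("blob", "a1"), "b.c": ("blob", "b1")},
    "lib1": {"a.c": ("blob", "a2"), "b.c": ("blob", "b1")},
    "lib2": {"a.c": ("blob", "a3"), "b.c": ("blob", "b2")},
}
base = {"Makefile": ("blob", "m1"), "doc": ("tree", "d1"), "lib": ("tree", "lib0")}
ours = {"Makefile": ("blob", "m2"), "doc": ("tree", "d1"), "lib": ("tree", "lib1")}
theirs = {"Makefile": ("blob", "m1"), "doc": ("tree", "d2"), "lib": ("tree", "lib2")}
result = merge_trees(store, base, ours, theirs)
```

Note that the "doc" subdirectory resolves to a single stage 0 entry without its contents ever being looked up, while "lib" is merged recursively because all three versions differ.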
Junio C Hamano [Thu, 16 Feb 2006 06:05:33 +0000 (22:05 -0800)]
topo-order: make --date-order optional.
This adds --date-order to rev-list; it is similar to topo order
in the sense that no parent comes before all of its children,
but otherwise things are still ordered in the commit timestamp
order.
The same flag is also added to show-branch.
Signed-off-by: Junio C Hamano <junkio@cox.net>
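The ordering rule stated above (a parent is shown only after all of its children, and otherwise commits go by timestamp) can be sketched like this. It is a minimal Python model, not the rev-list implementation; the commit names and timestamps are invented, and every parent is assumed to be present in the input.

```python
import heapq


def date_order(commits):
    """commits: {name: (timestamp, [parent names])}.

    Newest first, except that a commit is held back until every one of
    its children has been emitted.
    """
    pending_children = {name: 0 for name in commits}
    for _, parents in commits.values():
        for p in parents:
            pending_children[p] += 1

    # Max-heap on timestamp (via negation), seeded with childless commits.
    heap = [(-commits[n][0], n) for n, c in pending_children.items() if c == 0]
    heapq.heapify(heap)

    order = []
    while heap:
        _, name = heapq.heappop(heap)
        order.append(name)
        for p in commits[name][1]:
            pending_children[p] -= 1
            if pending_children[p] == 0:
                heapq.heappush(heap, (-commits[p][0], p))
    return order


# C was committed on a machine with a skewed clock, so it looks older
# than its own parent A.
history = {
    "A": (10, []),
    "B": (20, ["A"]),
    "C": (5, ["A"]),
    "D": (30, ["B", "C"]),
}
ordered = date_order(history)
```

A plain sort by timestamp would list A before its child C here; the sketch keeps A last.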
Junio C Hamano [Thu, 16 Feb 2006 03:45:03 +0000 (19:45 -0800)]
Merge master to get fixes up to 1.2.1
Junio C Hamano [Thu, 16 Feb 2006 03:42:15 +0000 (19:42 -0800)]
Merge branch 'jc/add'
* jc/add:
Detect misspelled pathspec to git-add
Junio C Hamano [Thu, 16 Feb 2006 03:39:21 +0000 (19:39 -0800)]
Merge fixes up to 1.2.1
Josef Weidendorfer [Wed, 15 Feb 2006 19:22:11 +0000 (20:22 +0100)]
More useful/hinting error messages in git-checkout
Signed-off-by: Josef Weidendorfer <Josef.Weidendorfer@gmx.de>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Fernando J. Pereda [Wed, 15 Feb 2006 11:37:30 +0000 (12:37 +0100)]
Print an error if cloning a http repo and NO_CURL is set
If Git is compiled with NO_CURL=YesPlease and one tries to
clone a http repository, git-clone tries to call the curl
binary. This trivial patch prints an error instead in such a
situation.
Signed-off-by: Fernando J. Pereda <ferdy@gentoo.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Wed, 15 Feb 2006 20:47:43 +0000 (12:47 -0800)]
packed objects: minor cleanup
The delta depth is unsigned.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Wed, 15 Feb 2006 09:58:26 +0000 (01:58 -0800)]
Merge branch 'jc/add'
* jc/add:
Detect misspelled pathspec to git-add
ls-files --error-unmatch pathspec error reporting fix.
Junio C Hamano [Wed, 15 Feb 2006 09:05:59 +0000 (01:05 -0800)]
Detect misspelled pathspec to git-add
This is in the same spirit as an earlier patch for git-commit.
It does an extra ls-files to avoid complaining when a fully
tracked directory name is given on the command line (otherwise
--others restriction would say the pathspec does not match).
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Wed, 15 Feb 2006 09:10:13 +0000 (01:10 -0800)]
ls-files --error-unmatch pathspec error reporting fix.
Earlier patch mistakenly used prefix_len when it meant
prefix_offset. The latter is to strip the leading directories
when run from a subdirectory.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Wed, 15 Feb 2006 01:56:48 +0000 (17:56 -0800)]
Merge branch 'jc/rebase-limit'
* jc/rebase-limit:
rebase: allow rebasing onto different base.
Junio C Hamano [Wed, 15 Feb 2006 01:56:07 +0000 (17:56 -0800)]
Merge branch 'fix'
* fix:
checkout: fix dirty-file display.
Junio C Hamano [Wed, 15 Feb 2006 01:56:02 +0000 (17:56 -0800)]
Merge branch 'master'
* master:
Merge branch 'kh/svn'
git-svnimport: -r adds svn revision number to commit messages
Merge branch 'jc/commit'
commit: detect misspelled pathspec while making a partial commit.
combine-diff: diff-files fix (#2)
combine-diff: diff-files fix.
Merge branch 'jc/rebase'
Merge branch 'ra/email'
Junio C Hamano [Wed, 15 Feb 2006 01:51:50 +0000 (17:51 -0800)]
Merge branch 'kh/svn'
* kh/svn:
git-svnimport: -r adds svn revision number to commit messages
Junio C Hamano [Wed, 15 Feb 2006 01:51:02 +0000 (17:51 -0800)]
Merge branch 'jc/commit'
* jc/commit:
commit: detect misspelled pathspec while making a partial commit.
combine-diff: diff-files fix (#2)
combine-diff: diff-files fix.
Junio C Hamano [Wed, 15 Feb 2006 01:49:00 +0000 (17:49 -0800)]
Merge branch 'jc/rebase'
* jc/rebase:
rebase: allow a hook to refuse rebasing.
Junio C Hamano [Wed, 15 Feb 2006 01:46:41 +0000 (17:46 -0800)]
Merge branch 'ra/email'
* ra/email:
send-email: Add --cc
send-email: Add some options for controlling how addresses are automatically added to the cc: list.
Junio C Hamano [Tue, 14 Feb 2006 22:42:05 +0000 (14:42 -0800)]
rebase: allow rebasing onto different base.
This allows you to rewrite history a bit more flexibly, by
separating the other branch name and new branch point. By
default, the new branch point is the same as the tip of the
other branch as before, but you can specify where you graft the
rebased branch onto.
When you have this ancestry graph:
          A---B---C topic
         /
    D---E---F---G master
    $ git rebase --onto master~1 master topic
would rewrite the history to look like this:
              A'--B'--C' topic
             /
    D---E---F---G master
Signed-off-by: Junio C Hamano <junkio@cox.net>
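The graph bookkeeping behind the example above can be sketched as follows. This is a toy Python model that only tracks single-parent links; it does not apply any patches, and the function names are invented for illustration.

```python
def ancestors(parents, tip):
    """Return tip and everything reachable from it via single parents."""
    seen = []
    while tip is not None:
        seen.append(tip)
        tip = parents.get(tip)
    return seen


def rebase_onto(parents, onto, upstream, branch):
    """Replay the commits in upstream..branch on top of `onto`.

    Returns the new branch tip and an updated copy of the parent map;
    rewritten commits get a trailing apostrophe, as in the graph above.
    """
    upstream_side = set(ancestors(parents, upstream))
    to_replay = [c for c in ancestors(parents, branch) if c not in upstream_side]
    to_replay.reverse()                    # oldest first

    new_parents = dict(parents)
    base = onto
    for commit in to_replay:
        new_parents[commit + "'"] = base
        base = commit + "'"
    return base, new_parents


# The history from the commit message: topic forks from E, master is at G.
graph = {"D": None, "E": "D", "F": "E", "G": "F", "A": "E", "B": "A", "C": "B"}
# master~1 is F, so this mirrors "git rebase --onto master~1 master topic".
new_tip, new_graph = rebase_onto(graph, onto="F", upstream="G", branch="C")
```

Separating `onto` from `upstream` is what the new option adds: `upstream` decides which commits are replayed, `onto` decides where they land.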
Junio C Hamano [Wed, 15 Feb 2006 00:05:57 +0000 (16:05 -0800)]
checkout: fix dirty-file display.
When we refused to switch branches, we incorrectly showed
differences from the branch we would have switched to.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 20:40:20 +0000 (12:40 -0800)]
commit: detect misspelled pathspec while making a partial commit.
When you say "git commit Documentaiton" to make partial commit
for the files only in that directory, we did not detect that as
a misspelled pathname and attempted to commit index without
change. If nothing matched, there is no harm done, but if the
index gets modified otherwise by having another valid pathspec
or after an explicit update-index, a user will not notice
without paying attention to the "git status" preview.
This introduces --error-unmatch option to ls-files, and uses it
to detect this common user error.
Signed-off-by: Junio C Hamano <junkio@cox.net>
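The check that --error-unmatch enables can be sketched as below. This is a simplified Python model that only matches exact names and leading directories (glob patterns are ignored), and the function name is invented for the example.

```python
def unmatched_pathspecs(tracked, pathspecs):
    """Return the pathspecs that match no tracked path."""
    def matches(spec, path):
        spec = spec.rstrip("/")
        return path == spec or path.startswith(spec + "/")

    return [s for s in pathspecs if not any(matches(s, p) for p in tracked)]


tracked_files = ["Documentation/git-commit.txt", "Makefile", "commit.c"]
# One misspelled directory next to one valid path, as in the scenario above.
bad = unmatched_pathspecs(tracked_files, ["Documentaiton", "Makefile"])
```

A caller would refuse to commit when the returned list is non-empty, instead of silently committing whatever else happens to be in the index.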
Karl Hasselström [Tue, 14 Feb 2006 02:43:34 +0000 (03:43 +0100)]
git-svnimport: -r adds svn revision number to commit messages
New -r flag for prepending the corresponding Subversion revision
number to each commit message.
Signed-off-by: Karl Hasselström <kha@treskal.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 09:11:42 +0000 (01:11 -0800)]
combine-diff: diff-files fix (#2)
The raw format "git-diff-files -c" to show unmerged state forgot
to initialize the status fields from parents, causing NUL
characters to be emitted.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 07:44:41 +0000 (23:44 -0800)]
Merge branch 'master'
* master:
Merge some proposed fixes
s/SHELL/SHELL_PATH/ in Makefile
bisect: remove BISECT_NAMES after done.
Documentation: git-ls-files asciidocco.
Documentation: git-commit in 1.2.X series defaults to --include.
Merge branch 'pb/bisect'
Junio C Hamano [Tue, 14 Feb 2006 07:34:58 +0000 (23:34 -0800)]
Merge some proposed fixes
Conflicts:
Documentation/git-commit.txt - taking the post 1.2.0 semantics.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 07:26:53 +0000 (23:26 -0800)]
Merge branch 'pb/bisect'
* pb/bisect:
Properly git-bisect reset after bisecting from non-master head
Junio C Hamano [Tue, 14 Feb 2006 07:07:04 +0000 (23:07 -0800)]
combine-diff: diff-files fix.
When showing a conflicted merge from index stages and working
tree file, we did not fetch the mode from the working tree,
and mistook that as a deleted file. Also if the manual
resolution (or automated resolution by git rerere) ended up
taking either parent's version, we did not show _anything_ for
that path. Either was quite bad and confusing.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Fredrik Kuivinen [Mon, 13 Feb 2006 23:15:14 +0000 (00:15 +0100)]
s/SHELL/SHELL_PATH/ in Makefile
With the current Makefile we don't use the shell chosen by the
platform specific defines when we invoke GIT-VERSION-GEN.
Signed-off-by: Fredrik Kuivinen <freku045@student.liu.se>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 05:25:38 +0000 (21:25 -0800)]
bisect: remove BISECT_NAMES after done.
I noticed that we forgot to clean this file and kept it that
way, while trying to help with Andrew's bisect problem.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Tue, 14 Feb 2006 05:52:10 +0000 (21:52 -0800)]
Documentation: git-ls-files asciidocco.
Noticed by Jon Nelson.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Mon, 13 Feb 2006 10:38:57 +0000 (02:38 -0800)]
Merge branch 'ra/email'
* ra/email:
send-email: Add --cc
send-email: Add some options for controlling how addresses are automatically added to the cc: list.
Junio C Hamano [Mon, 13 Feb 2006 10:38:20 +0000 (02:38 -0800)]
Merge branch 'jc/commit'
* jc/commit:
git-commit: Now --only semantics is the default.
Junio C Hamano [Mon, 13 Feb 2006 10:38:16 +0000 (02:38 -0800)]
Merge branch 'jc/rebase'
* jc/rebase:
rebase: allow a hook to refuse rebasing.
Junio C Hamano [Mon, 13 Feb 2006 10:38:12 +0000 (02:38 -0800)]
Merge branch 'jc/nostat'
* jc/nostat:
cache_name_compare() compares name and stage, nothing else.
Ryan Anderson [Mon, 13 Feb 2006 08:05:15 +0000 (03:05 -0500)]
send-email: Add --cc
Since Junio used this in an example, and I've personally tried to use it, I
suppose the option should actually exist.
Signed-off-by: Ryan Anderson <ryan@michonline.com>
Junio C Hamano [Mon, 13 Feb 2006 08:26:14 +0000 (00:26 -0800)]
Documentation: git-commit in 1.2.X series defaults to --include.
The documentation was mistakenly describing the --only semantics to
be default. The 1.2.0 release and its maintenance series 1.2.X will
keep the traditional --include semantics as the default. Clarify the
situation.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Ryan Anderson [Mon, 13 Feb 2006 07:57:09 +0000 (02:57 -0500)]
send-email: Add some options for controlling how addresses are automatically added to the cc: list.
Signed-off-by: Ryan Anderson <ryan@michonline.com>
Junio C Hamano [Mon, 13 Feb 2006 07:17:04 +0000 (23:17 -0800)]
rebase: allow a hook to refuse rebasing.
This lets a hook interfere with a rebase and helps prevent certain
branches from being rebased by mistake. A sample hook shows
how to prevent rebasing a topic branch that has already been merged
into the publish branch.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Mon, 13 Feb 2006 07:55:07 +0000 (23:55 -0800)]
git-commit: Now --only semantics is the default.
This changes the "git commit paths..." to default to --only
semantics from traditional --include semantics, as agreed on the
list.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Mon, 13 Feb 2006 07:46:25 +0000 (23:46 -0800)]
cache_name_compare() compares name and stage, nothing else.
The code was a bit unclear in expressing what it wants to compare.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 21:15:12 +0000 (13:15 -0800)]
Merge branch 'master'
* master:
GIT 1.2.0
Fix "test: unexpected operator" on bsd
Junio C Hamano [Sun, 12 Feb 2006 21:14:53 +0000 (13:14 -0800)]
GIT 1.2.0
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 21:13:12 +0000 (13:13 -0800)]
Fix "test: unexpected operator" on bsd
This fixes the same issue that a previous fix by Alex Riesen addressed.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 21:09:08 +0000 (13:09 -0800)]
Merge branch 'pb/bisect'
* pb/bisect:
Properly git-bisect reset after bisecting from non-master head
git-commit: show dirtiness including index.
Make pack-objects chattier.
Petr Baudis [Sun, 12 Feb 2006 16:06:14 +0000 (17:06 +0100)]
Properly git-bisect reset after bisecting from non-master head
git-bisect reset without an argument would return to master even
if the bisecting started at a non-master branch. This patch makes
it save the original branch name to .git/head-name and restore it
afterwards.
This is also compatible with Cogito and cg-seek, so cg-status will
show that we are seeked on the bisect branch and cg-reset will
properly restore the original branch.
git-bisect start will refuse to work if it is not on a bisect but
.git/head-name exists; this is to protect against conflicts with
other seeking tools.
Signed-off-by: Petr Baudis <pasky@suse.cz>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 21:05:53 +0000 (13:05 -0800)]
git-commit: show dirtiness including index.
Earlier, when we switched a branch we used diff-files to show
paths that are dirty in the working tree. But we allow switching
branches with updated index ("read-tree -m -u $old $new" works that
way), and only showing paths that have differences in the working
tree but not paths that are different in index was confusing.
This shows both as modified from the top commit of the branch we
just have switched to.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 21:01:54 +0000 (13:01 -0800)]
Make pack-objects chattier.
You could give -q to squelch it, but currently no tool does it.
This would make 'git clone host:repo here' over ssh not silent
again.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 19:36:54 +0000 (11:36 -0800)]
Merge branch 'master'
* master:
avoid echo -e, there are systems where it does not work
fix "test: 2: unexpected operator" on bsd
Fix object re-hashing
hashtable-based objects: minimum fixups.
Use a hashtable for objects instead of a sorted list
Alex Riesen [Sun, 12 Feb 2006 18:05:34 +0000 (19:05 +0100)]
avoid echo -e, there are systems where it does not work
FreeBSD 4.11 being one example: the built-in echo doesn't have -e,
and the installed /bin/echo does not do "-e" either.
"printf" works, lacking just "\e" and "\xAB".
Signed-off-by: Junio C Hamano <junkio@cox.net>
Alex Riesen [Sun, 12 Feb 2006 18:03:16 +0000 (19:03 +0100)]
fix "test: 2: unexpected operator" on bsd
Signed-off-by: Junio C Hamano <junkio@cox.net>
Linus Torvalds [Sun, 12 Feb 2006 19:24:50 +0000 (11:24 -0800)]
Fix object re-hashing
The hashed object lookup had a subtle bug in re-hashing: it did
    for (i = 0; i < count; i++)
        if (objs[i]) {
            .. rehash ..
where "count" was the old hash count. On the face of it, that looks obviously
right, since it clearly re-hashes all the old objects.
However, it's wrong.
If the last old hash entry before re-hashing was in use (or became in use
by the re-hashing), then re-hashing could have inserted an object
into the hash entries with idx >= count due to overflow. When we then
rehash the last old entry, that old entry might become empty, which means
that the overflow entries should be re-hashed again.
In other words, the loop has to be fixed to traverse the whole
array, rather than just the old count.
(There's room for a slight optimization: instead of counting all the way
up, we can break when we see the first empty slot that is above the old
"count". At that point we know we don't have any collisions that we might
have to fix up any more. This patch only does the trivial fix)
[jc: with trivial fix on trivial fix]
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
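The failure described above can be reproduced with a toy open-addressing table. The sketch below is in Python with small integers standing in for object names, so it is a model of the problem rather than the code from object.c; the `scan_whole_array` switch is invented to show both behaviours side by side. It assumes the table always has a free slot and does not try to cover every wrap-around corner case.

```python
def find_slot(slots, key):
    """Linear probing: index holding key, or the first empty slot on its
    probe path (wrapping at the end of the array)."""
    i = key % len(slots)
    while slots[i] is not None and slots[i] != key:
        i = (i + 1) % len(slots)
    return i


def insert(slots, key):
    slots[find_slot(slots, key)] = key


def contains(slots, key):
    return slots[find_slot(slots, key)] == key


def grow(slots, scan_whole_array):
    """Double the table in place and move entries to their new homes."""
    old_count = len(slots)
    slots.extend([None] * old_count)
    limit = len(slots) if scan_whole_array else old_count
    for i in range(limit):
        if slots[i] is not None:
            j = find_slot(slots, slots[i])
            if j != i:
                slots[j], slots[i] = slots[i], None


def demo(scan_whole_array):
    slots = [None] * 4
    insert(slots, 7)      # hashes to slot 3
    insert(slots, 3)      # also hashes to slot 3, wraps around to slot 0
    grow(slots, scan_whole_array)
    return contains(slots, 3), contains(slots, 7)
```

With the old loop bound, key 3 is first pushed to slot 4 because slot 3 is still occupied, then slot 3 is vacated when key 7 moves to slot 7, and a later lookup for 3 stops at the empty slot 3. Scanning the whole array revisits slot 4 and moves the entry back.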
Junio C Hamano [Sun, 12 Feb 2006 02:51:19 +0000 (18:51 -0800)]
hashtable-based objects: minimum fixups.
Calling hashtable_index from find_object before objs is created
would result in division by zero failure. Avoid it.
Also the given object name may not be aligned suitably for
unsigned int; avoid dereferencing casted pointer.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Johannes Schindelin [Sun, 12 Feb 2006 01:57:57 +0000 (02:57 +0100)]
Use a hashtable for objects instead of a sorted list
In a simple test, this brings down the CPU time from 47 sec to 22 sec.
Signed-off-by: Johannes Schindelin <Johannes.Schindelin@gmx.de>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 13:03:40 +0000 (05:03 -0800)]
Merge branch 'master'
* master:
Add howto about separating topics.
Merge branch 'pb/repo'
Add support for explicit type specifiers when calling git-repo-config
Merge branch 'jc/fixdiff'
diff-tree: do not default to -c
Avoid using "git-var -l" until it gets fixed.
t5500: adjust to change in pack-object reporting behaviour.
Only call git-rerere if $GIT_DIR/rr-cache exists.
Use a relative path for SVN importing
fetch-clone progress: finishing touches.
Fix fetch-clone in the presense of signals
Make "git clone" pack-fetching download statistics better
Make "git clone" less of a deathly quiet experience
kent@lysator.liu.se [Sun, 12 Feb 2006 12:00:52 +0000 (13:00 +0100)]
Add howto about separating topics.
This howto consists of a footnote from an email by JC to the git
mailing list (<7vfyms0x4p.fsf@assigned-by-dhcp.cox.net>).
Signed-off-by: Kent Engstrom <kent@lysator.liu.se>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 13:02:30 +0000 (05:02 -0800)]
Merge branch 'pb/repo'
* pb/repo:
Add support for explicit type specifiers when calling git-repo-config
Junio C Hamano [Sun, 12 Feb 2006 13:02:25 +0000 (05:02 -0800)]
Merge branch 'jc/fixdiff'
* jc/fixdiff:
diff-tree: do not default to -c
Junio C Hamano [Sat, 11 Feb 2006 20:39:11 +0000 (12:39 -0800)]
Avoid using "git-var -l" until it gets fixed.
This is to be nicer to people with unusable GECOS field.
"git-var -l" is currently broken in that when used by a user who
does not have a usable GECOS field and has not corrected it by
exporting GIT_COMMITTER_NAME environment variable it dies when
it tries to output GIT_COMMITTER_IDENT (same thing for AUTHOR).
"git-pull" used "git-var -l" only because it needed to get a
configuration variable before "git-repo-config --get" was
introduced. Use the latter tool designed exactly for this
purpose.
"git-sh-setup" used "git-var GIT_AUTHOR_IDENT" without actually
wanting to use its value. The only purpose was to cause the
command to check and barf if the repository format version
recorded in the $GIT_DIR/config file is too new for us to deal
with correctly. Instead, use "repo-config --get" on a random
property and see if it die()s, and check if the exit status is
128 (comes from die -- missing variable is reported with exit
status 1, so we can tell that case apart).
Signed-off-by: Junio C Hamano <junkio@cox.net>
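The exit-status test described for git-sh-setup boils down to a three-way classification, sketched here in Python. Only the meanings of 128 and 1 come from the message above; the function name and the returned labels are invented for illustration.

```python
def classify_probe(exit_status):
    """Interpret the exit status of a 'repo-config --get' probe."""
    if exit_status == 128:
        return "died"          # die() was called, e.g. format too new
    if exit_status in (0, 1):
        return "ok"            # variable found (0) or simply missing (1)
    return "unexpected"        # anything else is outside this sketch


statuses = {code: classify_probe(code) for code in (0, 1, 128)}
```

Because a missing variable reports 1 rather than 128, probing an arbitrary property is enough to tell "repository is readable" apart from "the tool refused to read it".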
Junio C Hamano [Sun, 12 Feb 2006 12:15:50 +0000 (04:15 -0800)]
Merge branch 'jc/nostat'
* jc/nostat:
"assume unchanged" git: documentation.
ls-files: split "show-valid-bit" into a different option.
Junio C Hamano [Sun, 12 Feb 2006 09:48:47 +0000 (01:48 -0800)]
"assume unchanged" git: documentation.
This updates documentation to describe the "assume unchanged"
behaviour.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 09:47:57 +0000 (01:47 -0800)]
ls-files: split "show-valid-bit" into a different option.
To preserve compatibility with scripts that expect uppercase
letters to be shown, do not make '-t' unconditionally show
the valid bit. Introduce a '-v' option for that.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Petr Baudis [Sun, 12 Feb 2006 03:14:48 +0000 (04:14 +0100)]
Add support for explicit type specifiers when calling git-repo-config
Currently, git-repo-config will just return the raw value of option
as specified in the config file; this makes things difficult for scripts
calling it, especially if the value is supposed to be boolean.
This patch makes it possible to ask git-repo-config to check if the option
is of the given type (int or bool) and write out the value in its
canonical form. If you do not pass --int or --bool, the behaviour stays
unchanged and the raw value is emitted.
This also incidentally fixes the segfault when an option with no value is
encountered.
[jc: tweaked the option parsing a bit to make it easier to see
that the patch does not change anything but the type stuff in
the diff output. Also changed to avoid "foo ? : bar" construct. ]
Signed-off-by: Petr Baudis <pasky@suse.cz>
Signed-off-by: Junio C Hamano <junkio@cox.net>
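What "write out the value in its canonical form" might look like can be sketched as follows. This is a Python illustration, not the git-repo-config code; the accepted boolean spellings (true/false in any case, or an integer), the treatment of a valueless option as true, and the function name are assumptions made for the example.

```python
def canonical_value(raw, kind=None):
    """Return the config value as text.

    raw is the string from the config file, or None when the option is
    listed without a value. kind is None (raw output), "int" or "bool".
    A value that does not fit the requested type raises ValueError.
    """
    if kind is None:
        return "" if raw is None else raw
    if kind == "bool":
        if raw is None:
            return "true"
        word = raw.strip().lower()
        if word in ("true", "false"):
            return word
        return "true" if int(word) != 0 else "false"
    if kind == "int":
        if raw is None:
            raise ValueError("option has no value")
        return str(int(raw.strip()))
    raise ValueError("unknown type: %r" % (kind,))
```

A calling script can then compare against the literal strings "true" and "false" instead of guessing at every spelling a user may have written.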
Junio C Hamano [Sun, 12 Feb 2006 00:43:30 +0000 (16:43 -0800)]
diff-tree: do not default to -c
Marco says it breaks qgit. This makes the flags a bit more
orthogonal.
$ git-diff-tree -r --abbrev ca18
No output from this command because you asked to skip merge by
not having -m there.
$ git-diff-tree -r -m --abbrev ca18
ca182053c7710a286d72102f4576cf32e0dafcfb
:100644 100644 538d21d... 59042d1... M Makefile
:100644 100644 410b758... 6c47c3a... M entry.c
ca182053c7710a286d72102f4576cf32e0dafcfb
:100644 100644 30479b4... 59042d1... M Makefile
The same "independent sets of diff" as before without -c.
$ git-diff-tree -r -m -c --abbrev ca18
ca182053c7710a286d72102f4576cf32e0dafcfb
::100644 100644 100644 538d21d... 30479b4... 59042d1... MM Makefile
Combined.
$ git-diff-tree -r -c --abbrev ca18
ca182053c7710a286d72102f4576cf32e0dafcfb
::100644 100644 100644 538d21d... 30479b4... 59042d1... MM Makefile
Asking for combined without -m does not make sense, so -c
implies -m.
We need to supply -c as default to whatchanged, which is a
one-liner.
Signed-off-by: Junio C Hamano <junkio@cox.net>
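The flag interaction shown in the transcripts above can be modelled in a few lines. This is a Python sketch rather than the diff-tree code: trees are plain dictionaries, only path names are reported, and the keyword arguments are invented stand-ins for -m and -c. The blob names are the abbreviated ones from the example output, and at least one parent is assumed.

```python
def changed_paths(parent_tree, merge_tree):
    names = set(parent_tree) | set(merge_tree)
    return {n for n in names if parent_tree.get(n) != merge_tree.get(n)}


def diff_tree(parents, merge, show_merges=False, combined=False):
    """Return a list of path lists, one per emitted diff."""
    if combined:
        show_merges = True                 # -c implies -m
    if len(parents) > 1 and not show_merges:
        return []                          # merges are skipped without -m
    per_parent = [changed_paths(p, merge) for p in parents]
    if combined:
        # Keep only paths that differ from every parent.
        return [sorted(set.intersection(*per_parent))]
    return [sorted(paths) for paths in per_parent]


parent1 = {"Makefile": "538d21d", "entry.c": "410b758"}
parent2 = {"Makefile": "30479b4", "entry.c": "6c47c3a"}
merge = {"Makefile": "59042d1", "entry.c": "6c47c3a"}
```

With these inputs, entry.c drops out of the combined view because the merge result is identical to the second parent's version.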
Junio C Hamano [Sun, 12 Feb 2006 07:08:23 +0000 (23:08 -0800)]
t5500: adjust to change in pack-object reporting behaviour.
Now pack-object is not as chatty when its stderr is not connected
to a terminal, so the test needs to be adjusted for that.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 02:55:43 +0000 (18:55 -0800)]
Only call git-rerere if $GIT_DIR/rr-cache exists.
Johannes noticed that git-rerere depends on Digest.pm, and if
one does not use the command, one can live without it.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Christian Biesinger [Sat, 11 Feb 2006 15:44:11 +0000 (16:44 +0100)]
Use a relative path for SVN importing
The absolute path (with the leading slash) breaks SVN importing,
because it then looks for /trunk/... instead of /svn/trunk/...
(in my case, the repository URL was https://servername/svn/)
Signed-off-by: Christian Biesinger <cbiesinger@web.de>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sun, 12 Feb 2006 01:54:18 +0000 (17:54 -0800)]
fetch-clone progress: finishing touches.
This makes fetch-pack also report the progress of packing part.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Linus Torvalds [Sat, 11 Feb 2006 18:41:22 +0000 (10:41 -0800)]
Fix fetch-clone in the presense of signals
We shouldn't fail a fetch just because a signal might have interrupted
the read.
Normally, we don't install any signal handlers, so EINTR really shouldn't
happen. That said, really old versions of Linux will interrupt an
interruptible system call even for signals that turn out to be ignored
(SIGWINCH is the classic example - resizing your xterm would cause it).
The same might well be true elsewhere too.
Also, since receive_keep_pack() doesn't control the caller, it can't know
that no signal handlers exist.
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
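The retry-on-EINTR idea can be sketched as a small wrapper. The example is in Python with a simulated reader, since recent Python versions already retry interrupted system calls on their own; `read_retrying` and `FlakyReader` are names invented for the illustration and say nothing about how receive_keep_pack() is actually written.

```python
def read_retrying(read_once, size):
    """Call read_once(size) until it completes without being interrupted."""
    while True:
        try:
            return read_once(size)
        except InterruptedError:
            continue        # EINTR: harmless signal, just try again


class FlakyReader:
    """Stand-in for a read() that is interrupted a few times first."""

    def __init__(self, data, interruptions):
        self.data = data
        self.interruptions = interruptions

    def __call__(self, size):
        if self.interruptions > 0:
            self.interruptions -= 1
            raise InterruptedError("simulated EINTR")
        chunk, self.data = self.data[:size], self.data[size:]
        return chunk


reader = FlakyReader(b"PACKdata", interruptions=2)
first = read_retrying(reader, 4)
```

Any other error still propagates to the caller; only the interrupted case is retried.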
Linus Torvalds [Sat, 11 Feb 2006 18:43:56 +0000 (10:43 -0800)]
Make "git clone" pack-fetching download statistics better
Average it out over a few events to make the numbers stable, and fix the
silly usec->binary-ms conversion.
Yeah, yeah, it's arguably eye-candy to keep the user calm, but let's do
that right.
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
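Averaging over a few events can be sketched with a small sliding window. This Python example is an assumption-laden illustration: the window size, the class name and the choice of kB as 1024 bytes are all invented, and the decimal microsecond conversion reflects one reading of the "usec->binary-ms" remark above.

```python
from collections import deque


class Throughput:
    """Average transfer speed over the last few (bytes, usec) samples."""

    def __init__(self, window=8):
        self.samples = deque(maxlen=window)   # old samples fall off the end

    def add(self, nbytes, usec):
        self.samples.append((nbytes, usec))

    def kb_per_sec(self):
        total_bytes = sum(b for b, _ in self.samples)
        total_usec = sum(u for _, u in self.samples)
        if total_usec == 0:
            return 0.0
        seconds = total_usec / 1_000_000      # decimal, not a power of two
        return total_bytes / 1024 / seconds


meter = Throughput(window=2)
meter.add(1024, 500_000)
meter.add(1024, 500_000)
steady = meter.kb_per_sec()
meter.add(4096, 500_000)                      # pushes the oldest sample out
after_burst = meter.kb_per_sec()
```

A single fast or slow read moves the displayed number only partway, which is what keeps the figure stable.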
Linus Torvalds [Sat, 11 Feb 2006 04:31:09 +0000 (20:31 -0800)]
Make "git clone" less of a deathly quiet experience
It used to be that "git-unpack-objects" would give nice percentages, but
now that we don't unpack the initial clone pack any more, it doesn't. And
I'd love to do that nice percentage view in the pack objects downloader
too, but the thing doesn't even read the pack header, much less know how
much it's going to get, so I was lazy and didn't.
Instead, it at least prints out how much data it's gotten, and what the
packing speed is. Which makes the user realize that it's actually doing
something useful instead of sitting there silently (and if the recipient
knows how large the final result is, he can at least make a guess about
when it might be done).
So with this patch, I get something like this on my DSL line:
[torvalds@g5 ~]$ time git clone master.kernel.org:/pub/scm/linux/kernel/git/torvalds/linux-2.6 clone-test
Packing 188543 objects
48.398MB (154 kB/s)
where even the speed approximation seems to be roughly correct (even
though my algorithm is a truly stupid one, and only really gives "speed in
the last half second or so").
Anyway, _something_ like this is definitely needed. It could certainly be
better (if it showed the same kind of thing that git-unpack-objects did,
that would be much nicer, but would require parsing the object stream as
it comes in). But this is a big step forward, I think.
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sat, 11 Feb 2006 03:12:57 +0000 (19:12 -0800)]
Merge branch 'master'
* master:
Define GIT_(AUTHOR|COMMITTER)_(NAME|EMAIL) to known values.
Merge branch 'lt/diff-tree'
git-commit -v: have patch at the end.
Junio C Hamano [Sat, 11 Feb 2006 03:11:23 +0000 (19:11 -0800)]
Define GIT_(AUTHOR|COMMITTER)_(NAME|EMAIL) to known values.
Without these, running tests with an account with empty gecos
field would fail.
We might want to loosen error from "git-var -l" (but not
"git-var GIT_AUTHOR_NAME") later, but that is more or less an
independent issue.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Sat, 11 Feb 2006 02:47:41 +0000 (18:47 -0800)]
Merge branch 'lt/diff-tree'
* lt/diff-tree:
combine-diff: Record diff status a bit more faithfully
find_unique_abbrev() simplification.
combine-diff: move formatting logic to show_combined_diff()
combined-diff: use diffcore before intersecting paths.
diff-tree -c raw output
Junio C Hamano [Sat, 11 Feb 2006 02:38:24 +0000 (18:38 -0800)]
git-commit -v: have patch at the end.
It was pointed out that otherwise more important summary
information prefixed with '#' would become prone to be missed.
Also instead of chopping at the first '^---$' line, stop at the
first 'diff --git a/' line.
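The cut-off rule can be sketched as below. This is an illustration in Python, not the shell code of git-commit itself, and the function name strip_patch is made up for the example:

```python
def strip_patch(buffer_text):
    """Drop everything from the first 'diff --git a/' line onward."""
    kept = []
    for line in buffer_text.splitlines(keepends=True):
        if line.startswith("diff --git a/"):
            break  # the appended patch starts here; the message ends
        kept.append(line)
    return "".join(kept)
```

A log message that happens to contain a bare '---' line survives intact with this rule, which chopping at the first '^---$' line would not guarantee.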
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Fri, 10 Feb 2006 19:57:08 +0000 (11:57 -0800)]
Merge branch 'master'
* master:
rev-list: default to abbreviate merge parent names under --pretty.
delta micro optimization
count-delta.c: comment fixes
Merge branch 'jc/empty-commit'
Junio C Hamano [Fri, 10 Feb 2006 19:56:42 +0000 (11:56 -0800)]
rev-list: default to abbreviate merge parent names under --pretty.
When we prettyprint commit log messages, merge parent names were
often very long and there was no way to abbreviate them.
This changes them to be abbreviated by default, and non-default
abbreviations can be specified with --no-abbrev or --abbrev=<n>
options.
Note that this affects only the prettyprinted parent names. The
output from --show-parents is meant for machine consumption and
is not affected by this flag.
Nicolas Pitre [Fri, 10 Feb 2006 18:42:05 +0000 (13:42 -0500)]
delta micro optimization
My kernel work habit made me look at the generated assembly for the
delta code, and one obvious albeit small improvement is this patch.
Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Nicolas Pitre [Fri, 10 Feb 2006 15:20:40 +0000 (10:20 -0500)]
count-delta.c: comment fixes
There was a stale comment that explains why the old code could
undercount when delta data copied things around inside destination
buffer. We do not use that kind of delta, so the comment does
not apply.
Junio C Hamano [Fri, 10 Feb 2006 15:14:55 +0000 (07:14 -0800)]
Merge branch 'jc/empty-commit'
* jc/empty-commit:
t6000: fix a careless test library add-on.
Do not allow empty name or email.
Junio C Hamano [Fri, 10 Feb 2006 14:51:28 +0000 (06:51 -0800)]
Merge branch 'lt/diff-tree'
* lt/diff-tree:
combine-diff: Record diff status a bit more faithfully
find_unique_abbrev() simplification.
Junio C Hamano [Fri, 10 Feb 2006 10:30:52 +0000 (02:30 -0800)]
combine-diff: Record diff status a bit more faithfully
This shows "new file mode XXXX" and "deleted file mode XXXX"
lines like two-way diff-patch output does, by checking the
status from each parent.
The diff-raw output for combined diff is made a bit uglier by
showing diff status letters with each parent. While in most
cases you would see "MM" in the output, an Evil Merge that
touches a path that was added by inheriting from one parent is
possible, and it would be shown like this:
$ git-diff-tree --abbrev -c HEAD
2d7ca89675eb8888b0b88a91102f096d4471f09f
::000000 000000 100644 0000000... 0000000... 31dd686... AA b
::000000 100644 100644 0000000... 6c884ae... c6d4fa8... AM d
::100644 100644 100644 4f7cbe7... f8c295c... 19d5d80... RR e
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Fri, 10 Feb 2006 09:51:12 +0000 (01:51 -0800)]
find_unique_abbrev() simplification.
Earlier it did not grok the 0{40} SHA1 very well, but what it
needed to do was to find the shortest 0{N} that is not used as a
valid object name to be consistent with the way names of valid
objects are abbreviated. This makes some callers simpler.
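The rule described here can be sketched as follows. This is not the C implementation; the function name, the `known` collection of full object names and the default minimum length of 7 are assumptions made for the illustration:

```python
def shortest_unused_prefix(name, known, minimum=7):
    """Return the shortest prefix of `name` (at least `minimum` hex digits)
    that is not a prefix of any other name in `known`."""
    others = [k for k in known if k != name]
    for n in range(minimum, len(name) + 1):
        prefix = name[:n]
        if not any(k.startswith(prefix) for k in others):
            return prefix
    return name  # nothing shorter is unambiguous
```

Passing the all-zero name goes through exactly the same loop as a real object name, which is the consistency the message asks for: the result is the shortest run of zeros that no valid object name starts with.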
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Fri, 10 Feb 2006 08:55:34 +0000 (00:55 -0800)]
Merge branch 'jc/status'
* jc/status:
git-status -v
Junio C Hamano [Fri, 10 Feb 2006 08:45:59 +0000 (00:45 -0800)]
git-status -v
This revamps the git-status command to take the same set of
parameters as git commit. It gives a preview of what is being
committed with that command. With -v flag, it shows the diff
output between the HEAD commit and the index that would be
committed if these flags were given to git-commit command.
git-commit also acquires -v flag (it used to mean "verify" but
that is the default anyway and there is --no-verify to turn it
off, so not much is lost), which uses the updated git-status -v
to seed the commit log buffer. This is handy for writing a log
message while reviewing the changes one last time.
Now, git-commit and git-status internally share the same
implementation.
Unlike the previous git-commit change, this uses a temporary index
to prepare the index file that would become the real index file
after a successful commit, and moves it to the real index file
once the commit is actually made. This makes it safer than the
previous scheme, which stashed away the original index file and
restored it after an aborted commit.
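The "prepare in a temporary file, move into place on success" pattern can be sketched like this. It is a Python illustration of the idea only; the function name, the `do_commit` callback and the "next-index-" file prefix are hypothetical and do not reflect how the git-commit script names things:

```python
import os
import tempfile


def commit_with_temp_index(index_path, new_contents, do_commit):
    """Prepare the next index in a temporary file and install it over
    index_path only if do_commit(tmp_path) reports success."""
    directory = os.path.dirname(os.path.abspath(index_path))
    # Same directory as the real index, so the final rename stays on one
    # filesystem and replaces the old file in a single step.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix="next-index-")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(new_contents)
        if not do_commit(tmp_path):
            return False  # aborted: the real index was never touched
        os.replace(tmp_path, index_path)
        return True
    finally:
        # After a successful rename the temporary name is gone already.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
```

An aborted commit needs no restore step here, because the real index file is only ever replaced after the commit has succeeded.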
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Fri, 10 Feb 2006 06:19:21 +0000 (22:19 -0800)]
Merge branch 'master'
* master:
Merge branch 'jc/ls-files-o'
count-delta.c: Match the delta data semantics change in version 3.
remove delta-against-self bit
stat() for existence in safe_create_leading_directories()
call git_config() after setup_git_directory()
Add --diff-filter= documentation paragraph
Junio C Hamano [Fri, 10 Feb 2006 06:19:07 +0000 (22:19 -0800)]
Merge branch 'jc/ls-files-o'
* jc/ls-files-o:
ls-files: honour per-directory ignore file from higher directories.
Junio C Hamano [Fri, 10 Feb 2006 05:10:52 +0000 (21:10 -0800)]
Merge branch 'lt/diff-tree'
* lt/diff-tree:
combine-diff: move formatting logic to show_combined_diff()
combined-diff: use diffcore before intersecting paths.
diff-tree -c raw output
Junio C Hamano [Fri, 10 Feb 2006 01:15:59 +0000 (17:15 -0800)]
count-delta.c: Match the delta data semantics change in version 3.
This matches the count_delta() logic to the change previous
commit introduces to patch_delta().
Signed-off-by: Junio C Hamano <junkio@cox.net>
Nicolas Pitre [Thu, 9 Feb 2006 22:50:04 +0000 (17:50 -0500)]
remove delta-against-self bit
After experimenting with code to add the ability to encode a delta
against part of the deltified file, it turns out that resulting packs
are _bigger_ than when this ability is not used. The raw delta output
might be smaller, but it doesn't compress as well using gzip, so the net
saving is negative on average.
Said bit would in fact be more useful to allow for encoding the copying
of chunks larger than 64KB providing more savings with large files.
This will correspond to pack version 3.
While the current code still produces version 2 packs, it is made
future-proof so that pack versions 2 and 3 are both accepted. Any version 2
pack is compatible with version 3 since the redefined bit was never used before.
When enough time has passed, code to use that bit to produce version 3
packs could be added.
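A sketch of how a copy instruction might be decoded once the bit is redefined. It assumes the usual delta copy layout (high bit 0x80 marks a copy, bits 0x01-0x08 select offset bytes, bits 0x10-0x20 select size bytes) and assumes that the redefined bit is 0x40, used as a third size byte. None of this is spelled out in the message above, and the function name is made up:

```python
def decode_copy(cmd, data, pos=0):
    """Decode one delta copy instruction.

    `cmd` is the instruction byte, `data` the bytes that follow it.
    Returns (offset, size, next_pos)."""
    if not cmd & 0x80:
        raise ValueError("not a copy instruction")
    offset = size = 0
    # Each low bit that is set means one more little-endian offset byte.
    for bit, shift in ((0x01, 0), (0x02, 8), (0x04, 16), (0x08, 24)):
        if cmd & bit:
            offset |= data[pos] << shift
            pos += 1
    # Size bytes; 0x40 is ASSUMED to be the redefined bit (third size byte).
    for bit, shift in ((0x10, 0), (0x20, 8), (0x40, 16)):
        if cmd & bit:
            size |= data[pos] << shift
            pos += 1
    if size == 0:
        size = 0x10000  # an absent size stands for 64KB
    return offset, size, pos
```

With only two size bytes a single instruction cannot describe a copy larger than 64KB; under the assumption above, a third byte lifts that limit, which is where the savings on large files would come from.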
Signed-off-by: Nicolas Pitre <nico@cam.org>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Jason Riedy [Fri, 10 Feb 2006 01:56:13 +0000 (17:56 -0800)]
stat() for existence in safe_create_leading_directories()
Use stat() to explicitly check for existence rather than
relying on the non-portable EEXIST error in sha1_file.c's
safe_create_leading_directories(). There certainly are
optimizations possible, but then the code becomes almost
the same as that in coreutils' lib/mkdir-p.c.
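The approach can be sketched as below, in Python instead of the C of sha1_file.c; the function name is shortened and the error handling is simplified for the illustration:

```python
import os
import stat


def create_leading_directories(path):
    """Create every missing directory leading up to `path`, without
    creating `path` itself."""
    pos = path.find("/", 1)
    while pos != -1:
        prefix = path[:pos]
        try:
            st = os.stat(prefix)  # explicit existence check
        except FileNotFoundError:
            os.mkdir(prefix)      # missing: create this level
        else:
            if not stat.S_ISDIR(st.st_mode):
                raise NotADirectoryError(prefix)
        pos = path.find("/", pos + 1)
```

The existence check happens before mkdir() is attempted, so nothing depends on which error a particular platform reports when the directory is already there.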
Other uses of EEXIST seem ok. Tested on Solaris 8, AIX 5.2L,
and a few Linux versions. AIX has some unrelated (I think)
failures right now; I haven't tried many recent gits there.
Anyone have an old Ultrix box to break everything? ;)
Also remove extraneous #includes. Everything's already in
git-compat-util.h, included through cache.h.
Signed-off-by: Jason Riedy <ejr@cs.berkeley.edu>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Thu, 9 Feb 2006 23:23:06 +0000 (15:23 -0800)]
combine-diff: move formatting logic to show_combined_diff()
This way, diff-files can make use of it. Also implement the
full suite of what diff_flush_raw() supports just for
consistency. With this, 'diff-tree -c -r --name-status' would
show what is expected.
There is no way to get the historical output (useful for
debugging and low-level Plumbing work) anymore, so tentatively
this makes '-m' mean "do not combine and show individual diffs
with parents".
diff-files matches diff-tree to produce raw output for -c. For
textual combined diff, use -p -c.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Thu, 9 Feb 2006 22:41:39 +0000 (14:41 -0800)]
call git_config() after setup_git_directory()
If you call setup_git_directory() to work from a subdirectory,
that should be run first before running git_config(). Otherwise
you would not read the configuration file from the correct place.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Thu, 9 Feb 2006 22:35:19 +0000 (14:35 -0800)]
combined-diff: use diffcore before intersecting paths.
This is needed to make "diff-tree -c -M" work semi-sensibly.
Otherwise rename detection, pickaxe and friends would never be
invoked.
Signed-off-by: Junio C Hamano <junkio@cox.net>
Jon Loeliger [Thu, 9 Feb 2006 15:12:11 +0000 (09:12 -0600)]
Add --diff-filter= documentation paragraph
Signed-off-by: Jon Loeliger <jdl@jdl.com>
Signed-off-by: Junio C Hamano <junkio@cox.net>
Linus Torvalds [Thu, 9 Feb 2006 18:30:28 +0000 (10:30 -0800)]
diff-tree -c raw output
NOTE! This makes "-c" be the default, which effectively means that merges
are never ignored any more, and "-m" is a no-op. So it changes semantics.
I would also like to make "--cc" the default if you do patches, but didn't
actually do that.
The raw output format is not wonderfully pretty, but it's distinguishable
from a "normal patch" in that a normal patch with just one parent has just
one colon at the beginning, while a multi-parent raw diff has <n> colons
for <n> parents.
So now, in the kernel, when you do
git-diff-tree cce0cac125623f9b68f25dd1350f6d616220a8dd
(to see the manual ARM merge that had a conflict in arch/arm/Kconfig), you
get
cce0cac125623f9b68f25dd1350f6d616220a8dd
::100644 100644 100644 4a63a8e2e45247a11c068c6ed66c6e7aba29ddd9 77eee38762d69d3de95ae45dd9278df9b8225e2c 2f61726d2f4b636f6e66696700dbf71a59dad287 arch/arm/Kconfig
ie you see two colons (two parents), then three modes (parent modes
followed by result mode), then three sha1s (parent sha1s followed by
result sha1).
Which is pretty close to the normal raw diff output.
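As an illustration, a line in this format can be picked apart by counting the leading colons. The sketch below is Python written for this example (the function name and the returned dictionary keys are made up), and it only handles the multi-parent form exactly as shown above:

```python
def parse_combined_raw(line):
    """Split a multi-parent raw diff line into modes, sha1s and path."""
    line = line.rstrip("\n")
    nparents = len(line) - len(line.lstrip(":"))  # one colon per parent
    if nparents < 2:
        raise ValueError("not a multi-parent raw diff line")
    count = nparents + 1                          # parents plus the result
    # 2 * count fields come before the path; the path keeps its spaces.
    fields = line[nparents:].split(None, 2 * count)
    if len(fields) != 2 * count + 1:
        raise ValueError("malformed line")
    return {
        "parent_modes": fields[:nparents],
        "result_mode": fields[nparents],
        "parent_sha1s": fields[count:count + nparents],
        "result_sha1": fields[count + nparents],
        "path": fields[2 * count],
    }
```

Fed the arch/arm/Kconfig line above, this yields two parent modes, two parent sha1s, the result mode and sha1, and the path.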
Cool/stupid exercise:
$ git-whatchanged | grep '^::' | cut -f2- | sort |
uniq -c | sort -n | less -S
will show which files have needed the most file-level merge conflict
resolution. Useful? Probably not. But kind of interesting.
For the kernel, it's
....
10 arch/ia64/Kconfig
11 drivers/scsi/Kconfig
12 drivers/net/Makefile
17 include/linux/libata.h
18 include/linux/pci_ids.h
23 drivers/net/Kconfig
24 drivers/scsi/libata-scsi.c
28 drivers/scsi/libata-core.c
43 MAINTAINERS
Signed-off-by: Junio C Hamano <junkio@cox.net>
Junio C Hamano [Thu, 9 Feb 2006 08:55:45 +0000 (00:55 -0800)]
Merge branch 'jc/nostat'
* jc/nostat:
"Assume unchanged" git: --really-refresh fix.