mercurial-crew-with-dirclash: comparison notes.txt

equal deleted inserted replaced

-:45c293b71341
+:d7ce76d82876
 revision's contents and its graph position relative to the root, so
 merge is simply checking whether each nodeid in graph A is in the hash
 table of graph B. If not, we pull them in, adding them sequentially to
 the revlog.
-Graph resolving:
+Branching and merging:
-Mercurial does branching by copying (or COWing) a repository and thus
+Everything in Mercurial is potentially a branch and every user
-keeps everything nice and linear within a repository. However, when a
+effectively works in their own branch. When you do a checkout,
-merge of repositories (a "pull") is done, we may often have two head
+Mercurial remembers what the parent changeset was and uses it for the
-revisions in a given graph. To keep things simple, Mercurial forces
+next check in.
-the head revisions to be merged.
-It first finds the closest common ancestor of the two heads. If one is
+To do a merge of branches in Mercurial, you check out the heads of the
-a child of the other, it becomes the new head. Otherwise, we call out
+two branches into the same working directory which causes a merge to
-to a user-specified 3-way merge tool.
+be performed, and then check in the result once you're happy with it.
+The resulting checkin will have two parents.
-Merging files, manifests, and changesets:
+It decides when a merge is necessary by first determining if there are
+any uncommitted changes in the working directory. This effectively
+makes the working directory a branch off the checked in version it's
+based on. Then it also determines if the working directory is a direct
+ancestor or descendent of the second version we're attempting to
+checkout. If neither is true, we simply replace the working directory
+version with the new version. Otherwise we perform a merge between the
+two versions.
-We begin by comparing changeset DAGs, pulling all nodes we don't have
+Merging files and manifests:
-in our DAG from the other repository. As we do so, we collect a list
-of changed files to merge.
+We begin by comparing two versions manifests and deciding which files
+need to be added, deleted, and merged.
 Then for each file, we perform a graph merge and resolve as above.
 It's important to merge files using per-file DAGs rather than just
 changeset level DAGs as this diagram illustrates:
 ???    depending on which ancestor we choose, we will have
 	        to redo A hand-merge, B hand-merge, or both
 but if we look at the files independently, everything
 		is fine
-After we've merged files, we merge the manifest log DAG and resolve
+The result is a merged version in the working directory, waiting for
-additions and deletions. Then we are ready to resolve the changeset
+check-in.
-DAG - if our merge required any changes (the new head is not a
-decendent of our tip), we must create a new changeset describing all
-of the changes needed to merge it into the tip.
-Merge performance:
-The I/O operations for performing a merge are O(changed files), not
-O(total changes) and in many cases, we needn't even unpack the deltas
-to add them to our repository (though this optimization isn't
-necessary).
 Rollback:
 When performing a commit or a merge, we order things so that the
 changeset entry gets added last. We keep a transaction log of the name
 of each file touched and its length prior to the transaction. On
 abort, we simply truncate each file to its prior length. This is one
 of the nice properties of the append-only structure of the revlogs.
+We can also reuse this journal for "undo".
-Remote access:
+Merging between repositories:
-Mercurial currently supports pulling from "serverless" repositories.
+One of the key features of Mercurial is the ability to merge between
-Simply making the repo directory accessibly via the web and pointing
+independent repositories in a decentralized fashion. Each repository
-hg at it can accomplish a pull. This is relatively bandwidth efficient
+can act as a read-only server or a client. Clients operating by
-but no effort has been spent on pipelining, so it won't work
+pulling all branches that it hasn't seen from the server and adding
-especially well over LAN yet.
+them into its graph. This is done in two steps: searching for new
+"roots" and pulling a "changegroup"
-It's also quite amenable to rsync, if you don't mind keeping an intact
+Searching for new "roots" begins by finding all new heads and
-copy of the master around locally.
+searching backwards from those heads to the first unknown nodes in
+their respective branches. These nodes are the 'roots' that are used
+to calculate the 'changegroup': the set of all changesets starting at
+those roots. Mercurial takes pains to make this search efficient in
+both bandwidth and round-trips.
-Also note the append-only and ordering properties of the commit
+Once the roots are found, the changegroup can be transferred as a
-guarantee that readers will always see a repository in a consistent
+single streaming transfer. This is organized as an ordered set of
-state and no special locking is necessary. As there is generally only
+deltas for changesets, manifests, and files. Large chunks of deltas
-one writer to an hg repository, there is in fact no exclusion
+can be directly added to the repository without unpacking so it's
-implemented yet.
+fairly fast.
-Some comparisons to git:
-Most notably, Mercurial uses delta compression and repositories
-created with it will grow much more slowly over time. This also allows
-it to be much more bandwidth efficient. I expect repos sizes and sync
-speeds to be similar to or better than BK, given the use of binary diffs.
-Mercurial is roughly the same performance as git in some areas and is
-faster in others as it keeps around more metadata. One example is
-listing and retrieving past versions of a file, which it can do
-without reading all the changesets. This metadata will also allow it
-to perform better merges as described above.

changeset 260	d7ce76d82876
parent 59	2bff7c0ea1d3
child 1308	2073e5a71008