dirstate walking optimizations
The repo walking code introduces a number of calls to dirstate.map.copy(),
significantly slowing down the walk on large trees. When a list of
files is passed to the walking code, we should only look at map entries
relevant to the file list passed in.
dirstate.filterfiles() is added to return a subset of the dirstate map.
The subset includes in files passed in, and if one of the files requested
is actually a directory, it includes any files inside that directory tree.
This brings the time for hg diff Makefile down from 1.7s to .3s on
a linux kernel repo.
Also, the diff command was unconditionally calling makewalk, leading
to an extra pass through repo.changes. This patch avoids the call
to makewalk when commands.diff isn't given a list of patterns, cutting
the time for hg diff (with no args) in half.
Index: mine/mercurial/hg.py
===================================================================
#header#
<title>#repo|escape#: changeset #node|short#</title>
</head>
<body>
<div class="buttons">
<a href="?cmd=changelog;rev=#rev#">changelog</a>
<a href="?cmd=tags">tags</a>
<a href="?cmd=manifest;manifest=#manifest#;path=/">manifest</a>
<a href="?cmd=changeset;node=#node#;style=raw">raw</a>
</div>
<h2>changeset: #desc|escape|firstline#</h2>
<table id="changesetEntry">
<tr>
<th class="changeset">changeset #rev#:</th>
<td class="changeset"><a href="?cmd=changeset;node=#node#">#node|short#</a></td>
</tr>
#parent#
#changesettag#
<tr>
<th class="manifest">manifest:</th>
<td class="manifest"><a href="?cmd=manifest;manifest=#manifest#;path=/">#manifest|short#</a></td>
</tr>
<tr>
<th class="author">author:</th>
<td class="author">#author|obfuscate#</td>
</tr>
<tr>
<th class="date">date:</th>
<td class="date">#date|date# (#date|age# ago)</td></tr>
<tr>
<th class="files">files:</th>
<td class="files">#files#</td></tr>
<tr>
<th class="description">description:</th>
<td class="description">#desc|escape|addbreaks#</td>
</tr>
</table>
<div id="changesetDiff">
#diff#
</div>
</body>
</html>