Re: [darcs-users] Re: [BK] upgrade will be needed

From: Andrea Arcangeli
Date: Fri Feb 18 2005 - 07:51:44 EST


On Fri, Feb 18, 2005 at 12:53:09PM +0100, Erik Bågfors wrote:
> RCS/SCCS format doesn't make much sence for a changeset oriented SCM.

The advantage it will provide is that it'll be compact and a backup will
compress at best too. Small compressed tarballs compress very badly
instead, it wouldn't be even comparable. Once the thing is very compact
it has a better chance to fit in cache, and if it fits in cache
extracting diffs from each file will be very fast. Once it'll be compact
the cost of a changeset will be diminished allowing it to scale better
too.

Now it's true new disks are bigger, but they're not much faster, so if
the size of the repository is much larger, it'll be much slower to
checkout if it doesn't fit in cache. And if it's smaller it has better
chances of fitting in cache too.

The thing is, RCS seems a space efficient format for storing patches,
and it's efficient at extracting them too (plus it's textual so it's not
going to get lost info even if something goes wrong).

The whole linux-2.5 CVS is 500M uncompressed and 75M tar.bz2 compressed.

My suggestion is to convert _all_ dozen thousand changesets to arch or
SVN and then compare the size with CVS (also the compressed size is
interesting for backups IMHO). Unfortunately I know nothing about darcs
yet (except it eats quite some memory ;)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/