﻿<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Dedupe to tape: Are you crazy?</title>
	<atom:link href="http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/</link>
	<description>A blog for IT Admins who do everything by an IT Admin who does everything</description>
	<lastBuildDate>Thu, 11 Mar 2010 18:01:50 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: UX-admin</title>
		<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/comment-page-1/#comment-3513</link>
		<dc:creator>UX-admin</dc:creator>
		<pubDate>Tue, 03 Nov 2009 15:29:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/#comment-3513</guid>
		<description>&quot;If you lose the 1 tape that contains the deduplicated data, though, then you immediately have a Bad Day(tm).&quot;

That&#039;s why you do something called &quot;cloning&quot; at the end of the month, where all your full backup sets are consolidated onto a clone. &quot;Deduped&quot; clone in this case, of course.

For example, Legato NetWorker has the capability to produce clones (and automatically, too). I don&#039;t know if it has the capability to deduplicate, but the groundwork is there. Other backup software likely has some sort of cloning capability too, and if it doesn&#039;t, it should. It&#039;s one&#039;s &#039;insurance policy&#039; and consolidation after all, and that&#039;s not to be taken lightly.</description>
		<content:encoded><![CDATA[<p>&#8220;If you lose the 1 tape that contains the deduplicated data, though, then you immediately have a Bad Day(tm).&#8221;</p>
<p>That&#8217;s why you do something called &#8220;cloning&#8221; at the end of the month, where all your full backup sets are consolidated onto a clone. &#8220;Deduped&#8221; clone in this case, of course.</p>
<p>For example, Legato NetWorker has the capability to produce clones (and automatically, too). I don&#8217;t know if it has the capability to deduplicate, but the groundwork is there. Other backup software likely has some sort of cloning capability too, and if it doesn&#8217;t, it should. It&#8217;s one&#8217;s &#8216;insurance policy&#8217; and consolidation after all, and that&#8217;s not to be taken lightly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James</title>
		<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/comment-page-1/#comment-3487</link>
		<dc:creator>James</dc:creator>
		<pubDate>Mon, 26 Oct 2009 18:30:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/#comment-3487</guid>
		<description>The old backup system: Full backup Saturday night, differentials the rest of the week. This requires us to have both the full backup tape from Saturday night and the nightly differential in order to perform a restoration

The new backup system: Full backup Saturday night, dedupe backups the rest of the week. This requires us to have both the full backup tape from Saturday night and the nightly dedupe in order to perform a restoration. 

The more things change...

Frankly, this deduping sounds like a defensive measure implemented by a tech in a fight with a Dilbert style PHB who has declared that incrementals/differentials are bad since they are not full backups. The PHB doesn&#039;t care that full backups take longer than 24 hours, he just declares that the tech must get everything working. The tech knows that he needs to do full backup on Saturday, then incrementals or differentials during the week. So the tech creates this new name &#039;deduping&#039;. The PHB is happy and goes back to his golf game, and the data is safe under the same general procedures that have been working fine for thirty or so years in the computer industry.</description>
		<content:encoded><![CDATA[<p>The old backup system: Full backup Saturday night, differentials the rest of the week. This requires us to have both the full backup tape from Saturday night and the nightly differential in order to perform a restoration</p>
<p>The new backup system: Full backup Saturday night, dedupe backups the rest of the week. This requires us to have both the full backup tape from Saturday night and the nightly dedupe in order to perform a restoration. </p>
<p>The more things change&#8230;</p>
<p>Frankly, this deduping sounds like a defensive measure implemented by a tech in a fight with a Dilbert style PHB who has declared that incrementals/differentials are bad since they are not full backups. The PHB doesn&#8217;t care that full backups take longer than 24 hours, he just declares that the tech must get everything working. The tech knows that he needs to do full backup on Saturday, then incrementals or differentials during the week. So the tech creates this new name &#8216;deduping&#8217;. The PHB is happy and goes back to his golf game, and the data is safe under the same general procedures that have been working fine for thirty or so years in the computer industry.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Preston de Guise</title>
		<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/comment-page-1/#comment-3486</link>
		<dc:creator>Preston de Guise</dc:creator>
		<pubDate>Mon, 26 Oct 2009 06:09:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/#comment-3486</guid>
		<description>I see dedupe to tape being about as irritating and slow as file level recovery from block level backups.

File level recovery from block level backups requires an interim recovery of required blocks from media into a cache, with files/data then reconstructed out of that cache. The more heavily fragmented the files originally at backup time, the slower and more painful this process is (or the significantly larger the cache space required to minimise media passes!)

I don&#039;t see how recovery from dedupe tape would be any different from this.</description>
		<content:encoded><![CDATA[<p>I see dedupe to tape being about as irritating and slow as file level recovery from block level backups.</p>
<p>File level recovery from block level backups requires an interim recovery of required blocks from media into a cache, with files/data then reconstructed out of that cache. The more heavily fragmented the files originally at backup time, the slower and more painful this process is (or the significantly larger the cache space required to minimise media passes!)</p>
<p>I don&#8217;t see how recovery from dedupe tape would be any different from this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert Sander</title>
		<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/comment-page-1/#comment-3483</link>
		<dc:creator>Robert Sander</dc:creator>
		<pubDate>Sun, 25 Oct 2009 22:26:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/#comment-3483</guid>
		<description>We are using an rsync based backup to disk tool (www.dirvish.org). It uses rsync&#039;s --link-destination feature for file-based de-duplication between runs. This saves space, because after the initial run all backup runs are incremental. But you have a full backup for every run as non-changed files are just hardlinks to their already existing copy on the backup volume.

And we do at least bi-weekly tape backups of the current snapshot to store them off-site.

There are other tools (rsnapshot, rdiff-backup) with a similar approach.

This may or may not work in your environment, especially the original data should be rsync-friendly (e.g. a directory tree with only some not so big files changing, i.e. not a database).</description>
		<content:encoded><![CDATA[<p>We are using an rsync based backup to disk tool (www.dirvish.org). It uses rsync&#8217;s &#8211;link-destination feature for file-based de-duplication between runs. This saves space, because after the initial run all backup runs are incremental. But you have a full backup for every run as non-changed files are just hardlinks to their already existing copy on the backup volume.</p>
<p>And we do at least bi-weekly tape backups of the current snapshot to store them off-site.</p>
<p>There are other tools (rsnapshot, rdiff-backup) with a similar approach.</p>
<p>This may or may not work in your environment, especially the original data should be rsync-friendly (e.g. a directory tree with only some not so big files changing, i.e. not a database).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: morphium</title>
		<link>http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/comment-page-1/#comment-3480</link>
		<dc:creator>morphium</dc:creator>
		<pubDate>Sun, 25 Oct 2009 15:24:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.standalone-sysadmin.com/blog/2009/10/dedupe-to-tape-are-you-crazy/#comment-3480</guid>
		<description>I completely agree with you. Nothing more to say.

morphium</description>
		<content:encoded><![CDATA[<p>I completely agree with you. Nothing more to say.</p>
<p>morphium</p>
]]></content:encoded>
	</item>
</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->