#deduplication #software #open #source
I am looking for a solution for backing up about 30 of my servers that are a mix of centos, windows 2003 and 2008 machines.
I have a san in another data center with a 1 Gig link between the sites that im going to be backing up onto so this is purely a disk based backup solution.
What I am ideally looking for is some open source software possible with a nice gui that will allow me to have a backup cycle with full backups on a monthly basis that go back 6 months and weekly backups that are saved for one month. I’m hoping for de duplication as well so if a file is saved on the first backup the software creates a symlink to it rather then saving the data again in all the proceeding backups. It must also be smart enough to know when a file is deleted that the file is moved to the latest backup it exists in and all symlinks to it are updated.
I was looking at bacula but im unsure if it can do the deduplication or something similar. If it can will someone be kind enough to point me at a decent guide as to how to do it.
If it does not can someone recommend another solution open source or not?
asked Feb 17 ’11 at 15:51
closed as off topic by Bryan. MadHatter. mdpc. Ward. rnxrx Dec 1 ’12 at 8:03
Questions on Server Fault are expected to relate to server, networking, or related infrastructure administration within the scope defined by the community. Consider editing the question or leaving comments for improvement if you believe the question can be reworded to fit within the scope. Read more about reopening questions here. If this question can be reworded to fit the rules in the help center. please edit the question.
I think bacula has deduplication, but it might only be available with the enterprise version of it. It was causing us some issues so our team decided to turn it off to fulfill a security requirement. user1207381 Dec 3 ’15 at 22:41
Something simple like rsnapshot may do what you want. It doesn’t sound like you want what’s traditionally referred to as deduplication. The synthetic full backup functionality using hard-links that rsnapshot provides sounds like exactly what you’re asking for.
(On the traditional deduplication front you may want to look at the Opendedup project as a starting point. It’s the most complete open source data deduplication project that I’ve seen. It uses a custom filesystem (SDFS) as the basis of its functionality. I tend to think that this isn’t want you’re looking for, but in the interests of completeness I mention it here.)
answered Feb 17 ’11 at 16:07