How Forever Incremental Backup Changed the World

In this episode of The Backup Wrap-Up, Curtis and Prasanna explore how forever incremental backup technology revolutionized the data protection industry. They discuss the evolution from traditional backup methods to modern approaches that eliminate the need for regular full backups, dramatically reducing network traffic, storage requirements, and backup windows.
The hosts examine the technical foundations of forever incremental backups, from block-level incremental tracking to backend storage innovations that make multiple recovery points possible without redundant data transfers. They compare older approaches like synthetic fulls with true forever incremental implementations, highlighting the critical differences and benefits. Whether you're still using legacy backup tools or evaluating modern solutions, this episode provides essential insights into why forever incremental has become the standard for efficient, reliable backup systems.
You found the backup wrap up your go-to podcast for all things
Speaker:backup recovery and cyber recovery.
Speaker:In this episode, we're covering forever incremental backup technology and
Speaker:how that completely revolutionized the entire backup industry.
Speaker:I. Remember the old days of doing weekly fos that brought
Speaker:your systems to their knees?
Speaker:Well, those days are thankfully gone for most people.
Speaker:We're gonna walk through the evolution of, from traditional approaches with
Speaker:fools and incrementals to synthetic fos, and finally to true forever incremental.
Speaker:If you're still doing things the old way, well, you're missing
Speaker:out on some serious benefits.
Speaker:Better performance, lower storage costs, and way faster restores.
Speaker:Perhaps this episode will cause you to rethink things a little bit.
Speaker:By the way, if you don't know who I am, I'm w Curtis Preston, AKA, Mr. Backup,
Speaker:and I've been passionate about backup and recovery for over 30 years, ever since.
Speaker:I had to tell my boss that we had no backups of the production
Speaker:database that we had just lost.
Speaker:I don't want that to happen to you, and that's why I do this.
Speaker:On this podcast, we turn unappreciated backup admins into Cyber Recovery Heroes.
Speaker:This is the backup wrap up.
Speaker:Welcome to the show.
Speaker:Hi, I am w Curtis Preston, AKA, Mr. Backup, and I have with me a guy that
Speaker:I bet is pretty dang happy right now.
Speaker:Prasanna Malaiyandi.
Speaker:Why are you looking like that?
Speaker:very confused about what I'm happy about.
Speaker:Really?
Speaker:Do I need to explain recent events that happened at your house
Speaker:Oh
Speaker:that, that.
Speaker:For those of you by the way, um, who are listening to this podcast via
Speaker:audio, we also provide video on YouTube.
Speaker:So if you wanted to see our expressions just then and really see what Curtis
Speaker:is laughing about, uh, go check us
Speaker:Yeah,
Speaker:Backup wrap up.
Speaker:so what events happened in the last 48 hours?
Speaker:my wife was in India
Speaker:Yeah.
Speaker:Visiting family and she just got back.
Speaker:Yes.
Speaker:it was good.
Speaker:Suddenly, you're so much harder to get on the phone between that and your new job.
Speaker:I don't even know you anymore.
Speaker:He's like, why would I be happy?
Speaker:I, I was really, I was, I thought it was like, I was thinking, I was like,
Speaker:was it something YouTube related?
Speaker:Was it something like I, we had watched or we had talked about and.
Speaker:No, the fact that your wife is back
Speaker:although she's also slightly jet lagged, so,
Speaker:so well, it's what, what, what's it is 12 hours difference.
Speaker:right now it is 13 and a half.
Speaker:Yeah.
Speaker:Yeah.
Speaker:So
Speaker:does
Speaker:of course she's,
Speaker:jet lagged than I do.
Speaker:So
Speaker:yeah.
Speaker:and come back, it takes me like three weeks.
Speaker:And,
Speaker:Yeah.
Speaker:have a dog, which I think some people have seen on the podcast.
Speaker:Um, it's funny because he gets jet lagged when I'm jet lagged.
Speaker:So he, his hours will shift to match with mine.
Speaker:So
Speaker:That's really adorable.
Speaker:and he would be like, Hey, I'm ready.
Speaker:Let's go, let's go.
Speaker:And then when I'm passed out, he's asleep.
Speaker:So he's
Speaker:Interesting.
Speaker:Yes.
Speaker:Your dog that's named after an Indian dessert.
Speaker:cream.
Speaker:Yep.
Speaker:Ice cream.
Speaker:Oh yeah.
Speaker:Ice cream.
Speaker:Yeah.
Speaker:Kulfi, which I have now had, um.
Speaker:your, did you know they have various flavors of Kulfi.
Speaker:They're not all just like one.
Speaker:Oh, I thought it was all mango.
Speaker:So it's met like mango Kulfi and then what?
Speaker:Coconut Kulfi.
Speaker:What are what?
Speaker:What are the other typical
Speaker:and pistachio.
Speaker:Pistachio and, and.
Speaker:I think there are other ones too.
Speaker:Yeah.
Speaker:I don't, yeah, I don't.
Speaker:I've used cardamom as a, an ingredient, you know, when I made the, the
Speaker:mango pie thing, but I don't think I know what it tastes like by itself.
Speaker:So I'll have to, I'll have to sprinkle some and, uh, see what that tastes like.
Speaker:Or maybe just go and
Speaker:Yeah.
Speaker:some
Speaker:next time you're at the Indian store.
Speaker:Actually, sometimes even the Costcos carry like a sampler pack
Speaker:Oh, yeah, so like when I go to the cash and carry down in Mira Mesa.
Speaker:the cash and carry will definitely have it.
Speaker:If you go check out the freezer section.
Speaker:Just
Speaker:Interesting.
Speaker:all before you get home.
Speaker:Yeah.
Speaker:I, oh man.
Speaker:I'm, you're, you're literally like, like my mouth is Yeah, yeah.
Speaker:Thinking about, thinking about it sounds yummy.
Speaker:Hmm.
Speaker:Uh, so we're gonna talk about something that's just as yummy.
Speaker:That is if you're into, into the backup thing, uh, we're gonna talk
Speaker:about forever incremental and sort of.
Speaker:Uh, just how big of a deal that is or was.
Speaker:Um, it, at, at this point?
Speaker:It's in, in most modern backup products that have come out in
Speaker:the last 10 years, I'd say, uh, ones that are actually new, right?
Speaker:So, you know, when we look at the products like Rubrik, Cohesity, or
Speaker:modern SaaS applications like Druva, Hyku, keep it, Alcion, uh, right.
Speaker:You know, these are all of these products.
Speaker:Have forever.
Speaker:Incremental in common, right?
Speaker:Nobody wants to do full backups anymore.
Speaker:Uh, or, or anything like full backups, right?
Speaker:So, um, so let's talk about that.
Speaker:Let's talk about what, you know, where were we, what was it like back in the
Speaker:day, and, um, and then sort of how we got the, from there to here and why it's
Speaker:such a big deal and why you should care.
Speaker:Right.
Speaker:And why this perhaps may give you a reason to finally, um, you
Speaker:know, what, what is it, shuck?
Speaker:The mortal coil.
Speaker:What, what, what, what is that?
Speaker:What,
Speaker:I'm just
Speaker:thinking are there company, and maybe as we're talking we can figure this out, like
Speaker:are there still vendors that exist that don't support forever Incrementals?
Speaker:That's actually a really good question and I'm gonna say, uh, absolutely
Speaker:right, because my concept of what a true forever incremental will be based on.
Speaker:Like a, a complete redesign of the, of the underneath.
Speaker:Right.
Speaker:Um, so the phrase I was looking for is shuttle off sh shuffle off the
Speaker:mortal coil of your old backup product,
Speaker:by the way, wait, quick story.
Speaker:So I'm on, I'm on a plane one time and this, this, um.
Speaker:This pilot, he was talking about the fact that we were about to take off and
Speaker:he referred to taking off as shuffling off the mortal coil of this earth.
Speaker:And I'm like, dude, that's a phrase for dying.
Speaker:What?
Speaker:Why the hell are you, why would you do that?
Speaker:Right.
Speaker:, it's a Shakespeare, it's from the to be or not to be soliloquy
Speaker:I,
Speaker:Hamlet.
Speaker:in all fairness though, I would say 97% of people probably had
Speaker:no idea what that phrase meant.
Speaker:They, they never even heard.
Speaker:Yeah.
Speaker:Yeah.
Speaker:And, and, and, and, and just to, just to, uh, like, you know, since I'm over
Speaker:here quoting Shakespeare, let me just put this, I'm not a huge Shakespeare person.
Speaker:In fact, it, it wasn't until, I dunno, several years ago, and we were in England
Speaker:where I, I, I got to actually see my first, um, uh, Shakespeare play, which
Speaker:was Henry V. Some people say Henry the VI don't, I think it's actually
Speaker:called Henry five, which is the, um.
Speaker:The one with the St. Crispin stay speech.
Speaker:Uh, once more into the breach, close the wall up with, with
Speaker:her English dead and all that.
Speaker:But anyway, sorry, I just bored everybody.
Speaker:Sorry.
Speaker:Sorry, you're asleep.
Speaker:Wake back up.
Speaker:Wake back up.
Speaker:We're, we're gonna talk about backup, which is so much more
Speaker:exciting than Shakespeare.
Speaker:Um, so, all right, so first let's talk about, do you want to define, uh, so what?
Speaker:let's talk about the old days first.
Speaker:that's what I, that's what I'm saying.
Speaker:That's what I'm saying.
Speaker:Let's, do you wanna define, uh, what a full and, and
Speaker:Yeah.
Speaker:an incremental backup is?
Speaker:Yeah.
Speaker:So.
Speaker:Normally when you think about backups, right, you talk,
Speaker:typically people talk about fulls.
Speaker:Incrementals, right?
Speaker:A full is basically your entire application, dataset, whatever
Speaker:it is, everything is backed up.
Speaker:Mm-hmm.
Speaker:then between doing every fulls, you would do incrementals, which are just the things
Speaker:that have changed since that last full.
Speaker:Right, right.
Speaker:you do these periodically, and normally you do incrementals
Speaker:because they will be smaller.
Speaker:So you could say do a full once a week and then incrementals during the week
Speaker:such that you can finish in time or during your backup window, and then the next
Speaker:weekend you would do like another full,
Speaker:Right, right now, slightly, I'll slightly change your
Speaker:definition or a pen or whatever.
Speaker:Um, so you specifically described what I would call a
Speaker:cumulative incremental, right?
Speaker:So a, uh, what is referred to as a sort of a regular, incremental
Speaker:or a differential incremental.
Speaker:Yep,
Speaker:it is one where it's the changes just since the last incremental or since
Speaker:the last backup of any kind, right?
Speaker:And then you
Speaker:Um,
Speaker:levels and all the
Speaker:yeah.
Speaker:Yeah, yeah,
Speaker:But yeah, at a high level, yeah, there are those three types, if you will.
Speaker:yeah.
Speaker:And, and, and then what really, um, what really what it really
Speaker:came down to is how things behaved when it came time for a restore.
Speaker:Right.
Speaker:So if you were going to restore something, then you needed the full,
Speaker:the latest cumulative, incremental, if you were doing such a thing, and
Speaker:then any incrementals since that day.
Speaker:And so, um, those of us, uh.
Speaker:I, I remember at some point we, we used to do weekly fulls.
Speaker:I remember my preferred backup design switching to monthly fulls,
Speaker:weekly cumulative incrementals, and then daily incrementals.
Speaker:So if you were, let's say three weeks into the month, you would need the full
Speaker:from the first of the month, the latest, uh, weekly cumulative, incremental,
Speaker:and then three or four days worth of.
Speaker:Incrementals
Speaker:Yeah,
Speaker:and, uh, in order to do a restore.
Speaker:. So what it meant was that you would often, the way the Restore worked is you
Speaker:would restore everything that was in the full, then you would restore everything
Speaker:that was in the cumulative incremental, and then you would restore everything
Speaker:that was in each of the incrementals.
Speaker:What that meant was there were many files that you were actually restoring
Speaker:multiple times and you were overwriting them, which was just inefficient.
Speaker:Um, and what that also meant was.
Speaker:That you had, uh, uh, duplicate data, the old way of doing backups created
Speaker:a ton of duplicate data on the tapes.
Speaker:You could say duplicate, or you could also say it's independent,
Speaker:depending on the glass.
Speaker:Half full glass, half empty.
Speaker:Well, so no, I, I understand what you're saying.
Speaker:What I'm talking about is the fulls,
Speaker:Yeah,
Speaker:the fulls over time created a ton of duplicate data.
Speaker:that's true.
Speaker:Right.
Speaker:I know what you're thinking.
Speaker:You're thinking, you're talking about the, within the, within the, the incrementals.
Speaker:I'm mainly talking about
Speaker:yeah,
Speaker:how many different copies of the exact same file were all
Speaker:over, you know, everywhere.
Speaker:and especially because most times, right, you look at sort of like a
Speaker:2% daily change rate, which maybe
Speaker:Yeah.
Speaker:a seven or 8% change rate per week.
Speaker:And
Speaker:Yeah.
Speaker:those weekly cumulatives or even the monthly fulls, right, you're
Speaker:basically backing up a bunch of data, which you don't necessarily
Speaker:need that haven't changed.
Speaker:Right, right.
Speaker:You, you, the reason we did it the way we did, it wasn't so much about independence.
Speaker:It was about.
Speaker:Minimize.
Speaker:So if you did a monthly full, followed by nothing but regular
Speaker:incrementals, you would have to load 30 tapes at the end of the day, right?
Speaker:At the end of, and even if it wasn't tapes, you would have
Speaker:to, you have to do 30 restores.
Speaker:Yeah,
Speaker:So, so the, the, the, the differential or the cumulative incremental, um,
Speaker:was about minimizing the level of effort during a restore, right?
Speaker:And then came along this thing called dedupe.
Speaker:Uh, which at least helped minimize, I think the, all of that duplicate data.
Speaker:because I think with deduplication it was nice because you didn't
Speaker:have to change what you were doing from like a policy perspective
Speaker:Right,
Speaker:perspective.
Speaker:But you got, like you said, it was able to.
Speaker:Dedupe, of course,
Speaker:right.
Speaker:in the backend and save you the storage costs.
Speaker:And that's why when, if you, if you were doing a, a typical weekly, by
Speaker:the way, nobody did the, the monthly full thing that I recommended, most
Speaker:people did weekly fulls, right?
Speaker:And so when you had weekly fulls, um.
Speaker:This is why, and you, and you had like 90 day retention.
Speaker:This is why just with even crappy dedupe, you got 20 to one,
Speaker:Yeah.
Speaker:uh, data reduction, right?
Speaker:And this is what helped, um, pay for, you know, the what to help
Speaker:created the, the market for all that, all that dedupe work, right?
Speaker:And then we could argue over, um, you know, well we're, we're
Speaker:20 to one, we're 30 to one, we're, you know, whatever, right?
Speaker:And, um, we, we saved this much more money.
Speaker:Um.
Speaker:You know, once you get rid of all those fulls, everything else
Speaker:was, was really kind of incre
Speaker:Yeah.
Speaker:incremental.
Speaker:Um, but the real, I think the real challenge was just the core backup design
Speaker:of when we go to do a restore, we have to restore, uh, data multiple times.
Speaker:Right.
Speaker:It just simply wasn't efficient.
Speaker:Well, and I think even before we get there, I'd like to cover
Speaker:Mm-hmm.
Speaker:uh,
Speaker:Okay.
Speaker:if you had the existing backup schedule, right, where I'm still doing
Speaker:weekly fulls and daily incrementals,
Speaker:Mm-hmm.
Speaker:still transferring all that data over the network.
Speaker:Right?
Speaker:And I think this is where client side deduplication would probably
Speaker:come next, in my opinion, right?
Speaker:Yep.
Speaker:it would help you reduce the amount of data you're actually
Speaker:sending over the network.
Speaker:Right.
Speaker:And that'll help save you cost as well as time.
Speaker:Yeah, that's actually a really good point.
Speaker:Thanks for bringing that up.
Speaker:So the, one of the other like, like basically what you're saying that one
Speaker:of the other, um, downsides to doing it the old way was both the transfer
Speaker:across the network as well as the impact, uh, to the clients, right?
Speaker:If you're doing a full, on a regular basis, um, you're, um.
Speaker:There's a, there's a cost to that
Speaker:Yep.
Speaker:in both, you know, you brought up bandwidth, it's
Speaker:bandwidth storage and compute.
Speaker:Right?
Speaker:Um, it's like all the things.
Speaker:There was also a, um, sort of a, like a perfect storm brewing.
Speaker:Yeah.
Speaker:So we're doing fulls on a regular basis.
Speaker:We're doing cumulative incrementals on a regular basis, and this is
Speaker:how we were doing things, and then along came virtualization.
Speaker:Why was that a problem?
Speaker:Because a lot of the data was
Speaker:I,
Speaker:the same, right?
Speaker:When you look across all of your VMs, they're probably spun up
Speaker:from the same golden copy, of your
Speaker:Well, the,
Speaker:and
Speaker:yeah, well, the.
Speaker:amongst them.
Speaker:Well, that, that's a, this is another example of, that's a great answer.
Speaker:It's not the question I was asking.
Speaker:It's
Speaker:Um,
Speaker:resource on the backend because
Speaker:that's what I was going after.
Speaker:Yeah,
Speaker:and the compute, right?
Speaker:You don't have your
Speaker:yeah,
Speaker:instance
Speaker:yeah.
Speaker:That's, that's what I was going after.
Speaker:Right?
Speaker:So, so that was a real problem when VMware got really popular and we just
Speaker:kept doing the same old thing because.
Speaker:Remember, for a long time we backed up VMware.
Speaker:We backed up the physical, the virtual machines, just like
Speaker:they were physical machines.
Speaker:Yes.
Speaker:Guest ba. We just basically installed an A, an agent and every guest,
Speaker:and that's how we did the backups.
Speaker:And, you know, and what happened was you'd do just randomly, you'd end up
Speaker:doing a bunch of, at the same time.
Speaker:Which worked fine when we were 20 individual machines, but now we're 20
Speaker:VMs all on the same physical machine, and that just broke, like virtualization
Speaker:just broke back up overnight.
Speaker:Right.
Speaker:Um, the latest thing to break back up overnight is, um, containers.
Speaker:Yep.
Speaker:That's a whole other thing.
Speaker:We're not talking about that right now, but, um, the
Speaker:so
Speaker:ahead.
Speaker:though, this is what sort of VMware, Plus the backup vendors to figure out
Speaker:is there something better we can do
Speaker:Yeah.
Speaker:order to help satisfy backup needs?
Speaker:Because it's not going away, right?
Speaker:You have to
Speaker:Right,
Speaker:VMs.
Speaker:right, right.
Speaker:So the the real key I think, I think the real goal, I. Like, like if we could
Speaker:go back in time and I, I don't know.
Speaker:I don't know.
Speaker:This is one of the things where like, I don't know if we, I, I wasn't in the
Speaker:rooms right when they were designing this stuff, but I'd like to think that
Speaker:somebody said, you know, if, because my memory is, we had issues with tape.
Speaker:The, the, there was the fundamental mismatch of speed tape.
Speaker:You know, people thought it was slow, it was actually fast.
Speaker:That was a problem.
Speaker:And we need, and so we started doing disc staging.
Speaker:We started doing disc backups because of that, right?
Speaker:And then DDU came along and DDU said, Hey, we can make disc better.
Speaker:We can make disc, you know, more affordable.
Speaker:And, and so, so that was, and then it just sort of happened.
Speaker:But even if.
Speaker:We if, if that wasn't the case, I think the design of backups ultimately ending
Speaker:up being, uh, what we're talking about today, which is forever incremental,
Speaker:it, it accomplishes multiple purposes.
Speaker:One is it, it, it solves the issues that we have with tape.
Speaker:Because in order for forever incremental work, you've gotta be using disc, right?
Speaker:Uh, for, for true forever, incremental work to work, you've gotta be using disc.
Speaker:Um, and then the other thing is that it, it solves the, the cost issues.
Speaker:It solves the performance issues, restore, you know, it does a lot of things right.
Speaker:Um,
Speaker:you first define what forever incremental is?
Speaker:'cause I don't
Speaker:I don't want to,
Speaker:that.
Speaker:Before, before we get to forever incremental, we gotta talk about
Speaker:something that was, that was a stop gap technology bef to get from we'd,
Speaker:we wanted to get to a point where we weren't doing fullsS anymore.
Speaker:Everybody agreed that full stunk, except especially when we're
Speaker:talking about virtualization, right?
Speaker:What was the stop gap technology that backup vendors came out with before
Speaker:we went to the whole For incremental?
Speaker:Uh, they created what they called synthetic fulls,
Speaker:Mm-hmm.
Speaker:which is basically you are.
Speaker:Creating a full on the backend, but you're sending data and stitching it together and
Speaker:piecing it together to make it look like what the full would look like, because you
Speaker:Yeah.
Speaker:backups were on that
Speaker:Right.
Speaker:So it's like we already know, we, we already know what the files are in
Speaker:the full, because, you know, we did it on we, we did another incremental.
Speaker:So we know what's deleted.
Speaker:We know what's new.
Speaker:We know what a full would look like.
Speaker:If we went and made one, why don't we just make one over here
Speaker:without having to do it over there.
Speaker:Um, and there were a couple of different ways to do that.
Speaker:Uh, there was the backup vendor way and there was the, um, the, well, the
Speaker:backup software vendor way, and then there was the backup hardware vendor way.
Speaker:Yeah.
Speaker:I'll talk about the backup software vendor way first.
Speaker:It's like, we know what all the files should be on a full,
Speaker:we know where all of them are.
Speaker:We're just gonna copy them all together.
Speaker:You know, in one image, and then we're gonna copy that over to a new
Speaker:set of tapes or a new set of disc
Speaker:Mm.
Speaker:so that we essentially create an actual full, um, without actually having to
Speaker:transfer any data around the network.
Speaker:It might be transferring it, uh, well it is transferring
Speaker:it within the backup system.
Speaker:Right.
Speaker:And then there was the, the, and, and that took a certain amount of time.
Speaker:Right.
Speaker:Um, and then.
Speaker:A, a vendor that you might be familiar with came out with
Speaker:a different way to do it.
Speaker:What was that?
Speaker:And basically, um, they basically said, just tell us how you wanna
Speaker:stitch everything together,
Speaker:Yeah.
Speaker:it all together.
Speaker:Yeah.
Speaker:So it was like, it was like an API, right.
Speaker:You know where all the stuff is.
Speaker:We have all the stuff.
Speaker:Just tell us which stuff you want, where, and we'll go.
Speaker:Yep.
Speaker:And so they could make essentially a new full.
Speaker:When in reality it's just a bunch of pointers to what was already there.
Speaker:Right.
Speaker:this
Speaker:And it was really quick.
Speaker:Right,
Speaker:it was really quick because the key is, especially on a de-duplicated system,
Speaker:you're not actually copying the data.
Speaker:It's all pointer manipulation.
Speaker:right, right.
Speaker:moving any data, you're just changing metadata, which is very quick, very fast.
Speaker:That was a great singing group from the nineties,
Speaker:What
Speaker:I
Speaker:metadata.
Speaker:manipulation.
Speaker:But I'm, anyway, so yeah, so that was sort of a stop gap solution.
Speaker:It allowed people to stop doing real fulls, solve the issue of the, the,
Speaker:the load on the client, solve the issue of the load on the network.
Speaker:But we still, we were still creating fulls.
Speaker:So the concept of forever incremental.
Speaker:And, and true forever incremental is that we first have to design a backup product
Speaker:that just no longer needs repeated fulls.
Speaker:Right?
Speaker:And, and by the way, this generally does only apply to, um.
Speaker:Uh, file system backups for the most part.
Speaker:Um, not database and application backups.
Speaker:Most of them still require repeated fullss.
Speaker:Um, you know, we'll, we'll, some of, some of them have forever incremental, but.
Speaker:I would also say, or would you also say for virtualized
Speaker:workloads, this also applies 'cause
Speaker:Um, oh, you mean, uh, the, the backup of the di of the system itself?
Speaker:Yes.
Speaker:Yes.
Speaker:So, well, so I'll, so if I, if, if I was to use a term that would apply to,
Speaker:to both of them, I'd say volume-based.
Speaker:Yeah.
Speaker:Backups.
Speaker:Right?
Speaker:So either a file system or the, or a,
Speaker:Disc.
Speaker:file or a VDK file, right?
Speaker:Um, and the idea is that we're just gonna do one full, and then
Speaker:we're generally going to be doing block level incremental backups.
Speaker:Um, not just incremental, but block level incremental.
Speaker:What's the difference between those?
Speaker:Well, a block level incremental allows you to look at changes
Speaker:within, a specific object.
Speaker:As an example, say you had, uh, 10 gigabyte PST.
Speaker:File representing your email and instead of doing a normal incremental, which
Speaker:would transfer the entire 10 gig file over when you back it up using traditional
Speaker:incrementals, block level incremental would just take the changes in within that
Speaker:10 gig PST file and transfer that over.
Speaker:Exactly, and so it, it, it's another data reduction technology like deduplication.
Speaker:And it, it, it just, it, I I think it's a, it was an order of
Speaker:magnitude reduction in the number of bytes that has to be transferred.
Speaker:And, and the way this worked in, in, uh, file system backups was that
Speaker:you had to have like an API into the actual blocks, the, the discs, right.
Speaker:Uh, which.
Speaker:Was somewhat problematic.
Speaker:The, the, the easier way to get this was in the virtualization
Speaker:world where you had an API, right, like the change block tracking API
Speaker:Mm-hmm.
Speaker:you can just, the backup app can just say to the, the virtualization
Speaker:vendor, Hey, I'm here to back up again.
Speaker:Just give me a pointer to all of the blocks that have changed
Speaker:since the last time I was here.
Speaker:Yep.
Speaker:Um, and then we can transfer just those change blocks, but.
Speaker:But.
Speaker:importantly, for a true increment, for a true forever incremental, um, to, to, to,
Speaker:for me to think that it's a truly forever incremental is that you're going to change
Speaker:how the data is stored on the backend.
Speaker:So there really isn't a concept of full, right?
Speaker:Um, the way that you're storing each new backup is that each new backup from a
Speaker:restore perspective behaves like a full.
Speaker:Yeah.
Speaker:Right.
Speaker:You
Speaker:Yeah.
Speaker:want to like put that in?
Speaker:Go ahead.
Speaker:so I was just gonna put it in my own words, which
Speaker:Yeah.
Speaker:basically you're able to send only the changes and end up with
Speaker:the full on the backend, such
Speaker:Right.
Speaker:each one is a full, they can be independently restored without
Speaker:any dependencies on others.
Speaker:Yeah.
Speaker:So you don't have this idea of restoring the full and restoring
Speaker:the incrementals individually.
Speaker:Right.
Speaker:You just restore the late, you know, it's sort of, it's, it's sort of like take what
Speaker:we used to do, take the synthetic full.
Speaker:Every time we do an incremental, we're essentially creating a new synthetic full.
Speaker:Yep,
Speaker:How's that?
Speaker:yep.
Speaker:Yeah.
Speaker:Yeah, because if you think about it, it's just kind of changing
Speaker:where the processing happened,
Speaker:Right.
Speaker:Because with Synthetic fulls, the client was telling the storage, okay, these
Speaker:are the pieces I want you to stitch together, and it might have been pieces
Speaker:across multiple different backups,
Speaker:Mm-hmm.
Speaker:right?
Speaker:In the case of an incremental forever, it's like, Hey, you know
Speaker:your last backup go apply these changes on top of that last backup
Speaker:Yeah.
Speaker:Yeah.
Speaker:offsets.
Speaker:Right.
Speaker:Um, and go ahead.
Speaker:it needs to be a system that allows you to keep those copies as independent,
Speaker:What do you, what do you mean by that?
Speaker:um, uh, backup technologies which say, which claim to be incremental forever.
Speaker:Mm-hmm.
Speaker:They only keep the latest copy,
Speaker:Right.
Speaker:for
Speaker:I.
Speaker:you have a full, you apply the changes, you now get a new full.
Speaker:That old full is gone from the system.
Speaker:Hmm.
Speaker:Yeah.
Speaker:have the most recent copy, right?
Speaker:And normally for backup, because you want to be able to go back further in
Speaker:time, you need to make sure you are preserving those previous copies as well.
Speaker:And so it
Speaker:Yeah.
Speaker:require a technology like deduplication, which allows you to efficiently store
Speaker:multiple copies, snapshots, whatever technology is available on the backend
Speaker:in order to make that possible.
Speaker:Yeah, so snapshots and replication.
Speaker:A great example of forever incremental technology, right?
Speaker:Each new, um.
Speaker:Each time you do a snapshot and a replication, you get a new, something
Speaker:that looks like a full, uh, when really all you only transferred was the, the,
Speaker:you know, the bites that have changed.
Speaker:And then, um, I can, I can immediately think of when I was,
Speaker:when you were talking, I was like, what is he talking about?
Speaker:And then I realized what you were talking about.
Speaker:So here's an example of forever incremental that isn't what we're
Speaker:talking about, and that's replication.
Speaker:It
Speaker:Yep.
Speaker:Right.
Speaker:Re just regular replication without anything
Speaker:Yep,
Speaker:beside it.
Speaker:You end up with each new, every time you do a a, an if, if you want to call
Speaker:it that, in an incremental backup, all you're left with is the latest
Speaker:version and you can't go back in time.
Speaker:You know, you know what I was actually thinking of when I made those comments?
Speaker:Oh, what were you,
Speaker:Oracle incremental merge
Speaker:oh, okay.
Speaker:because Oracle incremental merge applies to changes on top of a full, but it
Speaker:does not preserve the previous versions.
Speaker:Right.
Speaker:Yeah, that's a good point.
Speaker:Um, which, which wouldn't it, that's really just a, a type
Speaker:of, synthetic full, right?
Speaker:Yeah, yeah, I didn't even realize that, by the way.
Speaker:Um, so how do you preserve the history?
Speaker:That's why
Speaker:Do you just.
Speaker:do you need a application that, or a storage system that
Speaker:allows you to take snapshots or
Speaker:okay.
Speaker:copy of the file before it
Speaker:gotcha.
Speaker:Why do
Speaker:where was it?
Speaker:Curtis?
Speaker:Why do I still remember these things?
Speaker:It's why you're here?
Speaker:So we've defined what a
Speaker:Mm-hmm.
Speaker:a little bit about the benefits,
Speaker:Yeah.
Speaker:Being able to quickly restore because you have the full copy, uh, being independent,
Speaker:um, not having to transfer all the data.
Speaker:Right?
Speaker:Um, can
Speaker:And also also cost savings.
Speaker:Right, right.
Speaker:Yeah.
Speaker:you think of any cons?
Speaker:Well, the, the only thing you know, one of the things I said
Speaker:in the beginning was that.
Speaker:In order to do forever, incremental, successfully, you've
Speaker:got to store the data on disc.
Speaker:And, and again, not, not to be, you know, pro tape and all that stuff,
Speaker:but we did an episode relatively recently where we talked about how you
Speaker:have to, uh, protect the backup disc.
Speaker:The only con that I can think of is that your entire backup is just sitting
Speaker:there ready to be attacked, right.
Speaker:Yeah,
Speaker:Um.
Speaker:I have, I have a
Speaker:Go ahead.
Speaker:You have, you have a con, okay?
Speaker:So having worked with vendors, and I'm not saying this has happened, but
Speaker:it is a potential thing you should think about is when you have forever
Speaker:incrementals, You're always re.
Speaker:Copying over the existing backup
Speaker:Mm-hmm.
Speaker:bug that somehow makes, that corrupts a part of your backup, right?
Speaker:You've now impacted how many versions back because you're never getting a
Speaker:full copy of that data again, right?
Speaker:It's always
Speaker:Right,
Speaker:And so
Speaker:Yeah.
Speaker:be one of the cons a
Speaker:Yeah.
Speaker:approach.
Speaker:That was, that was a, um, a critique of ddu, uh, because that
Speaker:has the same issue, a critique of dedupe when Dedupe first came out.
Speaker:And all you can say is that you just do your best to ensure
Speaker:that that's not happening.
Speaker:And that's why we do fingerprinting and, and, you know, integrity
Speaker:checks and all that kind of stuff.
Speaker:your backup.
Speaker:Yes.
Speaker:And also test your backups.
Speaker:Right.
Speaker:Test your backup, test your backup, test your backups.
Speaker:Um, but go ahead.
Speaker:I do have another one too.
Speaker:If you, unless you
Speaker:Oh, you do?
Speaker:Okay.
Speaker:Go ahead.
Speaker:The other one I also wanted to mention is, right if Forever incremental backups
Speaker:are so great, why isn't everyone using it?
Speaker:Mm-hmm.
Speaker:one of the challenges is it requires vendor support.
Speaker:Yeah.
Speaker:application to support it.
Speaker:So VMware is supported for virtualization.
Speaker:companies support it for their own applications, but
Speaker:not everyone supports it.
Speaker:Not everyone makes it available.
Speaker:Yeah.
Speaker:so you sort of are limited in terms of what applications, what
Speaker:APIs are available, and can you get the changes that you need in
Speaker:order to be able to create your forever incremental in the backend.
Speaker:Exactly, and, and know, I can think of modern database applications
Speaker:that do support it, right?
Speaker:Um, I can think, for example, uh, when you back up Salesforce, right?
Speaker:You're backing up at the object level and it's very easy to.
Speaker:Uh, and, and let me rephrase that, because Salesforce uses
Speaker:the term object to mean something very different than what we mean.
Speaker:Um, because why should we all use the same terms?
Speaker:of course.
Speaker:A Salesforce object is like the user's table, right?
Speaker:Um, so I'm talking about like, I made an update to the, to a, you know, a
Speaker:field in a record in a table, and, um.
Speaker:Uh, you can back up that change as right, and, and, and you can do it incrementally
Speaker:or you can do it independently.
Speaker:And so it would be very easy to store all of those, those
Speaker:objects in a, in a way that.
Speaker:It, you know, would, would foster this idea of a forever incremental.
Speaker:Um, whereas like Oracle for example, there really isn't a way to do it.
Speaker:The, at best what we can get out of Oracle is the concept of a, um, synthetic full,
Speaker:uh, and the synthetic full that they have.
Speaker:Has the issue that you, that you mentioned before, whereas once you
Speaker:do the synthetic full, you don't have the previous incrementals.
Speaker:Um, so yeah, it, that is, that is definitely a, um, it, it
Speaker:requires your backup vendors to support it, and it requires your
Speaker:application vendors to support it.
Speaker:Yeah.
Speaker:Absolutely.
Speaker:I can't remember where I was going before you brought up that other, um, I, I,
Speaker:I will say this, that, you know, the title of this is, you know, how Forever
Speaker:Incremental Backup changed the world.
Speaker:It, it, it really has, right?
Speaker:The idea, when you look at these modern backup products, the
Speaker:idea of doing a, of a, of a full backup is just, it's just gone.
Speaker:Right?
Speaker:Um, all of the downsides to doing a regular full backup.
Speaker:Are, are gone.
Speaker:Uh, the, the, the, the, the compute considerations, the
Speaker:network considerations, the storage considerations, they're all gone without
Speaker:any negative like impact to the restore.
Speaker:In fact, it's the opposite.
Speaker:The restore is now, since we know what a full looks like when we go to do the
Speaker:restore, that is to going back to Harken, back to the original way we did restores.
Speaker:We're no longer restoring the same file multiple times during a single restore.
Speaker:Why would we do that?
Speaker:We know where all the files are, we know where all the blocks are, whatever
Speaker:it is that we're, we're restoring.
Speaker:We know what the latest version of that block here file is, and
Speaker:we're just gonna restore that one.
Speaker:I have a story.
Speaker:Um, how you do forever, incremental Matters.
Speaker:The original, the OG, forever incremental product is TSM.
Speaker:mm-hmm.
Speaker:And when, and the way it was implemented back then was to tape.
Speaker:And they, tried to bring this concept of forever incremental to the world without
Speaker:changing the underlying storage technology because back then it was, it was, uh.
Speaker:tape.
Speaker:It was, uh, if you, if you brought disc into it, it was, it was crazy talk, right?
Speaker:So the, so the, so, so they can get a lot of credit for trying to
Speaker:accomplish this, this, um, this idea.
Speaker:But I, I still think when we look back to TSM of old, I, I think it was, I
Speaker:think it was a failed attempt because my earlier statement of the only proper way
Speaker:to do forever incremental is to do it on disc when they were doing it on tape.
Speaker:What it meant was that when you, um, when you went to go to do a restore, the, the,
Speaker:the full was on like 5,700 tapes, right?
Speaker:sounds worse than just doing your traditional fulls and incrementals.
Speaker:Yeah, well, it was, and I remember, I, I remember doing a, there was a one
Speaker:particular restore that I remember doing using TSM in a production environment.
Speaker:And I, I don't remember the actual numbers, but I'll, I'll just use like,
Speaker:uh, I'll just use fake numbers just to, to, to to, to describe what it was like.
Speaker:It was like we were doing like a 10 gigabyte restore and it took like.
Speaker:Two weeks because of how many hundreds of tapes that it was.
Speaker:Thousands of tapes.
Speaker:Um, because of the number of files, the number of tapes that they were
Speaker:on, and how long it takes to load and unload an individual tape.
Speaker:Right?
Speaker:Um, and they had technologies, uh, called reclamation and they, they,
Speaker:they had a number of technologies that tried to mini minimize this issue.
Speaker:Um, but
Speaker:It still existed.
Speaker:well, yeah, it still is existed.
Speaker:And also some people, they would run out of time in the day to do reclamation.
Speaker:Reclamation is where you take a bunch of tapes that, um, like they were, that
Speaker:most of the files on those, on those tapes were no longer needed because
Speaker:they, they'd expired and then you take, yeah, I've got a bunch of files that are
Speaker:10% and then you copy them to one tape
Speaker:It's
Speaker:to that's now a hundred percent.
Speaker:It's like garbage collection, but on tape.
Speaker:And it was a very, um, labor intensive process and, um, IO intensive process.
Speaker:And as a result, it took a long time.
Speaker:And so some people would, would turn it off.
Speaker:Um, and, uh, I think that's what happened in, in, in this case.
Speaker:And so there, there were literally just, I don't rem there were so many
Speaker:tapes that we had to restore and it, you know, it sounds like I'm bagging
Speaker:on T-S-M-T-S-M is still around.
Speaker:It's now called, uh, spectrum Protect.
Speaker:Yeah, but the addition of bringing in disc, I think makes this, this whole
Speaker:issue kind of, uh, not a big deal,
Speaker:Yeah.
Speaker:but, um, but yeah, I still think that you look at all of the modern backup
Speaker:products that have come out in the last 10 plus years, and they're all like this.
Speaker:They're all one full, followed by a whole bunch of incrementals, and
Speaker:the world is a better place for it.
Speaker:And, um, you young whippersnappers have never had to deal with
Speaker:tape
Speaker:incremental backups.
Speaker:You just don't know
Speaker:what
Speaker:what you uh, yeah.
Speaker:You don't know the pain that you, that Yeah.
Speaker:That you had.
Speaker:You know, it's funny, this morning I was hanging out with some people and I
Speaker:was talking to 'em, and they're, it's this lady who shares my birthday, but
Speaker:she's about 15 years older than me,
Speaker:Hmm.
Speaker:we started talking about all of the technology that we
Speaker:remember when it came out.
Speaker:Right.
Speaker:And the first, the first.
Speaker:Piece of technology I remember being introduced in my life was the, the VCR
Speaker:Hmm.
Speaker:and it, it came, they, I remember them wheeling one into the classroom
Speaker:Hmm.
Speaker:in my school.
Speaker:And it was gigantic.
Speaker:Right.
Speaker:It was like, and it was the kind where you put the tape in and push it down, right?
Speaker:Yeah.
Speaker:Um, anyway, this is, this is like that, this is like the
Speaker:way, the way we did backups.
Speaker:And by the way, you know.
Speaker:I'm, I'm glad I, I'm glad I remembered this because you asked a question earlier.
Speaker:Why is it that not everybody does things in this new way?
Speaker:Yeah.
Speaker:Because backup is the stickiest application in the data center.
Speaker:Yep.
Speaker:So many people are still using the same backup application
Speaker:that they were 15 years ago.
Speaker:Some people change their applications like they change
Speaker:their underwear, but other people.
Speaker:They just, it, it's a pain.
Speaker:It's costly, it's risk, uh, it's fraught with risk and, and, and
Speaker:difficulty and a giant learning curve.
Speaker:And during that learning curve, your, your data's at risk.
Speaker:And so most people just, they'd rather stick with the ugly, old backup product.
Speaker:it.
Speaker:Because it works.
Speaker:Yeah.
Speaker:And, and, and so it, it, it gave birth to incrementally helpful
Speaker:technologies like deduplication.
Speaker:Don't change anything.
Speaker:Just send your backups over here instead of sending your backups over there.
Speaker:And we'll get rid of the duplicates and we'll make it sort of look like a,
Speaker:an incremental forever, but just keep doing it the dumb old way because we
Speaker:don't want you to have to change your backup product in order to get to,
Speaker:and, and I guess this is, you know.
Speaker:If, if you're one of those people that's still using the backup, the
Speaker:same backup product that you had 15 years ago and you're still using it
Speaker:the same way you were using it 15 years ago, and you're not using incremental
Speaker:forever as a, a true technology in your world, maybe time to think about it.
Speaker:but I, I would probably say.
Speaker:Newer workloads, people are probably using that, right?
Speaker:Because once again, environments are a mishmash of technologies, right?
Speaker:Yeah.
Speaker:people probably do have those old databases, old applications,
Speaker:which they are using an old infrastructure backup infrastructure
Speaker:for, and for their virtualization or other newer workloads, they're
Speaker:probably using something different.
Speaker:Yeah.
Speaker:Uh, remember I did that, I did that survey a few, uh, months ago and
Speaker:I asked them how many backup apps they had, and 30% had 30, I think
Speaker:more than 30% had three or more backup apps in their environment.
Speaker:Yep.
Speaker:And it's because of what you're talking about.
Speaker:They have, they start using.
Speaker:Containers and their current backup app doesn't know what to do.
Speaker:And so they use that for the new technology and yeah.
Speaker:Yeah, that's very common.
Speaker:And I guess what I'm saying is maybe it's time to check out the water of
Speaker:the, of, of truly incremental backups and, and experience all the beauty.
Speaker:Yeah.
Speaker:All right.
Speaker:Well thanks for the chat again.
Speaker:Anytime Curtis.
Speaker:Anytime.
Speaker:All right.
Speaker:And we wanna thank our listeners.
Speaker:Yo, you're why we're here, you know, otherwise, just two guys sitting here
Speaker:talking over a couple of microphones.
Speaker:Uh, that is a wrap.