April 7, 2025

How Forever Incremental Backup Changed the World

How Forever Incremental Backup Changed the World

In this episode of The Backup Wrap-Up, Curtis and Prasanna explore how forever incremental backup technology revolutionized the data protection industry. They discuss the evolution from traditional backup methods to modern approaches that eliminate the need for regular full backups, dramatically reducing network traffic, storage requirements, and backup windows.

The hosts examine the technical foundations of forever incremental backups, from block-level incremental tracking to backend storage innovations that make multiple recovery points possible without redundant data transfers. They compare older approaches like synthetic fulls with true forever incremental implementations, highlighting the critical differences and benefits. Whether you're still using legacy backup tools or evaluating modern solutions, this episode provides essential insights into why forever incremental has become the standard for efficient, reliable backup systems.

Speaker:

You found the backup wrap up your go-to podcast for all things

Speaker:

backup recovery and cyber recovery.

Speaker:

In this episode, we're covering forever incremental backup technology and

Speaker:

how that completely revolutionized the entire backup industry.

Speaker:

I. Remember the old days of doing weekly fos that brought

Speaker:

your systems to their knees?

Speaker:

Well, those days are thankfully gone for most people.

Speaker:

We're gonna walk through the evolution of, from traditional approaches with

Speaker:

fools and incrementals to synthetic fos, and finally to true forever incremental.

Speaker:

If you're still doing things the old way, well, you're missing

Speaker:

out on some serious benefits.

Speaker:

Better performance, lower storage costs, and way faster restores.

Speaker:

Perhaps this episode will cause you to rethink things a little bit.

Speaker:

By the way, if you don't know who I am, I'm w Curtis Preston, AKA, Mr. Backup,

Speaker:

and I've been passionate about backup and recovery for over 30 years, ever since.

Speaker:

I had to tell my boss that we had no backups of the production

Speaker:

database that we had just lost.

Speaker:

I don't want that to happen to you, and that's why I do this.

Speaker:

On this podcast, we turn unappreciated backup admins into Cyber Recovery Heroes.

Speaker:

This is the backup wrap up.

Speaker:

Welcome to the show.

Speaker:

Hi, I am w Curtis Preston, AKA, Mr. Backup, and I have with me a guy that

Speaker:

I bet is pretty dang happy right now.

Speaker:

Prasanna Malaiyandi.

Speaker:

Why are you looking like that?

Speaker:

very confused about what I'm happy about.

Speaker:

Really?

Speaker:

Do I need to explain recent events that happened at your house

Speaker:

Oh

Speaker:

that, that.

Speaker:

For those of you by the way, um, who are listening to this podcast via

Speaker:

audio, we also provide video on YouTube.

Speaker:

So if you wanted to see our expressions just then and really see what Curtis

Speaker:

is laughing about, uh, go check us

Speaker:

Yeah,

Speaker:

Backup wrap up.

Speaker:

so what events happened in the last 48 hours?

Speaker:

my wife was in India

Speaker:

Yeah.

Speaker:

Visiting family and she just got back.

Speaker:

Yes.

Speaker:

it was good.

Speaker:

Suddenly, you're so much harder to get on the phone between that and your new job.

Speaker:

I don't even know you anymore.

Speaker:

He's like, why would I be happy?

Speaker:

I, I was really, I was, I thought it was like, I was thinking, I was like,

Speaker:

was it something YouTube related?

Speaker:

Was it something like I, we had watched or we had talked about and.

Speaker:

No, the fact that your wife is back

Speaker:

although she's also slightly jet lagged, so,

Speaker:

so well, it's what, what, what's it is 12 hours difference.

Speaker:

right now it is 13 and a half.

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

So

Speaker:

does

Speaker:

of course she's,

Speaker:

jet lagged than I do.

Speaker:

So

Speaker:

yeah.

Speaker:

and come back, it takes me like three weeks.

Speaker:

And,

Speaker:

Yeah.

Speaker:

have a dog, which I think some people have seen on the podcast.

Speaker:

Um, it's funny because he gets jet lagged when I'm jet lagged.

Speaker:

So he, his hours will shift to match with mine.

Speaker:

So

Speaker:

That's really adorable.

Speaker:

and he would be like, Hey, I'm ready.

Speaker:

Let's go, let's go.

Speaker:

And then when I'm passed out, he's asleep.

Speaker:

So he's

Speaker:

Interesting.

Speaker:

Yes.

Speaker:

Your dog that's named after an Indian dessert.

Speaker:

cream.

Speaker:

Yep.

Speaker:

Ice cream.

Speaker:

Oh yeah.

Speaker:

Ice cream.

Speaker:

Yeah.

Speaker:

Kulfi, which I have now had, um.

Speaker:

your, did you know they have various flavors of Kulfi.

Speaker:

They're not all just like one.

Speaker:

Oh, I thought it was all mango.

Speaker:

So it's met like mango Kulfi and then what?

Speaker:

Coconut Kulfi.

Speaker:

What are what?

Speaker:

What are the other typical

Speaker:

and pistachio.

Speaker:

Pistachio and, and.

Speaker:

I think there are other ones too.

Speaker:

Yeah.

Speaker:

I don't, yeah, I don't.

Speaker:

I've used cardamom as a, an ingredient, you know, when I made the, the

Speaker:

mango pie thing, but I don't think I know what it tastes like by itself.

Speaker:

So I'll have to, I'll have to sprinkle some and, uh, see what that tastes like.

Speaker:

Or maybe just go and

Speaker:

Yeah.

Speaker:

some

Speaker:

next time you're at the Indian store.

Speaker:

Actually, sometimes even the Costcos carry like a sampler pack

Speaker:

Oh, yeah, so like when I go to the cash and carry down in Mira Mesa.

Speaker:

the cash and carry will definitely have it.

Speaker:

If you go check out the freezer section.

Speaker:

Just

Speaker:

Interesting.

Speaker:

all before you get home.

Speaker:

Yeah.

Speaker:

I, oh man.

Speaker:

I'm, you're, you're literally like, like my mouth is Yeah, yeah.

Speaker:

Thinking about, thinking about it sounds yummy.

Speaker:

Hmm.

Speaker:

Uh, so we're gonna talk about something that's just as yummy.

Speaker:

That is if you're into, into the backup thing, uh, we're gonna talk

Speaker:

about forever incremental and sort of.

Speaker:

Uh, just how big of a deal that is or was.

Speaker:

Um, it, at, at this point?

Speaker:

It's in, in most modern backup products that have come out in

Speaker:

the last 10 years, I'd say, uh, ones that are actually new, right?

Speaker:

So, you know, when we look at the products like Rubrik, Cohesity, or

Speaker:

modern SaaS applications like Druva, Hyku, keep it, Alcion, uh, right.

Speaker:

You know, these are all of these products.

Speaker:

Have forever.

Speaker:

Incremental in common, right?

Speaker:

Nobody wants to do full backups anymore.

Speaker:

Uh, or, or anything like full backups, right?

Speaker:

So, um, so let's talk about that.

Speaker:

Let's talk about what, you know, where were we, what was it like back in the

Speaker:

day, and, um, and then sort of how we got the, from there to here and why it's

Speaker:

such a big deal and why you should care.

Speaker:

Right.

Speaker:

And why this perhaps may give you a reason to finally, um, you

Speaker:

know, what, what is it, shuck?

Speaker:

The mortal coil.

Speaker:

What, what, what, what is that?

Speaker:

What,

Speaker:

I'm just

Speaker:

thinking are there company, and maybe as we're talking we can figure this out, like

Speaker:

are there still vendors that exist that don't support forever Incrementals?

Speaker:

That's actually a really good question and I'm gonna say, uh, absolutely

Speaker:

right, because my concept of what a true forever incremental will be based on.

Speaker:

Like a, a complete redesign of the, of the underneath.

Speaker:

Right.

Speaker:

Um, so the phrase I was looking for is shuttle off sh shuffle off the

Speaker:

mortal coil of your old backup product,

Speaker:

by the way, wait, quick story.

Speaker:

So I'm on, I'm on a plane one time and this, this, um.

Speaker:

This pilot, he was talking about the fact that we were about to take off and

Speaker:

he referred to taking off as shuffling off the mortal coil of this earth.

Speaker:

And I'm like, dude, that's a phrase for dying.

Speaker:

What?

Speaker:

Why the hell are you, why would you do that?

Speaker:

Right.

Speaker:

, it's a Shakespeare, it's from the to be or not to be soliloquy

Speaker:

I,

Speaker:

Hamlet.

Speaker:

in all fairness though, I would say 97% of people probably had

Speaker:

no idea what that phrase meant.

Speaker:

They, they never even heard.

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

And, and, and, and, and just to, just to, uh, like, you know, since I'm over

Speaker:

here quoting Shakespeare, let me just put this, I'm not a huge Shakespeare person.

Speaker:

In fact, it, it wasn't until, I dunno, several years ago, and we were in England

Speaker:

where I, I, I got to actually see my first, um, uh, Shakespeare play, which

Speaker:

was Henry V. Some people say Henry the VI don't, I think it's actually

Speaker:

called Henry five, which is the, um.

Speaker:

The one with the St. Crispin stay speech.

Speaker:

Uh, once more into the breach, close the wall up with, with

Speaker:

her English dead and all that.

Speaker:

But anyway, sorry, I just bored everybody.

Speaker:

Sorry.

Speaker:

Sorry, you're asleep.

Speaker:

Wake back up.

Speaker:

Wake back up.

Speaker:

We're, we're gonna talk about backup, which is so much more

Speaker:

exciting than Shakespeare.

Speaker:

Um, so, all right, so first let's talk about, do you want to define, uh, so what?

Speaker:

let's talk about the old days first.

Speaker:

that's what I, that's what I'm saying.

Speaker:

That's what I'm saying.

Speaker:

Let's, do you wanna define, uh, what a full and, and

Speaker:

Yeah.

Speaker:

an incremental backup is?

Speaker:

Yeah.

Speaker:

So.

Speaker:

Normally when you think about backups, right, you talk,

Speaker:

typically people talk about fulls.

Speaker:

Incrementals, right?

Speaker:

A full is basically your entire application, dataset, whatever

Speaker:

it is, everything is backed up.

Speaker:

Mm-hmm.

Speaker:

then between doing every fulls, you would do incrementals, which are just the things

Speaker:

that have changed since that last full.

Speaker:

Right, right.

Speaker:

you do these periodically, and normally you do incrementals

Speaker:

because they will be smaller.

Speaker:

So you could say do a full once a week and then incrementals during the week

Speaker:

such that you can finish in time or during your backup window, and then the next

Speaker:

weekend you would do like another full,

Speaker:

Right, right now, slightly, I'll slightly change your

Speaker:

definition or a pen or whatever.

Speaker:

Um, so you specifically described what I would call a

Speaker:

cumulative incremental, right?

Speaker:

So a, uh, what is referred to as a sort of a regular, incremental

Speaker:

or a differential incremental.

Speaker:

Yep,

Speaker:

it is one where it's the changes just since the last incremental or since

Speaker:

the last backup of any kind, right?

Speaker:

And then you

Speaker:

Um,

Speaker:

levels and all the

Speaker:

yeah.

Speaker:

Yeah, yeah,

Speaker:

But yeah, at a high level, yeah, there are those three types, if you will.

Speaker:

yeah.

Speaker:

And, and, and then what really, um, what really what it really

Speaker:

came down to is how things behaved when it came time for a restore.

Speaker:

Right.

Speaker:

So if you were going to restore something, then you needed the full,

Speaker:

the latest cumulative, incremental, if you were doing such a thing, and

Speaker:

then any incrementals since that day.

Speaker:

And so, um, those of us, uh.

Speaker:

I, I remember at some point we, we used to do weekly fulls.

Speaker:

I remember my preferred backup design switching to monthly fulls,

Speaker:

weekly cumulative incrementals, and then daily incrementals.

Speaker:

So if you were, let's say three weeks into the month, you would need the full

Speaker:

from the first of the month, the latest, uh, weekly cumulative, incremental,

Speaker:

and then three or four days worth of.

Speaker:

Incrementals

Speaker:

Yeah,

Speaker:

and, uh, in order to do a restore.

Speaker:

. So what it meant was that you would often, the way the Restore worked is you

Speaker:

would restore everything that was in the full, then you would restore everything

Speaker:

that was in the cumulative incremental, and then you would restore everything

Speaker:

that was in each of the incrementals.

Speaker:

What that meant was there were many files that you were actually restoring

Speaker:

multiple times and you were overwriting them, which was just inefficient.

Speaker:

Um, and what that also meant was.

Speaker:

That you had, uh, uh, duplicate data, the old way of doing backups created

Speaker:

a ton of duplicate data on the tapes.

Speaker:

You could say duplicate, or you could also say it's independent,

Speaker:

depending on the glass.

Speaker:

Half full glass, half empty.

Speaker:

Well, so no, I, I understand what you're saying.

Speaker:

What I'm talking about is the fulls,

Speaker:

Yeah,

Speaker:

the fulls over time created a ton of duplicate data.

Speaker:

that's true.

Speaker:

Right.

Speaker:

I know what you're thinking.

Speaker:

You're thinking, you're talking about the, within the, within the, the incrementals.

Speaker:

I'm mainly talking about

Speaker:

yeah,

Speaker:

how many different copies of the exact same file were all

Speaker:

over, you know, everywhere.

Speaker:

and especially because most times, right, you look at sort of like a

Speaker:

2% daily change rate, which maybe

Speaker:

Yeah.

Speaker:

a seven or 8% change rate per week.

Speaker:

And

Speaker:

Yeah.

Speaker:

those weekly cumulatives or even the monthly fulls, right, you're

Speaker:

basically backing up a bunch of data, which you don't necessarily

Speaker:

need that haven't changed.

Speaker:

Right, right.

Speaker:

You, you, the reason we did it the way we did, it wasn't so much about independence.

Speaker:

It was about.

Speaker:

Minimize.

Speaker:

So if you did a monthly full, followed by nothing but regular

Speaker:

incrementals, you would have to load 30 tapes at the end of the day, right?

Speaker:

At the end of, and even if it wasn't tapes, you would have

Speaker:

to, you have to do 30 restores.

Speaker:

Yeah,

Speaker:

So, so the, the, the, the differential or the cumulative incremental, um,

Speaker:

was about minimizing the level of effort during a restore, right?

Speaker:

And then came along this thing called dedupe.

Speaker:

Uh, which at least helped minimize, I think the, all of that duplicate data.

Speaker:

because I think with deduplication it was nice because you didn't

Speaker:

have to change what you were doing from like a policy perspective

Speaker:

Right,

Speaker:

perspective.

Speaker:

But you got, like you said, it was able to.

Speaker:

Dedupe, of course,

Speaker:

right.

Speaker:

in the backend and save you the storage costs.

Speaker:

And that's why when, if you, if you were doing a, a typical weekly, by

Speaker:

the way, nobody did the, the monthly full thing that I recommended, most

Speaker:

people did weekly fulls, right?

Speaker:

And so when you had weekly fulls, um.

Speaker:

This is why, and you, and you had like 90 day retention.

Speaker:

This is why just with even crappy dedupe, you got 20 to one,

Speaker:

Yeah.

Speaker:

uh, data reduction, right?

Speaker:

And this is what helped, um, pay for, you know, the what to help

Speaker:

created the, the market for all that, all that dedupe work, right?

Speaker:

And then we could argue over, um, you know, well we're, we're

Speaker:

20 to one, we're 30 to one, we're, you know, whatever, right?

Speaker:

And, um, we, we saved this much more money.

Speaker:

Um.

Speaker:

You know, once you get rid of all those fulls, everything else

Speaker:

was, was really kind of incre

Speaker:

Yeah.

Speaker:

incremental.

Speaker:

Um, but the real, I think the real challenge was just the core backup design

Speaker:

of when we go to do a restore, we have to restore, uh, data multiple times.

Speaker:

Right.

Speaker:

It just simply wasn't efficient.

Speaker:

Well, and I think even before we get there, I'd like to cover

Speaker:

Mm-hmm.

Speaker:

uh,

Speaker:

Okay.

Speaker:

if you had the existing backup schedule, right, where I'm still doing

Speaker:

weekly fulls and daily incrementals,

Speaker:

Mm-hmm.

Speaker:

still transferring all that data over the network.

Speaker:

Right?

Speaker:

And I think this is where client side deduplication would probably

Speaker:

come next, in my opinion, right?

Speaker:

Yep.

Speaker:

it would help you reduce the amount of data you're actually

Speaker:

sending over the network.

Speaker:

Right.

Speaker:

And that'll help save you cost as well as time.

Speaker:

Yeah, that's actually a really good point.

Speaker:

Thanks for bringing that up.

Speaker:

So the, one of the other like, like basically what you're saying that one

Speaker:

of the other, um, downsides to doing it the old way was both the transfer

Speaker:

across the network as well as the impact, uh, to the clients, right?

Speaker:

If you're doing a full, on a regular basis, um, you're, um.

Speaker:

There's a, there's a cost to that

Speaker:

Yep.

Speaker:

in both, you know, you brought up bandwidth, it's

Speaker:

bandwidth storage and compute.

Speaker:

Right?

Speaker:

Um, it's like all the things.

Speaker:

There was also a, um, sort of a, like a perfect storm brewing.

Speaker:

Yeah.

Speaker:

So we're doing fulls on a regular basis.

Speaker:

We're doing cumulative incrementals on a regular basis, and this is

Speaker:

how we were doing things, and then along came virtualization.

Speaker:

Why was that a problem?

Speaker:

Because a lot of the data was

Speaker:

I,

Speaker:

the same, right?

Speaker:

When you look across all of your VMs, they're probably spun up

Speaker:

from the same golden copy, of your

Speaker:

Well, the,

Speaker:

and

Speaker:

yeah, well, the.

Speaker:

amongst them.

Speaker:

Well, that, that's a, this is another example of, that's a great answer.

Speaker:

It's not the question I was asking.

Speaker:

It's

Speaker:

Um,

Speaker:

resource on the backend because

Speaker:

that's what I was going after.

Speaker:

Yeah,

Speaker:

and the compute, right?

Speaker:

You don't have your

Speaker:

yeah,

Speaker:

instance

Speaker:

yeah.

Speaker:

That's, that's what I was going after.

Speaker:

Right?

Speaker:

So, so that was a real problem when VMware got really popular and we just

Speaker:

kept doing the same old thing because.

Speaker:

Remember, for a long time we backed up VMware.

Speaker:

We backed up the physical, the virtual machines, just like

Speaker:

they were physical machines.

Speaker:

Yes.

Speaker:

Guest ba. We just basically installed an A, an agent and every guest,

Speaker:

and that's how we did the backups.

Speaker:

And, you know, and what happened was you'd do just randomly, you'd end up

Speaker:

doing a bunch of, at the same time.

Speaker:

Which worked fine when we were 20 individual machines, but now we're 20

Speaker:

VMs all on the same physical machine, and that just broke, like virtualization

Speaker:

just broke back up overnight.

Speaker:

Right.

Speaker:

Um, the latest thing to break back up overnight is, um, containers.

Speaker:

Yep.

Speaker:

That's a whole other thing.

Speaker:

We're not talking about that right now, but, um, the

Speaker:

so

Speaker:

ahead.

Speaker:

though, this is what sort of VMware, Plus the backup vendors to figure out

Speaker:

is there something better we can do

Speaker:

Yeah.

Speaker:

order to help satisfy backup needs?

Speaker:

Because it's not going away, right?

Speaker:

You have to

Speaker:

Right,

Speaker:

VMs.

Speaker:

right, right.

Speaker:

So the the real key I think, I think the real goal, I. Like, like if we could

Speaker:

go back in time and I, I don't know.

Speaker:

I don't know.

Speaker:

This is one of the things where like, I don't know if we, I, I wasn't in the

Speaker:

rooms right when they were designing this stuff, but I'd like to think that

Speaker:

somebody said, you know, if, because my memory is, we had issues with tape.

Speaker:

The, the, there was the fundamental mismatch of speed tape.

Speaker:

You know, people thought it was slow, it was actually fast.

Speaker:

That was a problem.

Speaker:

And we need, and so we started doing disc staging.

Speaker:

We started doing disc backups because of that, right?

Speaker:

And then DDU came along and DDU said, Hey, we can make disc better.

Speaker:

We can make disc, you know, more affordable.

Speaker:

And, and so, so that was, and then it just sort of happened.

Speaker:

But even if.

Speaker:

We if, if that wasn't the case, I think the design of backups ultimately ending

Speaker:

up being, uh, what we're talking about today, which is forever incremental,

Speaker:

it, it accomplishes multiple purposes.

Speaker:

One is it, it, it solves the issues that we have with tape.

Speaker:

Because in order for forever incremental work, you've gotta be using disc, right?

Speaker:

Uh, for, for true forever, incremental work to work, you've gotta be using disc.

Speaker:

Um, and then the other thing is that it, it solves the, the cost issues.

Speaker:

It solves the performance issues, restore, you know, it does a lot of things right.

Speaker:

Um,

Speaker:

you first define what forever incremental is?

Speaker:

'cause I don't

Speaker:

I don't want to,

Speaker:

that.

Speaker:

Before, before we get to forever incremental, we gotta talk about

Speaker:

something that was, that was a stop gap technology bef to get from we'd,

Speaker:

we wanted to get to a point where we weren't doing fullsS anymore.

Speaker:

Everybody agreed that full stunk, except especially when we're

Speaker:

talking about virtualization, right?

Speaker:

What was the stop gap technology that backup vendors came out with before

Speaker:

we went to the whole For incremental?

Speaker:

Uh, they created what they called synthetic fulls,

Speaker:

Mm-hmm.

Speaker:

which is basically you are.

Speaker:

Creating a full on the backend, but you're sending data and stitching it together and

Speaker:

piecing it together to make it look like what the full would look like, because you

Speaker:

Yeah.

Speaker:

backups were on that

Speaker:

Right.

Speaker:

So it's like we already know, we, we already know what the files are in

Speaker:

the full, because, you know, we did it on we, we did another incremental.

Speaker:

So we know what's deleted.

Speaker:

We know what's new.

Speaker:

We know what a full would look like.

Speaker:

If we went and made one, why don't we just make one over here

Speaker:

without having to do it over there.

Speaker:

Um, and there were a couple of different ways to do that.

Speaker:

Uh, there was the backup vendor way and there was the, um, the, well, the

Speaker:

backup software vendor way, and then there was the backup hardware vendor way.

Speaker:

Yeah.

Speaker:

I'll talk about the backup software vendor way first.

Speaker:

It's like, we know what all the files should be on a full,

Speaker:

we know where all of them are.

Speaker:

We're just gonna copy them all together.

Speaker:

You know, in one image, and then we're gonna copy that over to a new

Speaker:

set of tapes or a new set of disc

Speaker:

Mm.

Speaker:

so that we essentially create an actual full, um, without actually having to

Speaker:

transfer any data around the network.

Speaker:

It might be transferring it, uh, well it is transferring

Speaker:

it within the backup system.

Speaker:

Right.

Speaker:

And then there was the, the, and, and that took a certain amount of time.

Speaker:

Right.

Speaker:

Um, and then.

Speaker:

A, a vendor that you might be familiar with came out with

Speaker:

a different way to do it.

Speaker:

What was that?

Speaker:

And basically, um, they basically said, just tell us how you wanna

Speaker:

stitch everything together,

Speaker:

Yeah.

Speaker:

it all together.

Speaker:

Yeah.

Speaker:

So it was like, it was like an API, right.

Speaker:

You know where all the stuff is.

Speaker:

We have all the stuff.

Speaker:

Just tell us which stuff you want, where, and we'll go.

Speaker:

Yep.

Speaker:

And so they could make essentially a new full.

Speaker:

When in reality it's just a bunch of pointers to what was already there.

Speaker:

Right.

Speaker:

this

Speaker:

And it was really quick.

Speaker:

Right,

Speaker:

it was really quick because the key is, especially on a de-duplicated system,

Speaker:

you're not actually copying the data.

Speaker:

It's all pointer manipulation.

Speaker:

right, right.

Speaker:

moving any data, you're just changing metadata, which is very quick, very fast.

Speaker:

That was a great singing group from the nineties,

Speaker:

What

Speaker:

I

Speaker:

metadata.

Speaker:

manipulation.

Speaker:

But I'm, anyway, so yeah, so that was sort of a stop gap solution.

Speaker:

It allowed people to stop doing real fulls, solve the issue of the, the,

Speaker:

the load on the client, solve the issue of the load on the network.

Speaker:

But we still, we were still creating fulls.

Speaker:

So the concept of forever incremental.

Speaker:

And, and true forever incremental is that we first have to design a backup product

Speaker:

that just no longer needs repeated fulls.

Speaker:

Right?

Speaker:

And, and by the way, this generally does only apply to, um.

Speaker:

Uh, file system backups for the most part.

Speaker:

Um, not database and application backups.

Speaker:

Most of them still require repeated fullss.

Speaker:

Um, you know, we'll, we'll, some of, some of them have forever incremental, but.

Speaker:

I would also say, or would you also say for virtualized

Speaker:

workloads, this also applies 'cause

Speaker:

Um, oh, you mean, uh, the, the backup of the di of the system itself?

Speaker:

Yes.

Speaker:

Yes.

Speaker:

So, well, so I'll, so if I, if, if I was to use a term that would apply to,

Speaker:

to both of them, I'd say volume-based.

Speaker:

Yeah.

Speaker:

Backups.

Speaker:

Right?

Speaker:

So either a file system or the, or a,

Speaker:

Disc.

Speaker:

file or a VDK file, right?

Speaker:

Um, and the idea is that we're just gonna do one full, and then

Speaker:

we're generally going to be doing block level incremental backups.

Speaker:

Um, not just incremental, but block level incremental.

Speaker:

What's the difference between those?

Speaker:

Well, a block level incremental allows you to look at changes

Speaker:

within, a specific object.

Speaker:

As an example, say you had, uh, 10 gigabyte PST.

Speaker:

File representing your email and instead of doing a normal incremental, which

Speaker:

would transfer the entire 10 gig file over when you back it up using traditional

Speaker:

incrementals, block level incremental would just take the changes in within that

Speaker:

10 gig PST file and transfer that over.

Speaker:

Exactly, and so it, it, it's another data reduction technology like deduplication.

Speaker:

And it, it, it just, it, I I think it's a, it was an order of

Speaker:

magnitude reduction in the number of bytes that has to be transferred.

Speaker:

And, and the way this worked in, in, uh, file system backups was that

Speaker:

you had to have like an API into the actual blocks, the, the discs, right.

Speaker:

Uh, which.

Speaker:

Was somewhat problematic.

Speaker:

The, the, the easier way to get this was in the virtualization

Speaker:

world where you had an API, right, like the change block tracking API

Speaker:

Mm-hmm.

Speaker:

you can just, the backup app can just say to the, the virtualization

Speaker:

vendor, Hey, I'm here to back up again.

Speaker:

Just give me a pointer to all of the blocks that have changed

Speaker:

since the last time I was here.

Speaker:

Yep.

Speaker:

Um, and then we can transfer just those change blocks, but.

Speaker:

But.

Speaker:

importantly, for a true increment, for a true forever incremental, um, to, to, to,

Speaker:

for me to think that it's a truly forever incremental is that you're going to change

Speaker:

how the data is stored on the backend.

Speaker:

So there really isn't a concept of full, right?

Speaker:

Um, the way that you're storing each new backup is that each new backup from a

Speaker:

restore perspective behaves like a full.

Speaker:

Yeah.

Speaker:

Right.

Speaker:

You

Speaker:

Yeah.

Speaker:

want to like put that in?

Speaker:

Go ahead.

Speaker:

so I was just gonna put it in my own words, which

Speaker:

Yeah.

Speaker:

basically you're able to send only the changes and end up with

Speaker:

the full on the backend, such

Speaker:

Right.

Speaker:

each one is a full, they can be independently restored without

Speaker:

any dependencies on others.

Speaker:

Yeah.

Speaker:

So you don't have this idea of restoring the full and restoring

Speaker:

the incrementals individually.

Speaker:

Right.

Speaker:

You just restore the late, you know, it's sort of, it's, it's sort of like take what

Speaker:

we used to do, take the synthetic full.

Speaker:

Every time we do an incremental, we're essentially creating a new synthetic full.

Speaker:

Yep,

Speaker:

How's that?

Speaker:

yep.

Speaker:

Yeah.

Speaker:

Yeah, because if you think about it, it's just kind of changing

Speaker:

where the processing happened,

Speaker:

Right.

Speaker:

Because with Synthetic fulls, the client was telling the storage, okay, these

Speaker:

are the pieces I want you to stitch together, and it might have been pieces

Speaker:

across multiple different backups,

Speaker:

Mm-hmm.

Speaker:

right?

Speaker:

In the case of an incremental forever, it's like, Hey, you know

Speaker:

your last backup go apply these changes on top of that last backup

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

offsets.

Speaker:

Right.

Speaker:

Um, and go ahead.

Speaker:

it needs to be a system that allows you to keep those copies as independent,

Speaker:

What do you, what do you mean by that?

Speaker:

um, uh, backup technologies which say, which claim to be incremental forever.

Speaker:

Mm-hmm.

Speaker:

They only keep the latest copy,

Speaker:

Right.

Speaker:

for

Speaker:

I.

Speaker:

you have a full, you apply the changes, you now get a new full.

Speaker:

That old full is gone from the system.

Speaker:

Hmm.

Speaker:

Yeah.

Speaker:

have the most recent copy, right?

Speaker:

And normally for backup, because you want to be able to go back further in

Speaker:

time, you need to make sure you are preserving those previous copies as well.

Speaker:

And so it

Speaker:

Yeah.

Speaker:

require a technology like deduplication, which allows you to efficiently store

Speaker:

multiple copies, snapshots, whatever technology is available on the backend

Speaker:

in order to make that possible.

Speaker:

Yeah, so snapshots and replication.

Speaker:

A great example of forever incremental technology, right?

Speaker:

Each new, um.

Speaker:

Each time you do a snapshot and a replication, you get a new, something

Speaker:

that looks like a full, uh, when really all you only transferred was the, the,

Speaker:

you know, the bites that have changed.

Speaker:

And then, um, I can, I can immediately think of when I was,

Speaker:

when you were talking, I was like, what is he talking about?

Speaker:

And then I realized what you were talking about.

Speaker:

So here's an example of forever incremental that isn't what we're

Speaker:

talking about, and that's replication.

Speaker:

It

Speaker:

Yep.

Speaker:

Right.

Speaker:

Re just regular replication without anything

Speaker:

Yep,

Speaker:

beside it.

Speaker:

You end up with each new, every time you do a a, an if, if you want to call

Speaker:

it that, in an incremental backup, all you're left with is the latest

Speaker:

version and you can't go back in time.

Speaker:

You know, you know what I was actually thinking of when I made those comments?

Speaker:

Oh, what were you,

Speaker:

Oracle incremental merge

Speaker:

oh, okay.

Speaker:

because Oracle incremental merge applies to changes on top of a full, but it

Speaker:

does not preserve the previous versions.

Speaker:

Right.

Speaker:

Yeah, that's a good point.

Speaker:

Um, which, which wouldn't it, that's really just a, a type

Speaker:

of, synthetic full, right?

Speaker:

Yeah, yeah, I didn't even realize that, by the way.

Speaker:

Um, so how do you preserve the history?

Speaker:

That's why

Speaker:

Do you just.

Speaker:

do you need a application that, or a storage system that

Speaker:

allows you to take snapshots or

Speaker:

okay.

Speaker:

copy of the file before it

Speaker:

gotcha.

Speaker:

Why do

Speaker:

where was it?

Speaker:

Curtis?

Speaker:

Why do I still remember these things?

Speaker:

It's why you're here?

Speaker:

So we've defined what a

Speaker:

Mm-hmm.

Speaker:

a little bit about the benefits,

Speaker:

Yeah.

Speaker:

Being able to quickly restore because you have the full copy, uh, being independent,

Speaker:

um, not having to transfer all the data.

Speaker:

Right?

Speaker:

Um, can

Speaker:

And also also cost savings.

Speaker:

Right, right.

Speaker:

Yeah.

Speaker:

you think of any cons?

Speaker:

Well, the, the only thing you know, one of the things I said

Speaker:

in the beginning was that.

Speaker:

In order to do forever, incremental, successfully, you've

Speaker:

got to store the data on disc.

Speaker:

And, and again, not, not to be, you know, pro tape and all that stuff,

Speaker:

but we did an episode relatively recently where we talked about how you

Speaker:

have to, uh, protect the backup disc.

Speaker:

The only con that I can think of is that your entire backup is just sitting

Speaker:

there ready to be attacked, right.

Speaker:

Yeah,

Speaker:

Um.

Speaker:

I have, I have a

Speaker:

Go ahead.

Speaker:

You have, you have a con, okay?

Speaker:

So having worked with vendors, and I'm not saying this has happened, but

Speaker:

it is a potential thing you should think about is when you have forever

Speaker:

incrementals, You're always re.

Speaker:

Copying over the existing backup

Speaker:

Mm-hmm.

Speaker:

bug that somehow makes, that corrupts a part of your backup, right?

Speaker:

You've now impacted how many versions back because you're never getting a

Speaker:

full copy of that data again, right?

Speaker:

It's always

Speaker:

Right,

Speaker:

And so

Speaker:

Yeah.

Speaker:

be one of the cons a

Speaker:

Yeah.

Speaker:

approach.

Speaker:

That was, that was a, um, a critique of ddu, uh, because that

Speaker:

has the same issue, a critique of dedupe when Dedupe first came out.

Speaker:

And all you can say is that you just do your best to ensure

Speaker:

that that's not happening.

Speaker:

And that's why we do fingerprinting and, and, you know, integrity

Speaker:

checks and all that kind of stuff.

Speaker:

your backup.

Speaker:

Yes.

Speaker:

And also test your backups.

Speaker:

Right.

Speaker:

Test your backup, test your backup, test your backups.

Speaker:

Um, but go ahead.

Speaker:

I do have another one too.

Speaker:

If you, unless you

Speaker:

Oh, you do?

Speaker:

Okay.

Speaker:

Go ahead.

Speaker:

The other one I also wanted to mention is, right if Forever incremental backups

Speaker:

are so great, why isn't everyone using it?

Speaker:

Mm-hmm.

Speaker:

one of the challenges is it requires vendor support.

Speaker:

Yeah.

Speaker:

application to support it.

Speaker:

So VMware is supported for virtualization.

Speaker:

companies support it for their own applications, but

Speaker:

not everyone supports it.

Speaker:

Not everyone makes it available.

Speaker:

Yeah.

Speaker:

so you sort of are limited in terms of what applications, what

Speaker:

APIs are available, and can you get the changes that you need in

Speaker:

order to be able to create your forever incremental in the backend.

Speaker:

Exactly, and, and know, I can think of modern database applications

Speaker:

that do support it, right?

Speaker:

Um, I can think, for example, uh, when you back up Salesforce, right?

Speaker:

You're backing up at the object level and it's very easy to.

Speaker:

Uh, and, and let me rephrase that, because Salesforce uses

Speaker:

the term object to mean something very different than what we mean.

Speaker:

Um, because why should we all use the same terms?

Speaker:

of course.

Speaker:

A Salesforce object is like the user's table, right?

Speaker:

Um, so I'm talking about like, I made an update to the, to a, you know, a

Speaker:

field in a record in a table, and, um.

Speaker:

Uh, you can back up that change as right, and, and, and you can do it incrementally

Speaker:

or you can do it independently.

Speaker:

And so it would be very easy to store all of those, those

Speaker:

objects in a, in a way that.

Speaker:

It, you know, would, would foster this idea of a forever incremental.

Speaker:

Um, whereas like Oracle for example, there really isn't a way to do it.

Speaker:

The, at best what we can get out of Oracle is the concept of a, um, synthetic full,

Speaker:

uh, and the synthetic full that they have.

Speaker:

Has the issue that you, that you mentioned before, whereas once you

Speaker:

do the synthetic full, you don't have the previous incrementals.

Speaker:

Um, so yeah, it, that is, that is definitely a, um, it, it

Speaker:

requires your backup vendors to support it, and it requires your

Speaker:

application vendors to support it.

Speaker:

Yeah.

Speaker:

Absolutely.

Speaker:

I can't remember where I was going before you brought up that other, um, I, I,

Speaker:

I will say this, that, you know, the title of this is, you know, how Forever

Speaker:

Incremental Backup changed the world.

Speaker:

It, it, it really has, right?

Speaker:

The idea, when you look at these modern backup products, the

Speaker:

idea of doing a, of a, of a full backup is just, it's just gone.

Speaker:

Right?

Speaker:

Um, all of the downsides to doing a regular full backup.

Speaker:

Are, are gone.

Speaker:

Uh, the, the, the, the, the compute considerations, the

Speaker:

network considerations, the storage considerations, they're all gone without

Speaker:

any negative like impact to the restore.

Speaker:

In fact, it's the opposite.

Speaker:

The restore is now, since we know what a full looks like when we go to do the

Speaker:

restore, that is to going back to Harken, back to the original way we did restores.

Speaker:

We're no longer restoring the same file multiple times during a single restore.

Speaker:

Why would we do that?

Speaker:

We know where all the files are, we know where all the blocks are, whatever

Speaker:

it is that we're, we're restoring.

Speaker:

We know what the latest version of that block here file is, and

Speaker:

we're just gonna restore that one.

Speaker:

I have a story.

Speaker:

Um, how you do forever, incremental Matters.

Speaker:

The original, the OG, forever incremental product is TSM.

Speaker:

mm-hmm.

Speaker:

And when, and the way it was implemented back then was to tape.

Speaker:

And they, tried to bring this concept of forever incremental to the world without

Speaker:

changing the underlying storage technology because back then it was, it was, uh.

Speaker:

tape.

Speaker:

It was, uh, if you, if you brought disc into it, it was, it was crazy talk, right?

Speaker:

So the, so the, so, so they can get a lot of credit for trying to

Speaker:

accomplish this, this, um, this idea.

Speaker:

But I, I still think when we look back to TSM of old, I, I think it was, I

Speaker:

think it was a failed attempt because my earlier statement of the only proper way

Speaker:

to do forever incremental is to do it on disc when they were doing it on tape.

Speaker:

What it meant was that when you, um, when you went to go to do a restore, the, the,

Speaker:

the full was on like 5,700 tapes, right?

Speaker:

sounds worse than just doing your traditional fulls and incrementals.

Speaker:

Yeah, well, it was, and I remember, I, I remember doing a, there was a one

Speaker:

particular restore that I remember doing using TSM in a production environment.

Speaker:

And I, I don't remember the actual numbers, but I'll, I'll just use like,

Speaker:

uh, I'll just use fake numbers just to, to, to to, to describe what it was like.

Speaker:

It was like we were doing like a 10 gigabyte restore and it took like.

Speaker:

Two weeks because of how many hundreds of tapes that it was.

Speaker:

Thousands of tapes.

Speaker:

Um, because of the number of files, the number of tapes that they were

Speaker:

on, and how long it takes to load and unload an individual tape.

Speaker:

Right?

Speaker:

Um, and they had technologies, uh, called reclamation and they, they,

Speaker:

they had a number of technologies that tried to mini minimize this issue.

Speaker:

Um, but

Speaker:

It still existed.

Speaker:

well, yeah, it still is existed.

Speaker:

And also some people, they would run out of time in the day to do reclamation.

Speaker:

Reclamation is where you take a bunch of tapes that, um, like they were, that

Speaker:

most of the files on those, on those tapes were no longer needed because

Speaker:

they, they'd expired and then you take, yeah, I've got a bunch of files that are

Speaker:

10% and then you copy them to one tape

Speaker:

It's

Speaker:

to that's now a hundred percent.

Speaker:

It's like garbage collection, but on tape.

Speaker:

And it was a very, um, labor intensive process and, um, IO intensive process.

Speaker:

And as a result, it took a long time.

Speaker:

And so some people would, would turn it off.

Speaker:

Um, and, uh, I think that's what happened in, in, in this case.

Speaker:

And so there, there were literally just, I don't rem there were so many

Speaker:

tapes that we had to restore and it, you know, it sounds like I'm bagging

Speaker:

on T-S-M-T-S-M is still around.

Speaker:

It's now called, uh, spectrum Protect.

Speaker:

Yeah, but the addition of bringing in disc, I think makes this, this whole

Speaker:

issue kind of, uh, not a big deal,

Speaker:

Yeah.

Speaker:

but, um, but yeah, I still think that you look at all of the modern backup

Speaker:

products that have come out in the last 10 plus years, and they're all like this.

Speaker:

They're all one full, followed by a whole bunch of incrementals, and

Speaker:

the world is a better place for it.

Speaker:

And, um, you young whippersnappers have never had to deal with

Speaker:

tape

Speaker:

incremental backups.

Speaker:

You just don't know

Speaker:

what

Speaker:

what you uh, yeah.

Speaker:

You don't know the pain that you, that Yeah.

Speaker:

That you had.

Speaker:

You know, it's funny, this morning I was hanging out with some people and I

Speaker:

was talking to 'em, and they're, it's this lady who shares my birthday, but

Speaker:

she's about 15 years older than me,

Speaker:

Hmm.

Speaker:

we started talking about all of the technology that we

Speaker:

remember when it came out.

Speaker:

Right.

Speaker:

And the first, the first.

Speaker:

Piece of technology I remember being introduced in my life was the, the VCR

Speaker:

Hmm.

Speaker:

and it, it came, they, I remember them wheeling one into the classroom

Speaker:

Hmm.

Speaker:

in my school.

Speaker:

And it was gigantic.

Speaker:

Right.

Speaker:

It was like, and it was the kind where you put the tape in and push it down, right?

Speaker:

Yeah.

Speaker:

Um, anyway, this is, this is like that, this is like the

Speaker:

way, the way we did backups.

Speaker:

And by the way, you know.

Speaker:

I'm, I'm glad I, I'm glad I remembered this because you asked a question earlier.

Speaker:

Why is it that not everybody does things in this new way?

Speaker:

Yeah.

Speaker:

Because backup is the stickiest application in the data center.

Speaker:

Yep.

Speaker:

So many people are still using the same backup application

Speaker:

that they were 15 years ago.

Speaker:

Some people change their applications like they change

Speaker:

their underwear, but other people.

Speaker:

They just, it, it's a pain.

Speaker:

It's costly, it's risk, uh, it's fraught with risk and, and, and

Speaker:

difficulty and a giant learning curve.

Speaker:

And during that learning curve, your, your data's at risk.

Speaker:

And so most people just, they'd rather stick with the ugly, old backup product.

Speaker:

it.

Speaker:

Because it works.

Speaker:

Yeah.

Speaker:

And, and, and so it, it, it gave birth to incrementally helpful

Speaker:

technologies like deduplication.

Speaker:

Don't change anything.

Speaker:

Just send your backups over here instead of sending your backups over there.

Speaker:

And we'll get rid of the duplicates and we'll make it sort of look like a,

Speaker:

an incremental forever, but just keep doing it the dumb old way because we

Speaker:

don't want you to have to change your backup product in order to get to,

Speaker:

and, and I guess this is, you know.

Speaker:

If, if you're one of those people that's still using the backup, the

Speaker:

same backup product that you had 15 years ago and you're still using it

Speaker:

the same way you were using it 15 years ago, and you're not using incremental

Speaker:

forever as a, a true technology in your world, maybe time to think about it.

Speaker:

but I, I would probably say.

Speaker:

Newer workloads, people are probably using that, right?

Speaker:

Because once again, environments are a mishmash of technologies, right?

Speaker:

Yeah.

Speaker:

people probably do have those old databases, old applications,

Speaker:

which they are using an old infrastructure backup infrastructure

Speaker:

for, and for their virtualization or other newer workloads, they're

Speaker:

probably using something different.

Speaker:

Yeah.

Speaker:

Uh, remember I did that, I did that survey a few, uh, months ago and

Speaker:

I asked them how many backup apps they had, and 30% had 30, I think

Speaker:

more than 30% had three or more backup apps in their environment.

Speaker:

Yep.

Speaker:

And it's because of what you're talking about.

Speaker:

They have, they start using.

Speaker:

Containers and their current backup app doesn't know what to do.

Speaker:

And so they use that for the new technology and yeah.

Speaker:

Yeah, that's very common.

Speaker:

And I guess what I'm saying is maybe it's time to check out the water of

Speaker:

the, of, of truly incremental backups and, and experience all the beauty.

Speaker:

Yeah.

Speaker:

All right.

Speaker:

Well thanks for the chat again.

Speaker:

Anytime Curtis.

Speaker:

Anytime.

Speaker:

All right.

Speaker:

And we wanna thank our listeners.

Speaker:

Yo, you're why we're here, you know, otherwise, just two guys sitting here

Speaker:

talking over a couple of microphones.

Speaker:

Uh, that is a wrap.