Best practices for layering multiple images?

I want to recreate this animation here:

My problem is not so much how to create the animations, but how to layer the images and make them stay where they’re supposed to without it getting messy. I understand the z-index part. I just don’t know how to go about making them overlap. I’m assuming it has to do with negative margins?

Is there perhaps a video tutorial where someone makes an animation similar to this? One with many images? That’s how I learn best.