History is Haunting me

So I was building an improvement in SmartGit. Not like it's often done these days, but rather the old-fashioned (a bit of artificial help - creating random test-repositories isn't something I look forward to at all - was very welcome indeed) way, typing the code.

As it so happens, I'm quite new to this codebase, language, standard library and whatnot - you could say I'm a total beginner, which means learning this stuff on the go is the name of the game. No problem per se, but producing some readable, well-fitting code in the process is a bit of a challenge.

It's even more challenging to do so while not producing only a single commit in the end (which would've turned out too big to be reviewable), but a clean commit history where every commit tells the chapter of a story.

Learning on the go, as you likely know, involves a lot of exploration and backtracking. There's areas to get to know, concepts to learn, patterns to detect (in various flavours, as an established codebase like SmartGit wasn't built in a day, a year, or even a decade), new language features to come accustomed to (and other language features you find missing, only to find out the idiomatic way to perform the same in the new language), and so on. It's elevated when you're learning a new programming language, but realistically every sizable project has its own language and feel, given enough development.

I'm a fan of clean commits, and it should come as no surprise that at syntevo we're taking version control seriously, but still: Keeping my branch clean from the get-go was impossible for me. There were formatting changes (left overs from explorations), not using the language correctly, breaking all the coding guidelines left and right (I'm looking at you, final, and also you, var, and don't even get me started on generics), and some commits didn't even build, even though they absolutely did when I committed them, I swear! But hidden in all this trash there was also some useful stuff, good refactorings, and my improvement somewhere buried in between.

Anyway, calling my losses and starting from scratch wasn't very appealing, so I got myself a coffee and stared at my git history for quite a while. Split this, undo that, move this one up, … - some might call this crazy, as the goal was to come up with a branch whose' head was tree-identical (well, not absolutely identical, turns out you do find issues when reading code) to the one I already had. But who is going to read my story when I leave in all the edit marks and redactions? I had to do this, and luckily I knew that git is totally able to support me in such a journey.

Full disclosure: We're using SmartGit while developing SmartGit, but for some tasks a commandline is hard to beat. I got to learn quite a few new things about git, and by far the most useful git command here was

git rebase -i `git merge-base feature/my-awesome-smartgit-improvement develop`

which makes it a breeze to scuttle through the history and get it cleaned up. What does it do, a rebase of the feature branch to the merge-base of the feature branch? What a nonsensical thing to do?! But rest assured, it's not, it's just that git is often naming commands after their implementation, and not after their purpose. Translated, the above command basically means.

reset my feature-branch back to the merge base
from there, let me decide what to do with each commit which came afterwards
and also let me modify the contents, if I so want
until we reach HEAD

Can you do this with SmartGit as well? Well, I found out the hard way, but it turns out you can: Just right-click on the first (i.e. the ‘oldest’) commit you want to modify and select Modify…, and you get essentially the same thing in SmartGit, with a nice UI.

Suddenly a clean commit history got much more approachable, and the history quickly went from an almost uncomprehensible meta-horror-story to a nice novel with a rhythm and an ending. Nice.