Mini-git, Understanding How Files Are Stored in Git Objects

Front page > Programming > Mini-git, Understanding How Files Are Stored in Git Objects

Mini-git, Understanding How Files Are Stored in Git Objects

Published on 2024-08-24

Browse:664

Mini-git, Understanding How Files Are Stored in Git Objects

Yesterday, I set out to implement one of Git's core functionalities on my own—specifically, how files are stored, what Git objects are, and the processes of hashing and compressing. It took me 4 hours to develop, and in this article, I'll walk you through my thought process and approach.

What Happens When You Commit a File?

When you commit a file in Git, several important steps occur under the hood:

File Compression:

The content of the file is compressed using a zlib algorithm to reduce its size. This compressed content is what gets stored in the Git object database.

Hash Calculation:

A unique SHA-1 hash is generated from the compressed file content. This hash serves as the identifier for the file in the Git object database.

Storing the Object:

The object file is stored in the .mygit/objects directory, organized by the first two characters of the hash. This structure makes it easier to manage and retrieve objects efficiently.
Updating Commit Information:

To demonstrate how files are stored in git.
I have implemented commit functionality, taking one file in to consideration

For every file, I have calculated hash
Inside objects folder, new folder is created with name equal to first two characters of hash.
And a file is created inside that folder with remaining hash as name.(this file stores the compressed format of committed file)
Detected changes by comparing newly calculated hash and last calculated hash of the file

Detecting Changes

I implemented this algorithm based on my own approach, but Git uses more efficient algorithms for these operations.

Extracted array of lines from oldContent and newContent
Created a Map to store line as key and index as value
Created two new arrays to store indexes of common lines in oldContent and newContent 4.eg: OldCommonarray = [0 , 3] then deleted lines will be [1,2]

GitHub Repo
Linkedin

Thanks a lot for you time.

Release Statement This article is reproduced at: https://dev.to/keerthivardhan1/mini-git-understanding-how-files-are-stored-in-git-objects-5bfb?1 If there is any infringement, please contact [email protected] to delete it

Latest tutorial More>

Using WebSockets in Go for Real-Time Communication
Building apps that require real-time updates—like chat applications, live notifications, or collaborative tools—requires a communication method faster...

Programming Published on 2024-11-17
$How Can I Find Users with Today\'s Birthdays Using MySQL?$
How Can I Find Users with Today\'s Birthdays Using MySQL?
How to Identify Users with Today's Birthdays Using MySQLDetermining if today is a user's birthday using MySQL involves finding all rows where ...

Programming Published on 2024-11-17
How to Resolve "Unknown column 'sequence_name' in 'where clause'" Error When Using @GeneratedValue GenerationType.TABLE with a Polymorphic Abstract Superclass in MySQL?
@GeneratedValue Polymorphic Abstract Superclass over MySQLIn a Spring MVC application utilizing Hibernate and MySQL, it has been observed that attempt...

Programming Published on 2024-11-17
How do I combine two associative arrays in PHP while preserving unique IDs and handling duplicate names?
Combining Associative Arrays in PHPIn PHP, combining two associative arrays into a single array is a common task. Consider the following request:Descr...

Programming Published on 2024-11-17
How to Access Nested Struct Fields in HTML Templates in Go?
How to Access Struct Fields of Map Elements in HTML Templates in GoThis article addresses the issue of retrieving struct fields from map elements with...

Programming Published on 2024-11-17
$How to Fix \"ImproperlyConfigured: Error loading MySQLdb module\" in Django on macOS?$
How to Fix \"ImproperlyConfigured: Error loading MySQLdb module\" in Django on macOS?
MySQL Improperly Configured: The Problem with Relative PathsWhen running python manage.py runserver in Django, you may encounter the following error:I...

Programming Published on 2024-11-17
How Can I Dynamically Load JavaScript Files and Handle Their Load Events?
Dynamically Loading JavaScript FilesDynamic JavaScript file loading plays a crucial role in modularizing and optimizing web applications. Mainstream J...

Programming Published on 2024-11-17
What Happened to Column Offsetting in Bootstrap 4 Beta?
Bootstrap 4 Beta: The Removal and Restoration of Column OffsettingBootstrap 4, in its Beta 1 release, introduced significant changes to the way column...

Programming Published on 2024-11-17
Tkinter: Python&#s Secret Weapon for Stunning GUIs
Are your Python scripts feeling a bit... plain? Do you find yourself longing for a way to make your code not just functional, but visually appealing t...

Programming Published on 2024-11-17
Why is rune an alias for int32 in Go instead of uint32?
Why is rune an alias for int32 in Go, and not uint32?Despite its primary purpose of representing character values, the rune type in Go is not defined ...

Programming Published on 2024-11-17
How to Securely Implement a Member-Only Page Login System in PHP?
PHP: Secure Member-Only Pages with a Login SystemChallenges with the Provided CodeThe provided PHP code encounters several issues that hinder its func...

Programming Published on 2024-11-17
How do I use escaped percentage signs in CSS class names to create dynamic layout elements?
What does .container.\31 25\25 mean in CSS?The backslash character () is used to escape special characters in CSS, such as the percentage sign (%)$. T...

Programming Published on 2024-11-17
Beyond `if` Statements: Where Else Can a Type with an Explicit `bool` Conversion Be Used Without Casting?
Contextual Conversion to bool Allowed Without a CastYour class defines an explicit conversion to bool, enabling you to use its instance 't' di...

Programming Published on 2024-11-17
How Can I Efficiently Split C++ Strings Using Tokens?
Efficiently Splitting C strings Using TokensFor splitting a C std::string into substrings based on specified tokens, there are several approaches ...

Programming Published on 2024-11-17
How to Preserve HTML Order When Using `float: right` for Spans?
Reversing Span Order with Float:rightIn the provided HTML, spans with the "button" class are styled with "float: right," causing t...

Programming Published on 2024-11-17