How to make an existing directory within a git repository a git submodule
Solution 1:
Use git submodule absorbgitdirs
This is what the docs state this command does:
If a git directory of a submodule is inside the submodule, move the git directory of the submodule into its superprojects
$GIT_DIR/modules
path and then connect the git directory and its working directory by setting thecore.worktree
and adding a .git file pointing to the git directory embedded in the superprojects git directory.
So instead of starting all over as suggested in the previous answers by @DomQ and myself, one can just add run the following:
- Without removing from the index the submodule, Add the submodule's url to
.gitmodules
and to.git/config
withgit submodule add <url> <path>
- Move the submodule's
$GIT_DIR
directory (.git
in regular repositories) to.git/modules/<path>
withgit submodule absorbgitdirs <path>
Original answer - pre v2.12.0
git submodule absorbgitdirs
was introduced only in v2.12.0-rc0 (see commit).
The Solution is quite simple. It was extracted from here.
-
git rm submodule-dir
This will delete all the files that git was tracking after insubmodule-dir
-
rm -rf submoduledir
This will delete all the other files that might have been left insubmodule-dir
because git ignored them. - Now, we have to commit in order to remove the files from the index:
git commit
After the commit, we cleaned the files that git followed and didn't followed insubmodul-dir
. Now it's time to do: -
git submodule add <remote-path-to-submodule>
This will re-add the submodule but as a true submodule. - At this point it might be a good idea to check
.gitmodules
and see if the submodules have been added successfully. In my case I already had an.gitmodules
file so I had to modify it.
Solution 2:
None of these solutions seemed to work for me so I figured my own:
-
Make sure a new git repo already exist that will hold the content of the new submodule, for example, we'll be using "[email protected]:/newemptyrepo"
-
Navigate to the directory you're modulizing:
cd myproject/submodule-dir
- Remove the to-be submodule from the parent's index:
git rm -r --cached .
- Init a new git repo inside the to-be submodule:
git init
- Set up the origin for the to-be submodule and make your first commit:
git remote add origin [email protected]:/newemptyrepo
git add . && git commit && git push --set-upstream origin master
- Now you must navigate to the parent repo's top-level path:
cd .. && cd `git rev-parse --show-toplevel`
- Finally, add the submodule as you would normally:
git submodule add [email protected]:/newemptyrepo ./myproject/submodule-dir
- Now commit & push the changes the above command makes and you're all set up!
Solution 3:
There is basically no better way than pretending to start over:
- ensure that everything is committed everywhere
- move your sub-repository out of the way
-
git submodule add
from the sub-repository's remote cd mysubmodule
git fetch ../wherever/you/stashed/the/sub-repository/in/step-1
git merge FETCH_HEAD
To explain why this is so, it seems to me than a deeper understanding of what submodules are is needed, than what one can glean from the git-submodule(1)
manual page (or even the relevant chapter from the Git book). I found some more in-depth explanations on this blog post, but since that post is a bit lengthy I am taking the liberty summarize them here.
At a low level, a git submodule consists of the following elements,
- A commit object at the top of the submodule tree,
- (In recent versions of Git) A subdirectory in
.git/modules
to host the Git objects for the submodule, - An entry in the
.gitmodules
configuration file.
The commit object is contained (or more precisely, referenced by SHA1) in the parent tree object. This is unusual, as things usually happen the other way round, but this explains why you see a directory appear in the main repository's git status
after you have performed a commit in the submodule. You can also make some experiments with git ls-tree
to observe this commit object in more detail.
The subdirectory in .git/modules
stands in for a .git
subdirectory in the submodule; and in fact, there is a .git
file in the submodule that points to the former using a gitdir:
line. This is the default behavior since version 1.7.8 of Git. Not sure why everything wouldn't Just Work if you just kept on having a separate .git
directory, except as noted in the release notes you would probably run into trouble when switching between a branch that has the submodule and another that doesn't.
The .gitmodules
file provides the URL that git submodule update --remote
and friends should pull from; which obviously is distinct from the main repository's set of remotes. Note also that .gitmodules
is copied in part into .git/config
by the git submodule sync
command and other commands that invoke it behind the scenes.
While it is fairly easy to do the necessary changes by hand for .gitmodules
+ .git/config
, and also for .git/modules
+ mysubmodule/.git
(and in fact, there is even git submodule absorbgitdirs
for the latter), there isn't really a porcelain to create only the in-tree commit object. Hence the proposed solution by moving + redoing changes presented above.