I'm going to take a quick first cut at this (great Q BTW!):
Would imposing a structure on the large project (i.e. into smaller
sub-projects) slow the compiler down?
Not by enough that it matters, the overhead is actually in Maven invocations.
Also, I have a slight concern on what impact this might have editing
time in IDEs (we principally use Intellij). Intellij seems to build
each project in turn through the dependency tree - i.e. if C depends
on B depends on A, and I change A, it won't try to build B unless A
compiles, and so on. Arguably that's advantageous, but I have found
that if - for example, I change an interface in A that is widely used
in B and C, it takes some time to fix all the errors from that
change...
Different IDEs have their different strengths with regards to Maven bindings and dependency management. The current state of play seems to be that it mostly just works on the latest Eclipse, Netbeans and IntelliJ - but you will have to teach your developers the emergency double whammy of "Refresh source files from disk and rebuild all related maven projects".
I find I'm having to do that less and less these days though. Having an SSD drive makes a massive difference here BTW.
snip factory classes paragraphs
Dependency management is incredibly important, regardless of what technology (Maven/Ivy/whatever) use use to help you implement it.
I'd start by getting the extensive reporting out of the Maven dependency plugin and take stock of what you've got. Generally speaking you set the dependency in the dependency management of the POM as high up the food chain as possible, but no higher. So if two of your submodules use an external dependency, then haul that into their parent POM and so on and so forth.
Upgrading external JARs should always be done as a proper mini-project. Evaluate why you're upgrading, alter source code to take advantage of any new features/bug fixes etc. Just bumping the version without this analysis and work will get you into trouble.
So, in general, my questions are:
Does anyone have any experience of breaking up large projects? Are there any >tips/tricks that you would be willing to share?
- Interfaces and Dependency injection are your friend.
- Michael Feather's book on dealing effectively with legacy code is a must read.
- Integration tests are your friend
- Splitting the sub projects into foo-api (interfaces only!) and foo-core and having modules only depend on the foo-api helps a great deal and enforces separation
- Jboss Modules and/or OSGi can help enforce clean separation
What impact did this have on your development and build times?
Very minor impact on dev and build times - a massive gain in time for our overall continuous delivery pipeline.
What advice could you offer on structuring such a break-up of such a project?
Do the little things right and the big things tend to fall out cleanly afterwards. So split things off bit by bit - don't do a massive restructure of the whole lot unless you've got a high percentage of coverage with your integration tests.
Write integration tests before the split - you should more or less get the same result(s) after the split.
Draw diagrams of the modularity you have now and where you want to get to. Design intermediate steps.
Don't give up - some Yak shaving now builds the foundation for being able to "rapidly build and refactor without fear" Shameless plug -> from Ben and I's The Well-Grounded Java Developer.
Best of luck!
You have 2 options:
- by git way: use submodules. Here is a documentation how git manage submodules [git submodules][1]. I personally didn't use it but it looks to fit your problem.
- by maven way: in maven it is not mandatory that your root project (configuration) to be hierarchically the parent directory of all your projects. You can have a structure like that:
configuration
+-- pom.xml (configuration:XXX)
project1
+-- pom.xml (project1:1.0-SNAPSHOT)
!
+-- module11
! +-- pom.xml (1.0-SNAPSHOT)
+-- module12
+-- pom.xml (1.0-SNAPSHOT)
project2
+-- pom.xml (project2:2.0-SNAPSHOT)
!
+-- module21
! +-- pom.xml (2.0-SNAPSHOT)
+-- module22
+-- pom.xml (2.0-SNAPSHOT)
configuration
, project1
and project2
are on the same directory level and each one could be a git repository. When build project1
or project2
you run the maven command from project1
or project2
level and maven will try to fetch the parent (configuration) from maven repository not from parent directory. You should pay attention to versions. I would recommend in project1
or project2
to keep a reference to a parent (configuration) with a release version.
To make a release you have to do it in 2 steps: release configuration first and release project second. Project1 and project2 can evolve independently and doesn't have to have the same configuration version as a parent.
Just for special cases when you want to have both configuration and projects as SNAPSHOT versions, in project1 or project2 you can use the <relativePath>
tag inside <parent>
tag to point to your local path of configuration. I don't recommend this because will create problems on development environment (at least for me in Eclipse)
I apologize for my English.
[1]: http://git-scm.com/book/en/Git-Tools-Submodules
Best Answer
You should not be doing this. Parent POMs have
pom
as their packaging, and even if you make it work technically, you break expectations and many Maven plugins will not forgive you.I would refactor the code into the two modules (as you said), and if the issue is resolved someday, you can restructure the two modules into one.