I'm going to take a quick first cut at this (great Q BTW!):
Would imposing a structure on the large project (i.e. into smaller
sub-projects) slow the compiler down?
Not by enough that it matters, the overhead is actually in Maven invocations.
Also, I have a slight concern on what impact this might have editing
time in IDEs (we principally use Intellij). Intellij seems to build
each project in turn through the dependency tree - i.e. if C depends
on B depends on A, and I change A, it won't try to build B unless A
compiles, and so on. Arguably that's advantageous, but I have found
that if - for example, I change an interface in A that is widely used
in B and C, it takes some time to fix all the errors from that
change...
Different IDEs have their different strengths with regards to Maven bindings and dependency management. The current state of play seems to be that it mostly just works on the latest Eclipse, Netbeans and IntelliJ - but you will have to teach your developers the emergency double whammy of "Refresh source files from disk and rebuild all related maven projects".
I find I'm having to do that less and less these days though. Having an SSD drive makes a massive difference here BTW.
snip factory classes paragraphs
Dependency management is incredibly important, regardless of what technology (Maven/Ivy/whatever) use use to help you implement it.
I'd start by getting the extensive reporting out of the Maven dependency plugin and take stock of what you've got. Generally speaking you set the dependency in the dependency management of the POM as high up the food chain as possible, but no higher. So if two of your submodules use an external dependency, then haul that into their parent POM and so on and so forth.
Upgrading external JARs should always be done as a proper mini-project. Evaluate why you're upgrading, alter source code to take advantage of any new features/bug fixes etc. Just bumping the version without this analysis and work will get you into trouble.
So, in general, my questions are:
Does anyone have any experience of breaking up large projects? Are there any >tips/tricks that you would be willing to share?
- Interfaces and Dependency injection are your friend.
- Michael Feather's book on dealing effectively with legacy code is a must read.
- Integration tests are your friend
- Splitting the sub projects into foo-api (interfaces only!) and foo-core and having modules only depend on the foo-api helps a great deal and enforces separation
- Jboss Modules and/or OSGi can help enforce clean separation
What impact did this have on your development and build times?
Very minor impact on dev and build times - a massive gain in time for our overall continuous delivery pipeline.
What advice could you offer on structuring such a break-up of such a project?
Do the little things right and the big things tend to fall out cleanly afterwards. So split things off bit by bit - don't do a massive restructure of the whole lot unless you've got a high percentage of coverage with your integration tests.
Write integration tests before the split - you should more or less get the same result(s) after the split.
Draw diagrams of the modularity you have now and where you want to get to. Design intermediate steps.
Don't give up - some Yak shaving now builds the foundation for being able to "rapidly build and refactor without fear" Shameless plug -> from Ben and I's The Well-Grounded Java Developer.
Best of luck!
Best Answer
It depends.
You need to think about how your product will be used by consumers. For example, if you are creating a library that will be used by others inside Maven, then you should not include the external dependencies in the jar - let Maven handle it by providing a proper pom.
However, if instead you are deploying a completed application for non-technical users, then you absolutely do want to keep everything as simple as possible for them, and you should provide the dependencies inside the Jar.
In short, think about who your users are and what their use case would be, and then decide accordingly.