Java library for reading Word documents

javams-word

Is there an open-source Java library for reading Word documents (both .docx and the older .doc format)?

Read-only access if sufficient; I do not need to modify the Word documents using Java. However, I would like to have access to images and style information.

EDIT

I've checked out Apache POI, but it doesn't look like it is being actively maintained. See http://poi.apache.org/hwpf/index.html:

At the moment we unfortunately do not have someone taking care for HWPF and fostering its development.

Best Answer

Apache POI HWPF for .doc and XWPF for .docx files