Apache XMLSchema is a general purpose schema model that can be used when a Java object tree representation of an XML schema is required. This short tutorial explains how XMLSchema can be utilized.

Structure and Dependencies

The core XMLSchema classes have no third party dependencies. However it depends on the XMLUnit and JUnit libraries for unit testing. XMLSchema 2.0 requires Java 5 or higher.

The structure of the XMLSchema model is quite straightforward. It has a strict specification bound hierarchy of classes that represents each and every schema component. It is not based on an interface-implementation model which allows extensions and different implementations. However, the schema specification is quite stable and complete, hence a change is unlikely, which makes XMLSchema sufficient for almost all needs of schema handling.

Reading a Schema

The reader for the XMLSchema model is called the SchemaCollection ( It has a read method that returns a XmlSchema object which represents the whole schema. The XmlSchema instance returned can be used to access types and elements of the relevant schema by their qualified name.

The following code fragment shows how a file can be read through the SchemaCollection.

InputStream is = new FileInputStream(fileName);
XmlSchemaCollection schemaCol = new XmlSchemaCollection();
XmlSchema schema = StreamSource(is));

Navigating the Schema Model

Navigation of the model once the XmlSchema model is obtained is also quite straight forward. All top level elements and types are available through the schema object as either instances or can be accessed directly if it can have a QName reference. For example, if the qualified name of an element is known, then getElementByName method can be used to extract the XmlSchemaElement object directly from the schema object. The following code sample shows how such direct methods can be used to extract schema objects

XmlSchemaType schemaType = schema.getTypeByName(TYPE_QNAME);
XmlSchemaElement elem = schema.getElementByName(ELEMENT_QNAME);

Note that the TYPE_QNAME and ELEMENT_QNAME represents QName objects.

Printing the Schema Model

Printing of the model once the XmlSchema model has been modified or constructed in-memory, is also quite straightforward. Schema object has a write method that can use an output stream.

The following code fragment shows how to write the schema into the System output stream.



Apache XMLSchema is quite a versatile piece of code that can be used to manipulate and generate XML Schemas. It has minimum dependencies and can be used inside another project with ease.