XML Database Products:

Content (Document) Management Systems

Copyright 2000-2008 by Ronald Bourret

WARNING!

This list has not been updated since roughly 2010. As a result, information may be out of date and products may no longer be available. If you are interested in a product whose link does not work, search for it on the Web, as product pages frequently change and products/companies are frequently sold.

Overview

Content management systems are systems for storing, retrieving, and assembling documents from document fragments (content). They generally include such features as editors, version control, and multi-user access. Although they are usually built on top of a database (some are built on top of the file system), this is generally hidden from the user.

Notes:

Products

Amaxus XML Content Management System
Developer: Box UK
URL: http://www.boxuk.com/server/show/nav.00100a/amaxus-content-management-system.html
License: Commercial
Database type: Relational (ODBC, JDBC, native APIs)
Entry last updated: March, 2004

From the Web site:

"Amaxus is a browser-based content management system based on a relational database (SQL Server, MySQL and Oracle currently supported, on Windows, Linux or Solaris). Binary files are stored securely on the file system, for performance, with all text extracted and indexed in the database."

"All data is exposed as XML, and (server-side) XSL is used for presentation, allowing output to most formats, including (X)HTML, PDF, RTF, RDF, WML etc. All necessary XSLT is provided, and can be edited through the CMS."

"The software (PHP, Java and XML) is highly modular, allowing users to customise existing code and create additional modules as necessary."

"Source code to the core system is provided, together with technical documentation and optional developer training. The system is object-oriented, with full system-architecture documentation provided for further development."

"Features of the product include:

  • Roles based security (including NTLM/LDAP integration)
  • Audit trail and Versioning
  • WfMC compliant Workflow (XML defined business processes)
  • Personalisation engine, allowing secure areas/content and modification of content based on user-defined conditions (e.g. browser/user profile/ip address/search history/viewed content/etc.)
  • Dublin Core based metadata, including user-definable classification systems
  • Integrated image manipulation suite, including automatic thumbnail creation
  • Integrated full-text search engine, based on metadata and full-text, includes keyword, phrase, boolean and wildcard searches.
  • Intuitive, desktop-based interface, designed for non-technical users
  • Choose from a large, ever expanding library of modules (including Forum, Polls, Forms, Quizzes, Calendar, Contacts) which can be added at any stage."

[Ed. -- With regard to storage, some values from XML documents are extracted and stored in non-BLOB columns. However, the majority of XML is stored in BLOB columns.]
Arca Document Management System
Developer: 3Squared
URL: http://www.3squared.co.uk/arca_document_management_system.html
License: Commercial
Database type: Native XML (TEXTML Server)
Entry last updated: January, 2007

Arca Document Management System is a document management system built on top of IXIASOFT TEXTML Server. It can store both XML and non-XML documents; XML documents containing metadata can be associated with non-XML documents. Both full-text and metadata searches can be performed. Search features include wildcards, adjacency, frequency, like, exclusion, and choices. In addition, XML documents or fragments can be reused to build new documents.

Arca Document Management System supports check-in/check-out, versioning, security for individual documents and groups of documents, and output to multiple formats (Word, PDF, etc.). It can be run as a standalone server or integrated into other products.
Astoria
Developer: LightSpeed Software (who bought Chrystal Software)
URL: http://www.lspeed.com/products/ls_p_astoria/frameset.html
License: Commercial
Database type: Object-oriented (ObjectStore)
Entry last updated: December, 2002

Astoria is a content management system built on top of ObjectStore. On input, documents are parsed and stored with one object per element. This allows individual elements (and the fragments they define) to be reused in other documents. (There are three ways to reuse elements/fragments: copying, linking to a specific version of the element, or linking to the latest version.) Other element/fragment-level operations are check-in/check-out, access control, defining locking units, and "where used" information. Astoria can validate documents against DTDs if they are present, but does not require them.

Element-level storage applies to XML and SGML documents. Other documents, such as Word documents or JPEGs, are stored as single objects. Documents can be organized into cabinets (top level) and folders (lower levels).

Search support includes structure and value searches, as well as assembling the results of a search into a new document. Version information is saved as changes from the previous version, and Astoria includes utilities to compare versions at the element or document level.

Other features include C++, Java, and Python APIs, workflow tools, local and Web-based GUI tools for exploring the database, and integration with XMetaL, Epic, and Framemaker+SGML.
Cascade Server
Developer: Hannon Hill Corporation
URL: http://www.hannonhill.com/products/cascade-server/index.html
License: Commercial
Database type: Relational (Oracle, MS SQL Server, MySQL)
Entry last updated: November, 2005

From the company:

"Cascade Server is a content management system whereby all the content in the system is exposed as XML. Individual templates, blocks, and pages are managed as XML and transformed with XSL. The system publishes resulting content out to independent servers by way of FTP, SFTP, or a mapped network drive thereby making the published content decoupled from the CMS."

"Cascade Server has an XML-based workflow engine as well as an XML-based data definition engine for rapidly creation custom data structures that can be outputted using any desired schema."

"Cascade Server also acts as a web-based document management system"
CMS
Developer: Sorman
URL: http://www.sorman.se/products/cms/index.asp
License: Commercial
Database type: Object-oriented (POET)
Entry last updated: November, 2000

From the Web site:

"...provides XML and SGML publishers with component-level content management for streamlined content creation and production. CMS' advanced collaboration, reuse, versioning, and integration capabilities benefit publishers through more productive collaboration, better reuse of effort, and shorter creation cycles. CMS is flexible enough to accommodate multiple DTDs, data types, applications, custom interfaces, legacy systems, and best-of-breed tools-as part of a streamlined XML / SGML publishing solution."

CMS was formerly known as POET Content Management Suite.
Content@XM
Developer: XyEnterprise
URL: http://www.xyenterprise.com/packages.asp
License: Commercial
Database type: Relational (Oracle, Informix)
Entry last updated: December, 2002

Content@XML (formerly Parlance Content Manager) is a content management system built on top of a relational database. (Oracle and Informix are supported.) Documents are parsed into fragments according to user-specified element types, which define the roots of document fragments. The resulting fragments are then stored as BLOBs in the file system, with parent fragments maintaining pointers to child fragments. Fragments can then be reused to build new documents.

Users can also specify which attributes or PCDATA-only element types are to be extracted and used as properties. These properties can then be used in SQL searches. Full-text searches are supported as well.

Content@XML is not limited to XML and SGML documents. Because it stores documents as BLOBs, it can store any type of document, such as Word documents or JPEGs. Other features include versioning, change logs, check-in/check-out facilties, workflow, the ability to integrate relational data, COM, Corba, and Java APIs, GUI tools, and integration with editors such as XMetaL, Epic, and Framemaker.
Documentum
Developer: Documentum, Inc.
URL: http://www.documentum.com/products/launch/documentum_5.htm
License: Commercial
Database type: Relational (Oracle, Sybase, SQL Server, DB2)
Entry last updated: December, 2002

Documentum is a content management system based on a relational database. (Oracle, Sybase, SQL Server, and DB2 are supported.) Documents can be stored either as BLOBs in the database or in a secure area on the file system. The latter is commonly used for documents which are large enough that retrieval speed from the database is an issue. Documents are stored in a user-defined hierarchy, which is viewed through an Explorer-like GUI. Although documents are stored intact at the physical level, users can define two different types of views into them.

The first type of view is a fragment view, in which user-defined element types define the heads of fragments. The fragments then appear in the GUI as if they were separate documents, possibly in user-defined subdirectories. Fragments can be reused in other documents, either as snapshots or through links. (Of interest, referential integrity is enforced for these links. That is, it is not possible to delete a document if there is a link to a fragment in it.) Since fragments are views of documents, changes to the document are reflected in the fragment and vice versa.

The second type of view is a property view. Users define which attributes or (PCDATA-only) elements of a document or a fragment are to be treated as properties. These can then be viewed and updated through a properties dialog box, which also includes Documentum-defined properties such as document or fragment name and update time.

Documentum is not limited to XML documents. Because it stores documents as BLOBs or in the file system, it can store any type of document, such as Word documents or JPEGs. Like XML documents, these documents (or fragments of them) can be used to build other documents.

Documentum supports full-text searches -- including searches on phrases, metadata, and wildcards, as well as word stemming and breaking -- through the Verity search engine. It also supports searches on document properties. These can be defined by Documentum (such as document date), the values of particular elements or attributes in an XML document, or the value of particular content in non-XML documents. (An add-on is required for the latter.) Which properties are indexed can be specified in a configuration file.

Other features of Documentum include an API for building applications, integration with various XML editors (such as Epic), validation using DTDs and XML Schemas, XSLT support, check-in/check-out, versioning, workflow, access control lists, and integration of data from SAP, PeopleSoft, Siebel, and Excel.
Dynabase
Developer: Red Bridge Interactive
URL: http://www.ebt.com/products/dynabase/index.html
License: Commercial
Database type: Object-oriented (ObjectStore)
Entry last updated: December, 2001

From the Web site:

"[Dynabase] combines sophisticated content management capabilities with a powerful, dynamic serving environment. Designed for use by distributed work groups throughout an organization, DynaBase brings control, extensibility, and automation to the e-business team. DynaBase enables authors, developers, production personnel, and managers to collaborate directly on Web site projects over the Internet and corporate intranet. It is designed to handle all types of content and work with a variety of editors, enabling e-business teams to easily select and use the best tools for each task."

Red Bridge Interactive can be found on the Web at http://www.rbii.com.

Dynabase was bought from eBusiness Technologies, formerly known as Inso.
eidonXportal
Developer: eidon
URL: http://www.eidon-products.com/englisch_pages/products.htm
License: Commercial
Database type: Relational
Entry last updated: November, 2000

From the Web site:

"The eidon software will transform your source-data stock to a unified, homogenous format, then link it automatically to other information sources for retrieving references, for assigning the references found to reference targets, and finally for inserting the explicit reference marks into the documents. The documents are administered both in their initial form and that of the intermediate processing forms. The eidonXbase and eidonXlink modules store, administer, and link XML and SGML text and CGM4 graphics as single data records in a relational database, thus enabling extremely fast and precise access."
engenda
Developer: Red Bridge Interactive
URL: http://www.ebt.com/products/engenda/index.html
License: Commercial
Database type: Object-oriented (ObjectStore/Dynabase)
Entry last updated: December, 2001

engenda is a "content management and workflow automation solution" that stores documents in an object-oriented database. Features include:

  • Indexing of structured documents such as XML and HTML down to the tag level. Indexes are version- and edition-aware.
  • Check-in/check-out with definable user privileges and check-in/check-out comments.
  • Version control, including difference history.
  • Arbitrary document metadata (name/value pairs).
  • Editions (release points).
  • Logical merging of separate content repositories.

Red Bridge Interactive can be found on the Web at http://www.rbii.com.

Engenda was bought from eBusiness Technologies, formerly known as Inso.
entrepid
Developer: Red Bridge Interactive
URL: http://www.ebt.com/products/entrepid/index.html
License: Commercial
Database type: Relational (Oracle 8i)
Entry last updated: December, 2001

entrepid is a "content management, workflow automation and delivery solution for ... e-businesses" that is is based on Oracle 8i and stores "XML, HTML, text, binary BLOBs, and associated metadata".

It "includes three browser-based client applications: the entrepid Console application for content management administration, the entrepid Workflow application for workflow, and one for forms-based editing."

"The server-side components of the entrepid system reside within the Application Server environment. The engine ... interacts with the LDAP server and the Oracle repository (via SQL calls through JDBC)."

"In addition, entrepid enables users to write their own custom applications using the public entrepid content management, workflow or delivery [Java] APIs."

Red Bridge Interactive can be found on the Web at http://www.rbii.com.

Entrepid was bought from eBusiness Technologies, formerly known as Inso.
The Environment
Developer: Ecosystems Design
URL: http://www.eco-online.com/products.htm
License: Commercial
Database type: Unknown
Entry last updated: February, 2002

From the Web site:

"The Environment is a state-of-the-art XML content management application. Incorporating Ecosystems LiveOutline Technology, an innovative simple to use graphical interface and a central Meta Data repository for storing and managing XML."
EXPLOit
Developer: Document Solutions
URL: http://www.documentsolutions.it/versioneinglese/products.htm (English)
http://www.documentsolutions.it/versioneitaliana/prodotti.htm (Italian)
License: Commercial
Database type: Relational (SQL Server)
Entry last updated: December, 2005

EXPLOit is a content management systems that stores fragments of XML or SGML documents in NTEXT (variable-length Unicode) fields in SQL Server. With each fragment, it also stores a pre-defined set of metadata, such as author, revision number, creation date, and status. EXPLOit can also store non-XML content, such as graphics files.

Users decide what constitutes a fragment based on how they want to reuse individual parts of a document. They then build complete documents from fragments, where the complete document contains references to the fragments to include. (Fragments can include references to other fragments as well.) Fragments are identified by their name and their language, which allows fragments translated into different languages to have the same name. This simplifies the process of recreating a book in a new language, as only language IDs need to be changed.

EXPLOit uses FrameMaker to assemble the XML and non-XML content in a document and publish it to a variety of formats, such as PDF, Postscript, XML, and Word. EXPLOit supports DTDs, user roles, workflow, check-in/check-out, and a GUI tool for managing fragments and documents.
Frontier
Developer: UserLand Software
URL: http://frontier.userland.com/, http://frontier.userland.com/stories/storyReader$190
License: Commercial
Database type: Object-oriented
Entry last updated: November, 2000

From the Web site:

"... an integrated Internet content system built around a high performance object database, outliner, scripting, and a multi-threaded runtime, built-in web server, editorial groupware and industry-leading syndication-aggregation and distributed computing technology. Link management, macros, rendered pages. ... Frontier can store, manage and generate XML-based information ... [and has] full support for XML-RPC."
GEMt
Developer: X.Systems, Inc.
URL: http://www.xsystems.com/software.htm
License: Commercial
Database type: Native XML (Tamino)
Entry last updated: January, 2007

GEMt is a content management system that uses Tamino as its underlying data store. It stores both native XML and other types of content in Tamino, with a "metadata token" in Tamino. For XML content, this token is inserted directly into the XML document, allowing unified searching of content and metadata. For non-XML content, the metadata token is managed separately in Tamino with a link to its corresponding content. This allows users to use Tamino's query capabilities to search XML content, as well as the metadata for non-XML content.

GEMt provides workflow through the notion of "events". An event is generated when an action, such as adding a file to the system, renaming a file, or checking a file out for editing, meets a specific criteria. For example, if author A checks a file into the system, this might generate an event that causes the file to be placed in the queue for editor B to edit, or it might cause the file to be published to a Web site. Events are logged so that the history of a given document can be tracked. Users can configure GEMt workflows by defining events, constraints on those events, and actions to take in response to those events.

GEMt comes with a built-in set of actions, but also allows users to create their own actions. These are user-written programs and are known as Event Dependent Processes (EDPs). Like built-in actions, EDPs are run in response to certain events. The system passes a variety of information to them, including the document(s) that triggered the event. The EDP can then perform whatever actions it wants, possibly based on the contents of the document or other variables. EDPs can be written in a variety of langauges, including C, Perl, Tcl/TK, and OmniMark.

GEMt ships with the Productivity+PLUS toolset, which includes support for linking, revision management, and versioning. This can be configured to use XLink or DTD/schema-mandated links for linking XML documents and records all links in a link database. The link database allows the system to warn users before they delete link targets, generate link reports (such as determining where given content is used), and validate links. Versioning is done by storing all versions in a single document, where each version is marked by a versioning tag. A controlled language tool may be purchased separately.

GEMt supports the creation of compound documents through the use of external general entities and XIncludes. Each entity is stored separately in the database, which means it can be managed separately from the other entities in the compound document. That is, it can have its own versioning, security, etc.

GEMt also supports a number of output creation and portal management functions based on XQuery. The system can act as a Web server, accepting user requests in the form of XQuery statements, and can produce output directly from the database, apply XSLT stylesheets to the output, or merge the output data with HTML template files.

GEMt supports the Epic and ADEPT editors from Arbor Text and can be extended to work with other editors as well. It also includes an administration tool for configuring the system and managing users and workgroups, including access control.
GroveMinder
Developer: Epremis
URL: http://www.epremis.com/index.html
License: Commercial
Database type: Unknown
Entry last updated: February, 2002

From the Web site:

"...enables a comprehensive information framework that encompasses the entire organization. The framework allows information to flow seamlessly not only within a department, but throughout an organization, even among diverse databases and computer systems. GroveMinder is a document management system. It manages relationships between information objects (any piece of data or meta-data: documents, document components, databases, records, images, sensor output, etc.) With the GroveMinder system, all information objects, data and metadata alike, are equally easy to access."

Note: Epremis bought GroveMinder from TechnoTeacher. It is not clear what the status of GroveMinder is. (February, 2002)
iENGINE
Developer: LightSpeed Software
URL: http://www.lspeed.com/products/iengine/frameset.html
License: Commercial
Database type: Pluggable
Entry last updated: February, 2002

iENGINE is a content management system that stores content as XML documents. It can use Astoria, Interwoven, Broadvision, or Documentum as its underlying data store and provides check-in/check-out, roles-based security and versioning. It supports a "wide spectrum of authoring processes ... including forms-based methods, structured authoring tools and popular application software." Using Astoria, it can assemble new documents from fragments of multiple, existing documents.

iENGINE includes iINTEGRATOR, which can be used to transform data as well as to integrate data from other data sources, including files, application data, and relational databases. Tranformations can be done on both inbound and outbound data. Thus, iENGINE can be used to store and manage content in any format, as long as it is possible to write iINTEGRATOR templates that can convert that format to/from XML.
Ingeniux Content Management System
Developer: Ingeniux Corporation
URL: http://www.ingeniux.com/x313.xml
License: Commercial
Database type: Relational
Entry last updated: January, 2003

From the company:

"The Ingeniux Content Management System (ICMS) is a system for creating and managing enterprise content. Ingeniux CMS is XML-based, and uses XML internally for all content storage and representation. It consists of two parts -- the Ingeniux CMS Publisher and the Ingeniux CMS Server and includes an XML repository for storage and versioning of content. Ingeniux CMS can also access data in any JDBC, ODBC, or ADO database.

Ingeniux CMS Publisher Features

  • Distributed, Web-based application
  • Full WYSIWYG editing of content
  • Preview site on different brands and devices
  • Drag-and-drop asset creation for docs, art and media
  • Automatic on-the-fly navigation
  • Workflow customizable on a page-by-page basis
  • Node-by-node access permissions in content tree
  • Built-in support for industry-standard databases

Ingeniux CMS Server Features

  • Java and Windows versions
  • Client detection for XML/XSLT binding
  • Server-side HTML transformation for non-XML compatible clients
  • Client-side transformation for XML compatible clients
  • Support for syndication and cobranding
  • Fully extensible through a robust API
  • Compatible with .NET
  • Database support: ODBC, JDBC, and ADO
  • Real-time processing of XML from external sources"
InsightBuilder
Developer: Polar Design
URL: http://www.insightbuilder.net/keyfeatures/index.shtml
License: Commercial
Database type: Relational (ODBC)
Entry last updated: June, 2004

InsightBuilder is a content management system that stores XML and non-XML documents (such as images) as "objects" in a relational database, presumably as BLOBs. Output documents are built by applying XSLT templates to these "objects". It appears that output documents can integrate external data as well through the IBObject API, which apparently can be used to abstract external data sources such as relational databases or application data. Documents can be published to a variety of targets, including Web servers and local directories, using FTP, HTTPS, and Web services. Users can schedule publication times.

InsightBuilder supports role- and group-based security (with privileges assignable at a variety of levels, from documents to entire Web sites), version control, and the integration of third-party software. It also includes GUI-based tools for managing and editing documents; the editor appears to be an (X)HTML editor.
Life*CDM
Developer: Corena
URL: http://www.corena.de/english/lifecdm.shtml
License: Commercial
Database type: Relational (Oracle)
Entry last updated: February, 2002

Life*CDM (Compound Document Manager) is a content management system that provides an element-level repository. Unlike most content management systems, the contents of each document are stored according to the DTD used by the document, rather than as a generic document structure. In particular, elements are stored as individual objects with attributes stored as properties of those objects. Data (PCDATA, Word documents, graphics, etc.) are stored as BLOBs. To increase retrieval speed, multiple objects (elements) may be stored in a single binary structure, although each remains individually addressable.

Life*CDM supports versioning (including the ability to print only changed pages), workflow management, HyTime links, and support for binary objects such as graphics. It uses Epic (from Arbortext) as an editor and includes a document composer for PostScript and PDF output. It is programmatically accessible through C, C++, and TCL APIs.
Lucid e-Globalizer
Developer: Lucid'i.t.
URL: http://www.lucid-it.com/en/solution/index.html
License: Commercial
Database type: Native XML (Lucid XML Data Manager)
Entry last updated: November, 2002

From the Web page:

"Lucid'i.t. solution is a software suite enabling you to manage, produce, translate, store and publish mono or multilingual information collection. In more detail, Lucid e-Globalizer [contains] a set of features and functions enabling the management of enterprise data: a workspace manager, a workflow manager, a space for editing and access to localization tools, a dissemination system, [and] a set of connectors. [It also contains] a set of features and functions for administration, management and supervision: a search function, document version management, a data synchronization function, a localizable administration interface, a statistical function and an audit function."

Lucid e-Globalizer is built on the Lucid XML Data Manager, a native XML database.
One-to-One Content
Developer: BroadVision
URL: http://www.broadvision.com/
License: Commercial
Database type: Unknown
Entry last updated: February, 2002

BroadVision One-to-0ne Content (formerly Interleaf BladeRunner) "manages the complete content lifecycle including design, creation, management, deployment/repurposing, and distribution." The repository provides check-in/check-out facilities, can manage multiple versions of the same document, provides role-based access, and tracks metadata. Additional facilities create audit trails and support workflows. Output formats include PDF, RTF, PostScript, and HTML; XSLT and XSL-FO are supported as well. The system has an API for programmatic access, and can be extended through user-written modules.

One-to-One Content also includes Content Creator, which allows users to edit XML documents using Microsoft Word. This requires users to map Word styles to element types in a DTD using a proprietary mapping language.
Rhythmyx Web Content Management
Developer: Percussion Software
URL: http://www.percussion.com/products/content-management/rhythmyx/technical-specifications/generic-3018.html
License: Commercial
Database type: Relational
Entry last updated: November, 2000

Rhythmyx Web Content Management is a content management system. It can deliver documents to the Web, as well as non-Web targets, such as print or external applications. It stores documents in a relational database (Oracle, Microsoft SQL Server, or IBM DB2) and offers the usual set of content management capabilities (check-in / check-out, security, full-text search (from Convera), and so on). Rhythmyx content can be accessed from any WebDAV-compliant editor, and Rhythmyx itself can be accessed as a Web Service or through a JSR 170 API.

Web pages and other documents can be assembled from content extracted from more than 200 file types. Documents can also be built using templates written in Velocity, a proprietary scripting language. Output formats include HTML, XML, and PDF, among others.

Rhythmyx Web Content Management comes with GUI tools for editing, workflow, layout, and integrating external documents and data.
Prowler
Developer: infozone
URL: http://www.infozone-group.org/prowlerDocs/html/proposal.html
License: Open Source
Database type: Object-oriented (ozone), relational (JDBC), etc.
Entry last updated: Summer, 2001

Prowler is a content management framework that provides a transactional XML facade over multiple back-end data sources. Data sources are integrated through adapters, allowing extensibility to any back end. Prowler ships with adapters for ozone, the file system, POP3/IMAP mail boxes, and relational databases (through DB2XML).

For queries, Prowler supports XPath. Each query consists of two parts: a query that selects a set of documents (possibly from different back ends) and a query that selects elements from within those documents. Thus, a single query can be used to retrieve data from multiple documents in multiple back ends. For updates, Prowler uses the XUpdate update language from the XML:DB Initiative.

Transactions are supported through the Java Transaction API (JTA). This supports two-phase commits across multiple data sources. (It is not clear if two-phase commits are actually supported yet.) Prowler ships with a JTA-conformant transaction manager; other managers may be plugged in.

Prowler also contains a user manager and supports linear versioning. Prowler is part of the Infozone framework.
RSuite CMS
Developer: Really Strategies
URL: http://www.rsuitecms.com/
License: Commercial
Database type: Native XML (Mark Logic)
Entry last updated: January, 2007

RSuite CMS is a content management system based on MarkLogic Server, a native XML database. It can store any type of document (XML, Word, PDF, graphics, etc.) and allows XML content to be checked in or out, edited, versioned, reused, or assigned metadata at the node level.

XML documents can be searched with RQL, a proprietary query language that is translated into XQuery for execution on the underlying database. RQL supports XML-aware full-text searches -- that is, full-text searches restricted to certain elements or attributes -- as well as metadata searches. Metadata can be externally assigned values (such as import date) or values inside the document (such as element or attribute values) that have been designated as metadata. Non-XML documents can be searched through externally assigned metadata and (presumably) can be searched using full-text search. In addition to RQL, XQuery is available through a user interface available only to developers and administrators.

RSuite CMS supports both DTDs and XML Schemas. It features check-in/check-out, workflow, versioning, and LDAP-based security, and can be accessed programmatically through Java and .NET APIs. Other features include management tools, a customizeable browser-based UI (including XML editor), a customizeable forms editor (which allows users to edit selected metadata and content via a form), integration with other editors, import and export wizards, and a sandbox for playing with new content.
SCHEMA ST4
Developer: SCHEMA GmbH
URL: http://www.schema.de/sitehtml/site-e/schemast.htm
License: Commercial
Database type: Relational (Oracle, SQL Server)
Entry last updated: November, 2008

SCHEMA ST4 is a content management system designed for single-source publishing. It consists of three main components. The SCHEMA ST4 Application Server implements the basic content management functionality and uses either Oracle 9i or SQL Server 2000 as its underlying database. The SCHEMA ST4 Index Server performs searches using the the Oracle 9i Text Cartridge (on Oracle) or the Microsoft Search Service (on SQL Server 2000). It can perform full-text searches, metadata searches, searches for particular elements and attributes in XML documents, and searches in Word and PDF files stored in the database. Finally, the SCHEMA ST4 Speed Server performs caching, using a subscription service to ensure that caches are up to date. The Speed Server is a stand-alone module and can be located on the client or server. In addition, Speed Servers can act as clients for other Speed Servers, allowing a hierarchy of caches to be constructed.

SCHEMA ST4 can store XML documents and binary files, such as graphics files; it apparently stores documents in CLOB or BLOB columns. Individual documents can be combined into "frameworks". A framework is a hierarchy of nodes and links, where nodes point to a single piece of content, such as an XML document or a graphic stored in the database, and links define the hierarchy by linking individual nodes. For example, suppose you have several different models of the same product and need a separate user's manual for each model. Each manual would have some procedures common to all models and some procedures unique to that particular model. Assuming there are separate XML documents for each procedure, you could define a framework for each manual, with nodes in each framework pointing to the relevant procedure documents.

Nodes and links are instances of node classes and link classes, where each class defines a particular structure. For example, a node class for a user's manual might have a link to an introduction node and one or more links to procedure nodes. A particular user's manual would then be defined as an instance of that class. As in an object-oriented programming language, classes can inherit from other classes. For example, a node class for a user's manual might inherit from a more general node class for a book. In addition, pre-defined and user-defined metadata can be specified for nodes, links, node classes, link classes, and entire frameworks.

Output documents are created by applying ST4 Transformation Templates (STTs) to frameworks. STTs are "derived from XSLT" and transform a particular framework into the specified format. For example, different STTs could be used to transform a user's manual framework into a Word document, an XHTML document, or a help document. A number of output formats are supported, including HTMLHelp, WinHelp, FrameMaker, Word, XML, and HTML. Transformations can be started manually or programmatically; the latter is useful when SCHEMA ST4 is used to serve Web pages.

A single framework can have a number of variations. For example, a framework for a user's manual might have different variations for each language in which the manual is written. That is, each node points to multiple XML documents -- one for each target language. Or graphic nodes might point to multiple graphic files, such as a JPEG file for Web output and an SVG file for XML output.

SCHEMA ST4 comes with a number of different clients for building frameworks and managing the system. Most are user-configurable and can use XML Spy, XMetaL, or Microsoft Word as editors. (In the case of Word, you must map the document template to an XML DTD.) Other tools include a workflow module, language translation filters, and the ability to import or export data via XML or ADO.NET. SCHEMA ST4 also supports programmatic access via a .NET-based API, Web access via SOAP, group-based authentication, streaming of binary data, and document versioning.
SiberSafe
Developer: SiberLogic
URL: http://www.siberlogic.com/products/sibersafe/
License: Commercial
Database type: Relational (JDBC)
Entry last updated: February, 2002

From the company:

"SiberSafe ... is a 100% Java, TCP/IP, HTTP and WebDAV-enabled multithreaded, load-balanced XML repository server that provides ... XML content management functionality in the following areas:

  • User-defined fragmentation of XML documents;
  • Fragment-level storage, locking and retrieval of XML documents;
  • Fragment-level versioning of XML documents;
  • Fragment-level indexing and search of XML documents;
  • XML document dependency tracking and external entity management;
  • Publishing XML documents into various formats, including PDF, RTF, HTML etc;
  • Fully integrated workflow control, including tracking of tasks and assignments;
  • Project-wide fragment-level branching and merging;
  • Multi-language translation automation;
  • Windows NT, WebDAV and HTTP (browser) clients available out-of-the-box;

SiberSafe can also store documents other than XML, such as images and DTDs. SiberSafe is DTD neutral and can work with any user-defined DTD ... [It] can run on any JDBC-compatible database, however is by default configured for Microsoft Access."
Developer: STEP Electronic Publishing Solutions GmbH
URL: http://www.step.de/sigmalink.htm
License: Commercial
Database type: Relational
Entry last updated: November, 2000

From the Web site:

"SigmaLink is an intranet-based document handling and editing system with full support for SGML and XML. Some of the advantages of SigmaLink are:

  • Designed for publishers and their special needs.
  • Handles both [XML] and [non-XML] documents and therefore supports migration to [XML] over the long term.
  • Built of standard components, such as an SQL database system for full-text search, workflow, known SGML editors, etc.
  • Offers both online and offline work modes, making it possible to use the system in, for example, home offices."

[Ed.-Any errors in translation are mine.]
TeamSite
Developer: Interwoven
URL: http://www.interwoven.com/products/
License: Commercial
Database type: Relational, file system
Entry last updated: February, 2002

From the Web site:

"TeamSite ... provides the core content management and collaboration functions for moving business processes to the Web. As a central component of the Content Infrastructure system, TeamSite enables parallel development, supports all content creation tools, elminates content retrofit, enables whole-site versioning, and provides ... ease-of-use."

TeamXML, an optional module, provides support for managing XML documents. The DataDeploy module allows relational data to be serialized as XML as well as to "synchronize XML content and metadata from the ... TeamSite repository to ... relational databases." OpenDeploy provides synchronization with the file system. Finally, the Templating module allows XML documents to be generated from a variety of sources, including the TeamSite repository.
Vasont
Developer: Vasont Systems
URL: http://www.vasont.com/software/index.asp
License: Commercial
Database type: Relational (Oracle)
Entry last updated: March, 2002

Vasont (formerly Target 2000) is a "cross-media publishing system" based on an Oracle database. It stores data separately from tags, allowing that data to be combined and published in a variety of formats, such as SGML, XML, HTML, and RTF. It features "drag-and-drop, cloning, and check-in/check-out capabilities" and has an API that allows users to "customize their interface". Other features include workflow and the ability to automatically propagate content changes to all "products" (documents?) that use that content.

Vasont is integrated with "third-party editing software and compositions systems, including XMetaL, Epic, Framemaker+SGML, [and] 3B2."
Vignette Content Suite
Developer: Vignette Corp.
URL: http://www.vignette.com/CDA/Site/0,2097,1-1-1489-2890,00.html
License: Commercial
Database type: Unknown
Entry last updated: February, 2002

Vignette Content Suite supports content management tasks such as, "workflow, versioning, and cache management." It includes GUI tools for a number of tasks, including "the creation of browser-based content management applications for the Vignette content management engine." It also includes tools for logging, analyzing, and reporting user interactions.

Vignette Content Suite can integrate data from a large number of applications (e.g. PeopleSoft, SAP, Lotus Notes, Word, Excel), as well as databases (ODBC, JDBC). Integration is apparently done by first converting the data to XML, then merging it.
XCMS
Developer: KwareSoft Inc.
URL: http://203.236.253.101/eng/xcms.asp
License: Commercial
Database type: Relational (JDBC)
Entry last updated: December, 2002

XCMS is a content management system that can use any relational database through JDBC. On input, documents are parsed and stored at the level of individual elements and attributes. These can then be reused to create new documents. Documents can be validated against a DTD or XML Schema, but this is not required. They are arranged into collections based on their DTD or XML Schema. If they do not use a DTD or XML Schema, they are placed in a separate collection. XCMS can also store non-XML documents, such as audio and video files.

XPath, full-text, and document property searches are supported. All searches can be done over the entire database or against a particular collection. A schema-driven, GUI-based tool exists to help build XPath queries. In addition, documents can be transformed with XSLT. To increase search speed, XCMS constructs indexes on document content, structure, and properties.

Other features include versioning, check-in/check-out, document privileges (public, group, private), and a built-in editor, which can be used to edit complete documents or just element and attribute values.
XDocs
Developer: Bluestream Database Software Corp.
URL: http://www.bluestream.com/products/xdocs10
License: Commercial
Database type: Native XML (XStreamDB)
Entry last updated: September, 2006

XDocs is a content management system built on top of XStreamDB. Stored objects are known as "resources", and can be XML documents or binary files. Each resource has an associated MIME type and XML-based metadata, which has a few predefined fields (validation status, description, keywords) and is arbitrarily extensible. Resources are organized into a hierarchy of user-defined collections. XML documents can, through the user interface, be broken into components, which can then be checked out and edited separately. (The document itself stays intact.)

XDocs supports full-text searches over documents, in which keywords can be ANDed, ORed, or a treated as a phrase. Searches can be restricted to a portion of each document with an XPath expression; a subset of XPath -- for example, predicates ([]) are not allowed -- is used for this purpose. Search results are returned in the form of a list of documents that satisfy the search. Searches can also be performed against document metadata and by resource status (checked out, read only, etc.) or lock owner.

XDocs also supports pipelines, which are used to transform documents for export or display. A pipeline consists of a generator node (the ID of a document or an XQuery expression), zero or more transformer nodes (which apply XSLT stylesheets or XPath expressions), and a serializer node (which serializes the document as XML or some other format, such as PDF). Pipelines can be parameterized, such as with the IDs of resources or the URIs of XSLT stylesheets to use. A number of predefined pipelines are provided, such as to convert an XML document to HTML or PDF.

XDocs can be configured to "understand" any XML vocabulary. This configuration information consists of such things as schemas, CSS stylesheets, XSLT stylesheets, predefined XQuery expressions, and information about links. These are used, for example, to display documents in a browser and to configure editors such as XMetal. XDocs is shipped with configuration information for DITA.

XDocs has both Swing and browser-based GUI tools, which allow users to perform such tasks as checking documents in and out, browsing the database, and viewing documents as HTML or PDF. It also has a tool for building Web sites from documents stored in the database. Security consists of user authentication and access control lists, which allow a user or group to perform (or not perform) certain operations on resources or collections. XDocs also provides a Java API.
X-Hive/Docato
Developer: X-Hive Corporation
URL: http://www.x-hive.com/products/docato/index.html
License: Commercial
Database type: Native XML (X-Hive/DB)
Entry last updated: May, 2005

X-Hive/Docato is a content management system based on the X-Hive/DB native XML database. It consists of two modules: X-Hive/Docato Composer and X-Hive/Docato Browser.

X-Hive/Docato Composer performs content management and is built on top of X-Hive/DB. It adds such features as workflow management, user and security management, audit trails and activity logs, integration with XML editors such as XMetal and Arbortext Epic, a bulk data loader, and an impact analyzer for determining the effects of schema changes.

X-Hive/Docato Browser is a framework for building Web browser-based interactive electronic technical manuals (IETMs). It is based on Apache Struts, which uses a model-view-controller paradigm for building Web applications. X-Hive/Docato Browser also uses Tiles, which are an extension to Struts for building Web pages from smaller fragments. This allows each segment (tile) of a Web page to execute its own JSP request. The X-Hive/Docato Browser uses XQuery for database queries, XSLT for display, and a proprietary XML language for specifying the user interface. It is shipped with libraries for connecting to and querying the database, custom XML tag libraries, various code snippets, and command line libraries for various administrative tasks.