Analyzing Document Archives

It’s a well known saying that 80% of our knowledge is still stored inside documents. it makes sense to pay attention to the management of these documents. At aboutdatajournalism.org we’ve chosen to install the Alfresco ECMS solution. This software offers all the functionality for good document management and integrates well with our Semantic Analyzer.

Main benefits:

  • Automatic rendering from PDF to text thanks to Alfresco Rules
  • Meta-data extraction and categorization
  • User management and security
  • API integration
  • Scripting environment for extra functionality

 

Our solution

We store and manage documents inside the ECMS and extract the rendered text together with the meta-data to our Smart Index Software.