Docco
Details
| Size: | 0K |
| Last Update: | 2008-05-31 01:16:09 |
| Version: | 0.4 |
| OS Support: | Linux |
| License/Program Type: | BSD License |
| Publisher: | DSTC Pty.Ltd |
| Price: | $0.00 |
Description:
Docco 0.4 is utilities software developed by DSTC Pty.Ltd.
Docco is a little personal document management system we build on top of Apache's indexing and search engine Lucene. Docco adds user interfaces for indexing and querying to Lucene, where the latter gets enhanced by using Formal Concept Analysis' visualisation techniques.
The tool is able to index local hard drives and everything mounted into the local file system, such as Windows or Unix network drives. It scans for a number of different document formats and creates a database containing which words are contained in which documents.
This allows very fast lookup of keywords and other information like authors, title or location. The keywords used are generated from the bodies of the documents, such that no manual annotation is required.
Docco support the follwing formats:
plain text
HTML
XML
OpenOffice/ StarOffice 6.0 documents
Word (with POI plugin)
Excel (with POI plugin)
PDF (with PDFbox or Multivalent plugin)
UNIX man pages (with Multivalent plugin)
Once an index is created, the query interface allows asking for any documents containing certain keywords and shows how these combine. Once a set of interesting documents is found, they can be selected and will be displayed as tree view, from which they can be opened in the default application.
Requirements:
Java 1.4.2 or later
What's New in This Release:
symlinks are not followed anymore (Linux/UNIX)
index locks are detected and can be removed by the user
extra information for index (contents, mappings) is stored after the index was created, not only on shutdown. This means Docco can access the index even after an unclean exit (it will be locked, though)
support for the RTF format (some of it)
nested diagrams can be created using a new button
Lucene is updated to version 1.9.1, all code has been updated to not have any deprecation warnings
analyzers are now supported, which most importantly means we support stop words and stemming for a number of languages, with the choice of analyzer being attached to each index -thus Docco can query different directories with different language tools
Docco 0.4 supports different languages (including english). It works with Linux.
Downloading Docco 0.4 will take if you use fast ADSL connection.
0 comments
Add to
Docco Version History
| Product |
Date Added |
| Docco 0.4 |
2008-05-31 01:16:09 |
Related Software
|
|
From category: Utilities |
| DocSys 1.09 is utilities software developed by Bryce Harrington. DocSys is a document management system written in Perl and using MySQL for storing metadata about documents. Installation: To... |
|
|
From category: Utilities |
| dnd-list 1.2 is utilities software developed by Callum McKenzie. dnd-list is a utility for determining what drag and drop types a program provides. dnd-list provides a means for determining... |
|
|
From category: Utilities |
| DrQueue 0.64.2 RC3 is utilities software developed by Jorge Daza Garcia-Bla. DrQueue is an Open Source render farm managing software. DrQueue distributes shell based tasks such as rendering images... |
|
|
From category: Utilities |
| cowsay is a simple text filter.... |
|
|
From category: Backup |
| PHOTORECOVERY was developed as an easy to use application that was designed to recover images, movies, and sound files from all types of Digital Media. It was designed to be compatible with Memory Sti... |
|
|
From category: Other-Tools |
| Rubicon Karat Fonts v1.31. Postscript Type1 format. Kabel clone, sans serif font, accurate and well hinted. Matching font metrics, full char set, kerning pairs. For laser, inkjet, typesetter from 300... |
|
|
From category: Utilities |
| Dos2Unix 1.0.0 is utilities software developed by Peter Hanecak. Dos2Unix is filter used to convert plain texts from DOS (CR/LF) format to UNIX format (CR) and vice versa. Installation: \... |
|
|
From category: Other-Tools |
| Tired of putting up with Microsoft Word&039;s bloated file size and price, but still need to deal with documents in Word format? Then you should take a serious look at AbiWord. This open source word... |
|
|
From category: Network-Tools |
| MZL & Novatech Traffic Statistics Linux Server delivers network usage data for Linux gateways. It generates from libpcap IP data record files containing network usage information in bytes broken d... |
|
|
From category: Utilities |
| CSSC 1.0.1 is utilities software developed by James Youngman and Greg Hosler. CSSC is the GNU project\'s replacement for the traditional Unix SCCS suite. CSSC project aims for full compatibility (i... |
|
|
From category: Utilities |
| Freshmeat Submitter 0.0.5 is utilities software developed by Andrew Wood. Freshmeat Submitter project is a Perl script to submit updates to freshmeat.net. fm-submit is a script to submit pro... |
|
|
From category: Utilities |
| Commi 0.3.2 is utilities software developed by Sebastian Block. Commi is a line orientated serial terminal like minicom. It is a fork of CuteCom from Alexander Neundorf. It is free software and dis... |
|
|
From category: Utilities |
| All in One 0.5.0 is utilities software developed by Mario Pascucci. All in One is a dock applet for Fluxbox and similar Window Managers. It shows: memory usage: main, buffers, shared... |
|
|
From category: Utilities |
| configsaver 0.2 is utilities software developed by Chris AtLee. configsaver will synchronize your personal configuration files (such as ~/.vimrc, ~/.bashrc, etc.) using your gmail account. When... |
|
|
From category: Utilities |
| eolfix 0.1.0 is utilities software developed by Ross Smith. eolfix is a command line utility for querying and correcting end-of-line (EOL) characters in ASCII text files. It can convert line... |
Leave a comment