Methodologies and Techniques for Semantic Management of Documents in Dematerialization Processes
Amato, Flora (2009) Methodologies and Techniques for Semantic Management of Documents in Dematerialization Processes. [Tesi di dottorato] (Inedito)
Full text disponibile come:
Knowledge management has become a challenging issue for almost all the e-Government based applications. One of the main issues for E-Government activities is to manage the great amount of available data efﬁciently. The presence of a huge amount of information, in fact, is typical of bureaucratic processes, like the ones pertaining to public administrations. Such information is often recorded on paper or on different digital ﬁles and its management is very expensive, both in terms of space used for storing and in terms of time spent in searching for the documents of interest. Furthermore, the manual management of these documents is absolutely not error-free. In order to efﬁciently access the information embedded in very large document repositories, techniques for semantic document management are required. They ensure a large and intense process of dematerialization and aim at eliminating or at least reducing, the amount of paper documents. E-Government based applications need proper data models for information content characterization, in order to automatically transformunstructured (or sometimes semi-structured) documents into formally structured records, suitable for machine processing. Furthermore a way for presenting information contained in documents, depending on access policies and available technologies has to be provided. Finally different kinds of media elements, contained in digital documents, have to be managed. Indeed, nowadays, almost all the novel bureaucratic processes are characterized by both text and multimedia data (e. g. audio, still images, sometimes videos), which need to be properly handled, stored and distributed. In this thesis, we present a novel model of digital documents for improving the dematerialization effectiveness, that constitutes the starting point for an information system able to manage documents streams efﬁciently. This model takes into account E-Government applications needs like as the respect of provisions in force and the adaptability to evolving technologies. At the best of our knowledge, the proposed model is one of the ﬁrst attempts to give a single and uniﬁed characterization for the management of multimedia documents, pertaining to a bureaucratic domain as the E-Government one, on which a system of semantic procedures are deﬁned for the transformation of the non structured documents (pertaining to specialized domain) into structured data. Furthermore, architecture for the management of the document whole life cycle has been proposed, which provides advanced functionalities for semantic processing, such as giving formal structure to document informative content, information extraction, semantic retrieval, indexing, storage, presentation, together with long-term preservation.
Solo per gli Amministratori dell'archivio: edita il record