Norbert Fuhr Livres





Focused access to XML documents
- 456pages
- 16 heures de lecture
This book constitutes the thoroughly refereed post-conference proceedings of the 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, held at Dagstuhl Castle, Germany, in December 2007. The 37 revised full papers presented were carefully reviewed and selected for presentation at the workshop from 50 initial submissions. The papers are organized in an ad hoc track and 6 topical sections on book search, XML-mining, entity ranking, interactive, link-the-wiki, and multimedia.
Comparative evaluation of XML information retrieval systems
- 554pages
- 20 heures de lecture
This book provides a comprehensive overview of various methodologies and evaluations from INEX 2006, focusing on XML retrieval and related tasks. It covers the Wikipedia XML Corpus and evaluation measures, including sensitivity analysis for choosing ideal recall bases. The ad hoc track is discussed alongside methods for preferential unification of plural retrieved elements. Contributions from organizations like CSIRO and IBM highlight dynamic element retrieval and indexing techniques. The text explores efficient XML retrieval using summaries and evaluates structured information retrieval, multimedia retrieval, and probabilistic approaches. Interactive and heterogeneous collection tracks are examined, along with the taxonomy of XML retrieval use cases. The document track includes insights into XML mining and classification based on structural and content similarities. Techniques such as graph neural networks, contextual self-organizing maps, and conditional random fields for document transformation are also presented. Overall, the book encapsulates the advancements and methodologies in XML information retrieval, providing valuable insights for researchers and practitioners in the field.
Advances in XML information retrieval and evaluation
- 556pages
- 20 heures de lecture
Content-oriented XML retrieval has been receiving increasing interest due to the widespread use of eXtensible Markup Language (XML), which is becoming a standard document format on the Web, in digital libraries, and publishing. By exploiting the enriched source of syntactic and semantic information that XML markup provides, XML information retrieval (IR) systems aim to implement a more focused retrieval strategy and return document components, so-called XML elements – instead of complete documents – in response to a user query. This focused retrieval approach is of particular bene? t for collections containing long documents or documents covering a wide variety of topics (e. g., books, user manuals, legal documents, etc.), where users’ e? ort to locate relevant content can be reduced by directing them to the most relevant parts of the documents. Implementing this, more focused, retrieval paradigm means that an XML IR system needs not only to ? nd relevant information in the XML documents, but it also has to determine the appropriate level of granularity to be returned to the user. In addition, the relevance of a retrieved component may be dependent on meeting both content and structural query conditions.
Advances in XML information retrieval
- 438pages
- 16 heures de lecture
The ultimate goal of many information access systems (e. g., digital libraries, the Web, intranets) is to provide the right content to their end-users. This content is increasingly a mixture of text, multimedia, and metadata, and is formatted according to the adopted –W3C standard for information repositories, the so-called eXtensible Markup L- guage (XML). Whereas many of today’s information access systems still treat do- ments as single large (text) blocks, XML offers the opportunity to exploit the internal structure of documents in order to allow for more precise access thus providing more specific answers to user requests. Providing effective access to XML-based content is therefore a key issue for the success of these systems. The aim of the INEX campaign (Initiative for the Evaluation of XML Retrieval), which was set up at the beginning of 2002, is to establish infrastructures, XML test suites, and appropriate measurements for evaluating the performance of information retrieval systems that aim at giving effective access to XML content. More precisely, the goal of the INEX initiative is to provide means, in the form of a large XML test collection and appropriate scoring methods, for the evaluation of content-oriented XML retrieval systems.