SciMath: A Mathematical Information Retrieval System using Signature-Based B-Tree Indexing
Abstract: Given a mathematical query, traditional text retrieval systems are not very effective at retrieving mathematical information from scientific documents. This paper presents the design and implementation of a new mathematical information retrieval (MIR) system, SciMath, which takes a mathematical formula as a query and retrieves the scientific documents containing relevant mathematical content, using a B-Tree indexing scheme. The proposed system is then compared with two classical math-aware search engines to demonstrate its effectiveness.
Keywords: Mathematical Information Retrieval (MIR), Ranking, Structure Encoded String (SES), B-Tree indexing.
