TY - GEN
T1 - A figure search engine architecture for a chemistry digital library
AU - Choudhury, Sagnik Ray
AU - Tuarob, Suppawong
AU - Mitra, Prasenjit
AU - Rokach, Lior
AU - Kirk, Andi
AU - Szep, Silvia
AU - Pellegrino, Donald
AU - Jones, Sue
AU - Lee Giles, C.
PY - 2013/8/23
Y1 - 2013/8/23
N2 - Academic papers contain multiple figures representing important findings and experimental results; we present a search engine specifically focused on figures in academic documents. This search engine allows users to search on figures in approximately 150,000 chemistry journal articles though the method is easily extendable to other domains. Our system indexes figure caption and mentions extracted from the PDF in documents using a custom built extractor. Recall and precision performance of extracted figures is in the 80 to 90 % range. We give the frame work for the extraction algorithm, architecture and ranking function.
AB - Academic papers contain multiple figures representing important findings and experimental results; we present a search engine specifically focused on figures in academic documents. This search engine allows users to search on figures in approximately 150,000 chemistry journal articles though the method is easily extendable to other domains. Our system indexes figure caption and mentions extracted from the PDF in documents using a custom built extractor. Recall and precision performance of extracted figures is in the 80 to 90 % range. We give the frame work for the extraction algorithm, architecture and ranking function.
KW - Figure search
KW - Information extraction
UR - http://www.scopus.com/inward/record.url?scp=84882283062&partnerID=8YFLogxK
U2 - 10.1145/2467696.2467757
DO - 10.1145/2467696.2467757
M3 - Conference contribution
AN - SCOPUS:84882283062
SN - 9781450320764
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 369
EP - 370
BT - JCDL 2013 - Proceedings of the 13th ACM/IEEE-CS Joint Conference on Digital Libraries
T2 - 13th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2013
Y2 - 22 July 2013 through 26 July 2013
ER -