Latent Semantic Analysis (LSA) is a method that allows us to automatically index and retrieve information from a set of objects by reducing the term-by-document matrix using the Singular Value Decomposition (SVD) technique. However, LSA has a high computational cost for analyzing large amounts of information. The goals of this work are (i) to improve the execution time of semantic space construction, dimensionality reduction, and information retrieval stages of LSA based on heterogeneous systems and (ii) to evaluate the accuracy and recall of the information retrieval stage. We present a heterogeneous Latent Semantic Analysis (hLSA) system, which has been developed using General-Purpose computing on Graphics Processing Units (GPGPUs) architecture, which can solve large numeric problems faster through the thousands of concurrent threads on multiple CUDA cores of GPUs and multi-CPU architecture, which can solve large text problems faster through a multiprocessing environment. We execute the hLSA system with documents from the PubMed Central (PMC) database. The results of the experiments show that the acceleration reached by the hLSA system for large matrices with one hundred and fifty thousand million values is around eight times faster than the standard LSA version with an accuracy of 88% and a recall of 100%.
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2y4tClK
via IFTTT
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δημοφιλείς αναρτήσεις
-
Publication date: Available online 4 January 2018 Source: European Journal of Radiology Author(s): Peiyao Zhang, Jing Wang, Qin Xu, Zhen...
-
Publication date: March 2017 Source: Free Radical Biology and Medicine, Volume 104 from #AlexandrosSfakianakis via Alexandros G.Sfak...
-
Dtsch med Wochenschr DOI: 10.1055/s-0043-100054 Hintergrund und Fragestellung Ein etablierter Weg, die optimale Behandlung von Tumorpatien...
-
Background Hyperthyroidism is associated with increased thrombotic risk. As contact system activation through formation of neutrophil extrac...
-
Editors' Recognition for Reviewing in 2020 No abstract available Establishing Satellite Lung Cancer Screening Sites With Telehealth to A...
-
Deepak Thapa, Vanita Ahuja, Deepanshu Dhiman Indian Journal of Anaesthesia 2017 61(12):1012-1014 from #AlexandrosSfakianakis via Alexa...
-
Abstract Limited memory size is considered as a major bottleneck in data centers for intelligent urban computing. It is shown that there e...
-
ecancer is supporting #BowelCancerAwarenessMonth https://t.co/opXxCAAxzE from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inore...
-
Abstract Silvicultural models are often developed and applied without due consideration of fire modelling. Yet, this information is import...
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου