Latent Semantic Analysis (LSA) is a method for automatically indexing and retrieving information from a set of objects by reducing the term-by-document matrix with the Singular Value Decomposition (SVD). However, LSA has a high computational cost when analyzing large amounts of information. The goals of this work are (i) to improve the execution time of the semantic space construction, dimensionality reduction, and information retrieval stages of LSA using heterogeneous systems and (ii) to evaluate the accuracy and recall of the information retrieval stage. We present a heterogeneous Latent Semantic Analysis (hLSA) system that combines two architectures: General-Purpose computing on Graphics Processing Units (GPGPU), which solves large numeric problems faster by running thousands of concurrent threads on multiple CUDA cores of GPUs, and multi-CPU, which solves large text problems faster in a multiprocessing environment. We run the hLSA system on documents from the PubMed Central (PMC) database. The experiments show that, for large matrices with one hundred and fifty thousand million (1.5 × 10^11) values, the hLSA system is around eight times faster than the standard LSA version, with an accuracy of 88% and a recall of 100%.
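To make the three LSA stages named in the abstract concrete (semantic space construction, dimensionality reduction, information retrieval), here is a minimal single-threaded sketch in plain Python. It is not the hLSA implementation: it uses no GPU or multiprocessing, and it approximates the truncated SVD with power iteration and deflation as a stand-in for a library SVD routine. The toy documents, the query, and all function names are assumptions made up for illustration.

```python
import math

def term_document_matrix(docs):
    """Semantic space construction: a[i][j] counts term i in document j."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    row = {w: i for i, w in enumerate(vocab)}
    a = [[0.0] * len(docs) for _ in vocab]
    for j, d in enumerate(docs):
        for w in d.lower().split():
            a[row[w]][j] += 1.0
    return vocab, a

def truncated_svd(a, k, iters=300):
    """Dimensionality reduction: approximate the k largest singular values
    and left singular vectors via power iteration on A^T A with deflation.
    (Illustrative stand-in for a real SVD routine.)"""
    m, n = len(a), len(a[0])
    g = [[sum(a[r][i] * a[r][j] for r in range(m)) for j in range(n)]
         for i in range(n)]
    sigmas, us = [], []
    for c in range(k):
        v = [1.0 + 0.1 * (i + c) for i in range(n)]  # deterministic start
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(g[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        lam = sum(v[i] * g[i][j] * v[j] for i in range(n) for j in range(n))
        sigma = math.sqrt(max(lam, 0.0))
        if sigma < 1e-9:
            break  # matrix has rank lower than k
        us.append([sum(a[r][j] * v[j] for j in range(n)) / sigma
                   for r in range(m)])
        sigmas.append(sigma)
        for i in range(n):  # deflate: remove the component just found
            for j in range(n):
                g[i][j] -= lam * v[i] * v[j]
    return sigmas, us

def project(us, vec):
    """Map a term-space vector into the k-dimensional latent space (U_k^T x)."""
    return [sum(u[r] * vec[r] for r in range(len(vec))) for u in us]

def cosine(x, y):
    nx = math.sqrt(sum(t * t for t in x))
    ny = math.sqrt(sum(t * t for t in y))
    if nx == 0.0 or ny == 0.0:
        return 0.0
    return sum(p * q for p, q in zip(x, y)) / (nx * ny)

# Toy corpus (hypothetical): two documents per topic.
docs = [
    "gpu cuda threads matrix",
    "cuda gpu kernel threads",
    "tumor brain classification",
    "brain tumor imaging",
]
vocab, a = term_document_matrix(docs)
sigmas, us = truncated_svd(a, k=2)
doc_vecs = [project(us, [a[r][j] for r in range(len(vocab))])
            for j in range(len(docs))]

# Information retrieval: rank documents by cosine similarity to the query.
query = [1.0 if w == "kernel" else 0.0 for w in vocab]
q_vec = project(us, query)
scores = [cosine(q_vec, dv) for dv in doc_vecs]
ranked = sorted(range(len(docs)), key=lambda j: -scores[j])
print(ranked, [round(s, 3) for s in scores])
```

In this toy setup the query term "kernel" appears only in the second document, yet the first document scores just as highly because both share the same latent dimension. That is the effect the dimensionality reduction is meant to capture. The dense loops over the matrix are the kind of numeric work the abstract describes offloading to GPU threads.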
from #AlexandrosSfakianakis via Alexandros G.Sfakianakis on Inoreader http://ift.tt/2y4tClK