Simultaneous use of CPU and GPU to real time inverted index updating in Microblogs

sajad bolhasani

Abstract


Nowadays, with attention to developing the different data networks, the wide masses of data are producing and updating continually. Managing the great data enumerate the fundamental challenges in data mining. One of the considered main subjects in this context is how searching among the wide masses of data. Therefore, require to producing the typical powerful, expansible and efficient file of documents and data for using in search motors is necessary. In this study, with surveying the done prior works, implementing the inverted index with the immediate updating capability from the dynamic and little data of microblogs is targeted. With utilization from processing multicore facility, the approach of the graphical processing unit (GPU) is presented that as expansible and without decreasing the attention, the index file is prepared with suitable speed, as the mentioned file is usable in inquiry unit. This method tries to feed the updating unit continually with separating the operation for the system Central Processing Unit (CPU) and suitable utilization of parallel processing capability of CUDA core. Also, in parallel to increasing the quality, one Hint method is presented for employing the vacant cores and compactor function for decreasing the index file mass. The results indicate that the presence of necessary hardware, the presented method in identity to immediate updating slogan, have the upper speed for making the inverted index of microblogs than to available samples.


Keywords


Inverted index; Microblog; GPU; Update

Full Text:

PDF

References


P. Mudgil, A. K. Sharma, and P. Gupta, "An Improved Indexing Mechanism to Index Web Documents", IEEE, 2013.

R.Konow, G.Navarro, and C. L. A. Clarke, "Faster and Smaller Inverted Indices with Treaps", artially funded by Fondecyt grant 1-110066 , by the Conicyt PhD Scholarship Program, Chile and by the Emerging Leaders in the Americas Program, Government of Canada ACM, 2013.

S. Brin and L. Page, "Reprint of: The anatomy of a large-scale hypertextual web search engine", Computer Networks, 2012.

Z. Wei and J. JaJa, "A fast algorithm for constructing inverted files on heterogeneous platforms", J. Parallel Distrib. Comput, 2012.

N. Grimsmo, "Dynamic indexes vs. static hierarchies for substring search", Trondheim, 2005.

R. A. Baeza-Yates and B. Ribeiro-Neto, "Modern Information Retrieval", Addison-Wesley Longman Publishing Co, Inc., 1999.

C. D. Manning, P. Raghavan, and H. Schütze, "Introduction to Information Retrieval", 2008.

NVIDIA CUDAâ„¢, "NVIDIA CUDA C Programming Guide", Book, www.nvidia.com, 2012

W. Di, Z. Fan, A. Naiyong, W. Fang, L. Jing, and W. Gang, "A Batched GPU Algorithm for Set Intersection", 2009.

Z. Wei and J. JaJa, "A fast algorithm for constructing inverted files on heterogeneous platforms", J. Parallel Distrib. Comput. 2012.

W. Lingkun, L. Wenqing, X. Xiaokui, and X. Yabo, "LSII: An indexing structure for exact real-time search on microblogs", in Data Engineering (ICDE), IEEE 29th International Conference on, 2013.

Q. Bai, C. Ma, and X. Chen, "A new index model based on inverted index", IEEE, 2012.

N. N. Sophoclis, M. Abdeen, E. S. M. El-Horbaty, and M. Yagoub, "A novel approach for indexing Arabic documents through GPU computing", IEEE, 2012.


Refbacks

  • There are currently no refbacks.


ISSN: 1694-2507 (Print)

ISSN: 1694-2108 (Online)