An Efficient Algorithm for Informational Retrieval using Web Usage Mining

AUTHORS

Preeti Rathi,Research ScholarDepartment of Computer ScienceKanya Gurukul CampusDehradun
Nipur Singh,Department of Computer Science Kanya Gurukul Campus Dehradun

ABSTRACT

Retrieval of information from the database and web log files is very time consuming process. There are many techniques and models to retrieve data from web. There are two types of data available in web i.e. structured and unstructured. If data is structured then retrieval of information is easy task. Otherwise firstly apply algorithm to unstructured data and then models will be apply. Vector space and Boolean models are used for IR. In this paper we compare both Boolean model & Vector Space model techniques to retrieval data from web (log files) and proposed a new algorithm based on time, frequency, memory consumption etc..

 

KEYWORDS

Log files, IR, Vector Space Model, Boolean Model, Web, TF, IDF

REFERENCES

[1]     M.François Sy, S.Ranwez, J.Montmain,“User centered and ontology based information Retrieval system for life sciences”, BMC Bioinformatics,2105 DOI: 10.1186/1471-2105-13-S1-S4(CrossRef)(Google Scholar)
[2]     Abdur Rehman, Kashif Javed, Haroon A. Babri and Mehreen Saeed, “Relative discrimination criterion – A novel feature ranking method for text data,” Expert Systems with Applications, Elsevier, Vol.42, No.7, pp. 3670-3681, (2015). DOI: 10.1016/j.eswa.2014.12.013(CrossRef)(Google Scholar)
[3]     Kyu-Hwan Jung and Jaewook Lee, “Probabilistic generative ranking method based on multi-support vector domain description,” Information Sciences, Elsevier, Vol.247 pp 144-153, (2013). DOI: 10.1016/j.ins.2013.05.001(CrossRef)(Google Scholar)
[4]     M.W. Berry, Z. Drmac and E. R. Jessup, “Matrics, Vector Spaces, and Information Retrieval,” Society for Industrial and Applied Mathematics, Vol. 41 No. 2, pp, 335-362, (1999) DOI: 10.1137/S0036144598347035(CrossRef)(Google Scholar)
[5]     J.N. Singh and S.K. Dwivedi, “Analysis of Vector Space Model Information Retrieval”. In Proceedings of the National Conference on Communication Technologies and its Impact on Next Generation computing (CTNGC’ 12), International Journal of Computer Application (IJCA), (2012)
[6]     D. Hiemstra and A.P. De Vires, “Relating the New Language Models of Information Retrieval to the Traditional Retrieval Models”, CTIT Technical Report TR- CTIT- 00- 00, pp, 1-14, (2000).
[7]     Y. Lv and C. Zhai, “Adaptive Relevance Feedback in Information Retrieval”, In Proceeding of CIK ’09, November 2-6, Hong Kong, China, (2009). DOI: 10.1145/1645953.1645988(CrossRef)(Google Scholar)
[8]     Jitendra Nath Singh, “A Comparative Study on Approaches of Vector Space Model in Information Retrieval”, International Journal of Computer Applications (0975-8887), (2013).
[9]     Nicolas Couellan, Sophie Jan, Tom Jorquera and Jean-Pierre George, “Self-adaptive Support Vector Machine: A multi-agent optimization perspective,” Expert Systems with Applications, Elsevier, Vol.42, No.9, pp.1-15, (2015). DOI: 10.1016/j.eswa.2015.01.028(CrossRef)(Google Scholar)
[10]  Han Xu, Eric Martin and Ashesh Mahidadia, “Contents and time sensitive document ranking of scientific literature,” Journal Informetrics, Elsevier, Vol.8, No.3, pp. 546-561, (2014). DOI: 10.1016/j.joi.2014.04.006(CrossRef)(Google Scholar)
[11]  Dr. M. Balamurugan, “A Trend Analysis of Information Retrieval Models”, International Journal of Advanced Research in Computer Science, ISSN No. 0976-5697, Volume 8, No. 5, May-June (2017)
[12]  Kiran Prakash Bachchhav, “Information Retrieval: search process, techniques and strategies”, IJNGLT, Vol.2, No.1, February (2016)
[13]  Srinagnaya G., “A Technical Study on Information Retrieval using Web Mining Techniques”, IEEE Sponsored 2nd International Conference on Innovations in Information Embedded and Communication systems(ICIIECS) (2015). DOI: 10.1109/ICIIECS.2015.7192894(CrossRef)(Google Scholar)
[14]  Sanjay and Dharmender Kumar, “A Review Paper on Page Ranking Algorithms”, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Vol. 4, No.6, June (2015)

CITATION

  • APA:
    Rathi,P. and Singh,N.(2019). An Efficient Algorithm for Informational Retrieval using Web Usage Mining. International Journal of Hybrid Information Technology, 12(2), 13-20. http://dx.doi.org/10.21742/IJHIT.2019.12.2.03
  • Harvard:
    Rathi,P.and Singh,N.(2019). "An Efficient Algorithm for Informational Retrieval using Web Usage Mining". International Journal of Hybrid Information Technology, 12(2), pp.13-20. doi:http://dx.doi.org/10.21742/IJHIT.2019.12.2.03
  • IEEE:
    [1]P. Rathiand N.Singh, "An Efficient Algorithm for Informational Retrieval using Web Usage Mining". International Journal of Hybrid Information Technology, vol.12, no.2, pp.13-20, Nov. 2019
  • MLA:
    Rathi Preeti and Singh Nipur. "An Efficient Algorithm for Informational Retrieval using Web Usage Mining". International Journal of Hybrid Information Technology, vol.12, no.2, Nov. 2019, pp.13-20, doi:http://dx.doi.org/10.21742/IJHIT.2019.12.2.03
 

COPYRIGHT

Creative Commons License
© 2019 Rathi Preeti et al. Published by Global Vision Press. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CCBY4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

ISSUE INFO

  • Volume 12, No. 2, 2019
  • ISSN(p):1738-9968
  • ISSN(o):2652-2233
  • Published:Nov. 2019

DOWNLOAD