Vol. 14 No. 4 (2024): IJCRT, Volume 14, Issue 4, 2024
Review Article

Optimal Keyword Selection by Hybrid Optimization with Itemset Mining for Text Summarization in Biomedical Sector

Dr. Nishant Pachpor
Assistant Professor
Dr. Salim Shaikh
Assoc. Professor, Department of Computer Engineering, Kalsekar Technical Campus, Panvel India.
Dr. Sachin Misal
International Institue of Managment Science Chinchwad Pune
Dr. Ashwini Brahme
International Institue of Managment Science Chinchwad Pune

Published 2024-10-28

Keywords

  • Biomedical Text,
  • Deep Learning,
  • Hybrid Optimization,
  • Graph-based Learning,
  • Recurrent Neural Network,
  • Colliding Bodies Optimization (CBO),
  • Cuckoo Search Optimization (CSO)
  • ...More
    Less

How to Cite

Pachpor, N., Shaikh, S., Misal, S., & Brahme, A. (2024). Optimal Keyword Selection by Hybrid Optimization with Itemset Mining for Text Summarization in Biomedical Sector. IJCRT Research Journal | UGC Approved and UGC Care Journal | Scopus Indexed Journal Norms, 14(4), 50200–50207. https://doi.org/10.61359/IJCRT2024050021

Abstract

Biomedical literature is growing exponentially, creating challenges for researchers and clinicians to access relevant information efficiently. Automatic biomedical text summarization is a promising solution to this issue, allowing the extraction of essential information while reducing redundancy. Traditional summarization techniques often rely on shallow linguistic features or simple term frequencies, which fail to capture the complex relationships in biomedical texts. This paper explores advanced methods for biomedical text summarization, including graph-based approaches, frequent item set mining, and deep learning models such as BERT. The proposed framework introduces a hybrid optimization technique combining Colliding Bodies Optimization (CBO) and Cuckoo Search Optimization (CSO) for optimal keyword selection, alongside a Recurrent Neural Network (RNN) for sentence categorization. Extensive experimentation using UMLS-based concept extraction, keyword selection, and Apriori-based itemset mining demonstrates that the proposed method significantly outperforms existing models in both the in formativeness and coherence of generated summaries. The results reveal that combining deep language models with domain-specific knowledge enhances summarization quality and can be effectively applied to diverse types of biomedical text.