Skip to main content

2024 | OriginalPaper | Buchkapitel

Adapting LLMs for Efficient, Personalized Information Retrieval: Methods and Implications

verfasst von : Samira Ghodratnama, Mehrdad Zakershahrak

Erschienen in: Service-Oriented Computing – ICSOC 2023 Workshops

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The advent of Large Language Models (LLMs) heralds a pivotal shift in online user interactions with information. Traditional Information Retrieval (IR) systems primarily relied on query-document matching, whereas LLMs excel in comprehending and generating human-like text, thereby enriching the IR experience significantly. While LLMs are often associated with chatbot functionalities, this paper extends the discussion to their explicit application in information retrieval. We explore methodologies to optimize the retrieval process, select optimal models, and effectively scale and orchestrate LLMs, aiming for cost-efficiency and enhanced result accuracy. A notable challenge, model hallucination-where the model yields inaccurate or misinterpreted data-is addressed alongside other model-specific hurdles. Our discourse extends to crucial considerations including user privacy, data optimization, and the necessity for system clarity and interpretability. Through a comprehensive examination, we unveil not only innovative strategies for integrating Language Models (LLMs) with Information Retrieval (IR) systems, but also the consequential considerations that underline the need for a balanced approach aligned with user-centric principles.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Extractive document summarization based on dynamic feature space mapping. IEEE Access 8, 139084–139095 (2020)CrossRef Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Extractive document summarization based on dynamic feature space mapping. IEEE Access 8, 139084–139095 (2020)CrossRef
4.
Zurück zum Zitat Beheshti, A., Benatallah, B., Motahari-Nezhad, H.R., Ghodratnama, S., Amouzgar, F.: A query language for summarizing and analyzing business process data. arXiv preprint arXiv:2105.10911 (2021) Beheshti, A., Benatallah, B., Motahari-Nezhad, H.R., Ghodratnama, S., Amouzgar, F.: A query language for summarizing and analyzing business process data. arXiv preprint arXiv:​2105.​10911 (2021)
5.
Zurück zum Zitat Ghodratnama, S., Zakershahrak, M., Beheshti, A.: Summary2vec: learning semantic representation of summaries for healthcare analytics. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021) Ghodratnama, S., Zakershahrak, M., Beheshti, A.: Summary2vec: learning semantic representation of summaries for healthcare analytics. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021)
6.
Zurück zum Zitat Khanna, U., Ghodratnama, S., Beheshti, A., et al.: Transformer-based models for long document summarisation in financial domain. In: Proceedings of the 4th Financial Narrative Processing Workshop@ LREC2022, pp. 73–78 (2022) Khanna, U., Ghodratnama, S., Beheshti, A., et al.: Transformer-based models for long document summarisation in financial domain. In: Proceedings of the 4th Financial Narrative Processing Workshop@ LREC2022, pp. 73–78 (2022)
7.
Zurück zum Zitat Beheshti, A., Ghodratnama, S., Elahi, M., Farhood, H.: Social Data Analytics. CRC Press, Boca Raton (2022)CrossRef Beheshti, A., Ghodratnama, S., Elahi, M., Farhood, H.: Social Data Analytics. CRC Press, Boca Raton (2022)CrossRef
8.
Zurück zum Zitat Duhan, N., Sharma, A., Bhatia, K.K.: Page ranking algorithms: a survey. In: 2009 IEEE International Advance Computing Conference, pp. 1530–1537. IEEE (2009) Duhan, N., Sharma, A., Bhatia, K.K.: Page ranking algorithms: a survey. In: 2009 IEEE International Advance Computing Conference, pp. 1530–1537. IEEE (2009)
9.
Zurück zum Zitat Salton, G., Wong, A., Yang, C.-S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)CrossRef Salton, G., Wong, A., Yang, C.-S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)CrossRef
10.
11.
Zurück zum Zitat Ghodratnama, S., Behehsti, A., Zakershahrak, M.: A personalized reinforcement learning summarization service for learning structure from unstructured data. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 206–213. IEEE (2023) Ghodratnama, S., Behehsti, A., Zakershahrak, M.: A personalized reinforcement learning summarization service for learning structure from unstructured data. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 206–213. IEEE (2023)
12.
Zurück zum Zitat Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Intelligent narrative summaries: from indicative to informative summarization. Big Data Res. 26, 100257 (2021)CrossRef Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Intelligent narrative summaries: from indicative to informative summarization. Big Data Res. 26, 100257 (2021)CrossRef
13.
Zurück zum Zitat Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
14.
Zurück zum Zitat Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
15.
Zurück zum Zitat Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I., et al.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019) Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I., et al.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019)
16.
Zurück zum Zitat Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:​1810.​04805 (2018)
17.
Zurück zum Zitat Zakershahrak, M., Ghodratnama, S.: Are we on the same page? Hierarchical explanation generation for planning tasks in human-robot teaming using reinforcement learning. arXiv preprint arXiv:2012.11792 (2020) Zakershahrak, M., Ghodratnama, S.: Are we on the same page? Hierarchical explanation generation for planning tasks in human-robot teaming using reinforcement learning. arXiv preprint arXiv:​2012.​11792 (2020)
18.
Zurück zum Zitat Lewis, P., et al.: Retrieval-augmented generation for knowledge-intensive NLP tasks. In: Advances in Neural Information Processing Systems, vol. 33, pp. 9459–9474 (2020) Lewis, P., et al.: Retrieval-augmented generation for knowledge-intensive NLP tasks. In: Advances in Neural Information Processing Systems, vol. 33, pp. 9459–9474 (2020)
19.
Zurück zum Zitat Beheshti, A., et al.: ProcessGPT: transforming business process management with generative artificial intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 731–739 (2023) Beheshti, A., et al.: ProcessGPT: transforming business process management with generative artificial intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 731–739 (2023)
20.
Zurück zum Zitat Beheshti, A.: Empowering generative AI with knowledge base 4.0: towards linking analytical, cognitive, and generative intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 763–771 (2023) Beheshti, A.: Empowering generative AI with knowledge base 4.0: towards linking analytical, cognitive, and generative intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 763–771 (2023)
Metadaten
Titel
Adapting LLMs for Efficient, Personalized Information Retrieval: Methods and Implications
verfasst von
Samira Ghodratnama
Mehrdad Zakershahrak
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-97-0989-2_2

Premium Partner