nach oben

Erschienen in:

2024 | OriginalPaper | Buchkapitel

Adapting LLMs for Efficient, Personalized Information Retrieval: Methods and Implications

verfasst von : Samira Ghodratnama, Mehrdad Zakershahrak

Erschienen in: Service-Oriented Computing – ICSOC 2023 Workshops

Verlag: Springer Nature Singapore

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The advent of Large Language Models (LLMs) heralds a pivotal shift in online user interactions with information. Traditional Information Retrieval (IR) systems primarily relied on query-document matching, whereas LLMs excel in comprehending and generating human-like text, thereby enriching the IR experience significantly. While LLMs are often associated with chatbot functionalities, this paper extends the discussion to their explicit application in information retrieval. We explore methodologies to optimize the retrieval process, select optimal models, and effectively scale and orchestrate LLMs, aiming for cost-efficiency and enhanced result accuracy. A notable challenge, model hallucination-where the model yields inaccurate or misinterpreted data-is addressed alongside other model-specific hurdles. Our discourse extends to crucial considerations including user privacy, data optimization, and the necessity for system clarity and interpretability. Through a comprehensive examination, we unveil not only innovative strategies for integrating Language Models (LLMs) with Information Retrieval (IR) systems, but also the consequential considerations that underline the need for a balanced approach aligned with user-centric principles.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction

Nächstes Kapitel Towards Improving Insurance Processes: A Time Series Analysis of Psychosocial Recovery After Workplace Injury Across Legislative Environments

https://www.langchain.com.

https://www.llamaindex.ai.

https://github.com/stanfordnlp/dspy.

https://github.com/explodinggradients/ragas.

https://www.langchain.com/langsmith.

Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Extractive document summarization based on dynamic feature space mapping. IEEE Access 8, 139084–139095 (2020)CrossRef

Ghodratnama, S., Zakershahrak, M., Sobhanmanesh, F.: Am i rare? An intelligent summarization approach for identifying hidden anomalies. In: Hacid, H., et al. (eds.) ICSOC 2020. LNCS, vol. 12632, pp. 309–323. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-76352-7_31CrossRef

Ghodratnama, S., Zakershahrak, M., Sobhanmanesh, F.: Adaptive summaries: a personalized concept-based summarization approach by learning from users’ feedback. In: Hacid, H., et al. (eds.) ICSOC 2020. LNCS, vol. 12632, pp. 281–293. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-76352-7_29CrossRef

Beheshti, A., Benatallah, B., Motahari-Nezhad, H.R., Ghodratnama, S., Amouzgar, F.: A query language for summarizing and analyzing business process data. arXiv preprint arXiv:2105.10911 (2021)

Ghodratnama, S., Zakershahrak, M., Beheshti, A.: Summary2vec: learning semantic representation of summaries for healthcare analytics. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021)

Khanna, U., Ghodratnama, S., Beheshti, A., et al.: Transformer-based models for long document summarisation in financial domain. In: Proceedings of the 4th Financial Narrative Processing Workshop@ LREC2022, pp. 73–78 (2022)

Beheshti, A., Ghodratnama, S., Elahi, M., Farhood, H.: Social Data Analytics. CRC Press, Boca Raton (2022)CrossRef

Duhan, N., Sharma, A., Bhatia, K.K.: Page ranking algorithms: a survey. In: 2009 IEEE International Advance Computing Conference, pp. 1530–1537. IEEE (2009)

Salton, G., Wong, A., Yang, C.-S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)CrossRef

10.

Ghodratnama, S.: Towards personalized and human-in-the-loop document summarization. arXiv preprint arXiv:2108.09443 (2021)

11.

Ghodratnama, S., Behehsti, A., Zakershahrak, M.: A personalized reinforcement learning summarization service for learning structure from unstructured data. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 206–213. IEEE (2023)

12.

Ghodratnama, S., Beheshti, A., Zakershahrak, M., Sobhanmanesh, F.: Intelligent narrative summaries: from indicative to informative summarization. Big Data Res. 26, 100257 (2021)CrossRef

13.

Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef

14.

Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)

15.

Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I., et al.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019)

16.

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

17.

Zakershahrak, M., Ghodratnama, S.: Are we on the same page? Hierarchical explanation generation for planning tasks in human-robot teaming using reinforcement learning. arXiv preprint arXiv:2012.11792 (2020)

18.

Lewis, P., et al.: Retrieval-augmented generation for knowledge-intensive NLP tasks. In: Advances in Neural Information Processing Systems, vol. 33, pp. 9459–9474 (2020)

19.

Beheshti, A., et al.: ProcessGPT: transforming business process management with generative artificial intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 731–739 (2023)

20.

Beheshti, A.: Empowering generative AI with knowledge base 4.0: towards linking analytical, cognitive, and generative intelligence. In: 2023 IEEE International Conference on Web Services (ICWS), pp. 763–771 (2023)

21.

Zheng, L., et al.: Judging LLM-as-a-judge with MT-bench and chatbot arena. arXiv preprint arXiv:2306.05685 (2023)

Titel: Adapting LLMs for Efficient, Personalized Information Retrieval: Methods and Implications
verfasst von: Samira Ghodratnama
Mehrdad Zakershahrak
Verlag: Springer Nature Singapore
Buch: Service-Oriented Computing – ICSOC 2023 Workshops
Print ISBN: 978-981-9709-88-5

Electronic ISBN: 978-981-9709-89-2

Copyright-Jahr: 2024
DOI: https://doi.org/10.1007/978-981-97-0989-2_2

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner