EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search

Autor: Rathinasamy, Kamalkumar, Nettar, Jayarama, Kumar, Amit, Manchanda, Vishal, Vijayakumar, Arun, Kataria, Ayush, Manjunath, Venkateshprasanna, GS, Chidambaram, Sodhi, Jaskirat Singh, Shaikh, Shoeb, Khan, Wasim Akhtar, Singh, Prashant, Ige, Tanishq Dattatray, Tiwari, Vipin, Mondal, Rajab Ali, K, Harshini, Reka, S, Amancharla, Chetana, Rahman, Faiz ur, A, Harikrishnan P, Saha, Indraneel, Tiwary, Bhavya, Patel, Navin Shankar, S, Pradeep T, J, Balaji A, Priyapravas, Tarafdar, Mohammed Rafee
Rok vydání: 2024
Předmět:
Druh dokumentu: Working Paper
Popis: Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a methodology to fine-tune pre-trained embedding models specifically for enterprise environments. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to enhance the performance of information retrieval solutions. We discuss the process of fine-tuning, its effect on retrieval accuracy, and the potential benefits for enterprise information management. Our findings demonstrate the efficacy of fine-tuned embedding models in improving the precision and relevance of search results in enterprise settings.
Databáze: arXiv