Zobrazeno 1 - 10
of 5 441
pro vyhledávání: '"Mohammed, J"'
The advent of large language models (LLMs) has initiated much research into their various financial applications. However, in applying LLMs on long documents, semantic relations are not explicitly incorporated, and a full or arbitrarily sparse attent
Externí odkaz:
http://arxiv.org/abs/2410.02024
Knowledge graph (KG) completion aims to identify additional facts that can be inferred from the existing facts in the KG. Recent developments in this field have explored this task in the inductive setting, where at test time one sees entities that we
Externí odkaz:
http://arxiv.org/abs/2410.00876
Autor:
Mohbat, Fnu, Zaki, Mohammed J.
In the rapidly evolving landscape of online recipe sharing within a globalized context, there has been a notable surge in research towards comprehending and generating food recipes. Recent advancements in large language models (LLMs) like GPT-2 and L
Externí odkaz:
http://arxiv.org/abs/2408.16889
Autor:
Dihan, Fatema Jannat, Murad, Saydul Akbar, Muzahid, Abu Jafar Md, Uddin, K. M. Aslam, Alenazi, Mohammed J. F., Bairagi, Anupam Kumar, Biswas, Sujit
Monkeypox virus (MPXV) is a zoonotic virus that poses a significant threat to public health, particularly in remote parts of Central and West Africa. Early detection of monkeypox lesions is crucial for effective treatment. However, due to its similar
Externí odkaz:
http://arxiv.org/abs/2405.21016
Graph transformers typically lack third-order interactions, limiting their geometric understanding which is crucial for tasks like molecular geometry prediction. We propose the Triplet Graph Transformer (TGT) that enables direct communication between
Externí odkaz:
http://arxiv.org/abs/2402.04538
Autor:
Sakander Hayat, Sunilkumar M. Hosamani, Asad Khan, Ravishankar L. Hutagi, Umesh S. Mujumdar, Mohammed J. F. Alenazi
Publikováno v:
AIMS Mathematics, Vol 9, Iss 9, Pp 24955-24976 (2024)
Regarding a simple graph $ \Gamma $ possessing $ \nu $ vertices ($ \nu $-vertex graph) and $ m $ edges, the vertex-weight and weight of an edge $ e = uv $ are defined as $ w(v_{i}) = d_{ \Gamma}(v_{i}) $ and $ w(e) = d_{ \Gamma}(u)+d_{ \Gamma}(v)-2 $
Externí odkaz:
https://doaj.org/article/894abb87aa9b43449998fc3cb40f93db
Clustering is a widely used unsupervised learning technique involving an intensive discrete optimization problem. Associative Memory models or AMs are differentiable neural networks defining a recursive dynamical system, which have been integrated wi
Externí odkaz:
http://arxiv.org/abs/2306.03209
Transformers use the dense self-attention mechanism which gives a lot of flexibility for long-range connectivity. Over multiple layers of a deep transformer, the number of possible connectivity patterns increases exponentially. However, very few of t
Externí odkaz:
http://arxiv.org/abs/2306.01705
The robustness of a model for real-world deployment is decided by how well it performs on unseen data and distinguishes between in-domain and out-of-domain samples. Visual document classifiers have shown impressive performance on in-distribution test
Externí odkaz:
http://arxiv.org/abs/2305.17219
Autor:
Hoover, Benjamin, Liang, Yuchen, Pham, Bao, Panda, Rameswar, Strobelt, Hendrik, Chau, Duen Horng, Zaki, Mohammed J., Krotov, Dmitry
Publikováno v:
37th Conference on Neural Information Processing Systems (NeurIPS 2023)
Our work combines aspects of three promising paradigms in machine learning, namely, attention mechanism, energy-based models, and associative memory. Attention is the power-house driving modern deep learning successes, but it lacks clear theoretical
Externí odkaz:
http://arxiv.org/abs/2302.07253