Výsledky vyhledávání

Report

MRWeb: An Exploration of Generating Multi-Page Resource-Aware Web Code from UI Designs

Autor: Wan, Yuxuan, Dong, Yi, Xiao, Jingyu, Huo, Yintong, Wang, Wenxuan, Lyu, Michael R.

Multi-page websites dominate modern web development. However, existing design-to-code methods rely on simplified assumptions, limiting to single-page, self-contained webpages without external resource connection. To address this gap, we introduce the

Externí odkaz: http://arxiv.org/abs/2412.15310

Zobrazit plný text záznamu

Report

Dynamic strain sensing using Doppler-shift-immune phase-sensitive OFDR with ultra-weak reflection array and frequency-tracking

Autor: Yang, Qiang, Xie, Weilin, Wang, Congfan, Li, Bowen, Li, Xin, Zheng, Xiang, Wei, Wei, Dong, Yi

In distributed fiber-optic sensing based on optical frequency domain reflectometry (OFDR), Doppler frequency shifts due to the changes of disturbances during one sweep period introduce demodulation errors that accumulate along both the distance and t

Externí odkaz: http://arxiv.org/abs/2410.19368

Zobrazit plný text záznamu

Report

Diverging Preferences: When do Annotators Disagree and do Models Know?

Autor: Zhang, Michael JQ, Wang, Zhilin, Hwang, Jena D., Dong, Yi, Delalleau, Olivier, Choi, Yejin, Choi, Eunsol, Ren, Xiang, Pyatkin, Valentina

We examine diverging preferences in human-labeled preference datasets. We develop a taxonomy of disagreement sources spanning 10 categories across four high-level classes -- task underspecification, response style, refusals, and annotation errors. We

Externí odkaz: http://arxiv.org/abs/2410.14632

Zobrazit plný text záznamu

Report

HelpSteer2-Preference: Complementing Ratings with Preferences

Autor: Wang, Zhilin, Bukharin, Alexander, Delalleau, Olivier, Egert, Daniel, Shen, Gerald, Zeng, Jiaqi, Kuchaiev, Oleksii, Dong, Yi

Reward models are critical for aligning models to follow instructions, and are typically trained following one of two popular paradigms: Bradley-Terry style or Regression style. However, there is a lack of evidence that either approach is better than

Externí odkaz: http://arxiv.org/abs/2410.01257

Zobrazit plný text záznamu

Report

Adaptive Guardrails For Large Language Models via Trust Modeling and In-Context Learning

Autor: Hu, Jinwei, Dong, Yi, Huang, Xiaowei

Guardrails have become an integral part of Large language models (LLMs), by moderating harmful or toxic response in order to maintain LLMs' alignment to human expectations. However, the existing guardrail methods do not consider different needs and a

Externí odkaz: http://arxiv.org/abs/2408.08959

Zobrazit plný text záznamu

Report

Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach

Autor: Wan, Yuxuan, Wang, Chaozheng, Dong, Yi, Wang, Wenxuan, Li, Shuqing, Huo, Yintong, Lyu, Michael R.

Websites are critical in today's digital world, with over 1.11 billion currently active and approximately 252,000 new sites launched daily. Converting website layout design into functional UI code is a time-consuming yet indispensable step of website

Externí odkaz: http://arxiv.org/abs/2406.16386

Zobrazit plný text záznamu

Report

Nemotron-4 340B Technical Report

Autor: Nvidia, Adler, Bo, Agarwal, Niket, Aithal, Ashwath, Anh, Dong H., Bhattacharya, Pallab, Brundyn, Annika, Casper, Jared, Catanzaro, Bryan, Clay, Sharon, Cohen, Jonathan, Das, Sirshak, Dattagupta, Ayush, Delalleau, Olivier, Derczynski, Leon, Dong, Yi, Egert, Daniel, Evans, Ellie, Ficek, Aleksander, Fridman, Denys, Ghosh, Shaona, Ginsburg, Boris, Gitman, Igor, Grzegorzek, Tomasz, Hero, Robert, Huang, Jining, Jawa, Vibhu, Jennings, Joseph, Jhunjhunwala, Aastha, Kamalu, John, Khan, Sadaf, Kuchaiev, Oleksii, LeGresley, Patrick, Li, Hui, Liu, Jiwei, Liu, Zihan, Long, Eileen, Mahabaleshwarkar, Ameya Sunil, Majumdar, Somshubra, Maki, James, Martinez, Miguel, de Melo, Maer Rodrigues, Moshkov, Ivan, Narayanan, Deepak, Narenthiran, Sean, Navarro, Jesus, Nguyen, Phong, Nitski, Osvald, Noroozi, Vahid, Nutheti, Guruprasad, Parisien, Christopher, Parmar, Jupinder, Patwary, Mostofa, Pawelec, Krzysztof, Ping, Wei, Prabhumoye, Shrimai, Roy, Rajarshi, Saar, Trisha, Sabavat, Vasanth Rao Naik, Satheesh, Sanjeev, Scowcroft, Jane Polak, Sewall, Jason, Shamis, Pavel, Shen, Gerald, Shoeybi, Mohammad, Sizer, Dave, Smelyanskiy, Misha, Soares, Felipe, Sreedhar, Makesh Narsimhan, Su, Dan, Subramanian, Sandeep, Sun, Shengyang, Toshniwal, Shubham, Wang, Hao, Wang, Zhilin, You, Jiaxuan, Zeng, Jiaqi, Zhang, Jimmy, Zhang, Jing, Zhang, Vivienne, Zhang, Yian, Zhu, Chen

We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distri

Externí odkaz: http://arxiv.org/abs/2406.11704

Zobrazit plný text záznamu

Report

HelpSteer2: Open-source dataset for training top-performing reward models

Autor: Wang, Zhilin, Dong, Yi, Delalleau, Olivier, Zeng, Jiaqi, Shen, Gerald, Egert, Daniel, Zhang, Jimmy J., Sreedhar, Makesh Narsimhan, Kuchaiev, Oleksii

High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences. As LLMs become stronger and better aligned, permiss

Externí odkaz: http://arxiv.org/abs/2406.08673

Zobrazit plný text záznamu

Report

All-sky Guide Star Catalog for CSST

The China Space Station Telescope (CSST) is a two-meter space telescope with multiple back-end instruments. The Fine Guidance Sensor (FGS) is an essential subsystem of the CSST Precision Image Stability System to ensure the required absolute pointing

Externí odkaz: http://arxiv.org/abs/2406.00972

Zobrazit plný text záznamu

Report

Safeguarding Large Language Models: A Survey

Autor: Dong, Yi, Mu, Ronghui, Zhang, Yanghao, Sun, Siqi, Zhang, Tianle, Wu, Changshun, Jin, Gaojie, Qi, Yi, Hu, Jinwei, Meng, Jie, Bensalem, Saddek, Huang, Xiaowei

In the burgeoning field of Large Language Models (LLMs), developing a robust safety mechanism, colloquially known as "safeguards" or "guardrails", has become imperative to ensure the ethical use of LLMs within prescribed boundaries. This article prov

Externí odkaz: http://arxiv.org/abs/2406.02622

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání