Zobrazeno 1 - 10
of 136 593
pro vyhledávání: '"Bang IS"'
Autor:
Le, Bang Giang, Ta, Viet Cuong
In this work, we study the problem of finding Pareto optimal policies in multi-agent reinforcement learning problems with cooperative reward structures. We show that any algorithm where each agent only optimizes their reward is subject to suboptimal
Externí odkaz:
http://arxiv.org/abs/2410.19372
Autor:
Wang, Xiaoqiang, Liu, Bang
Large language models (LLMs) and large multimodal models (LMMs) have shown great potential in automating complex tasks like web browsing and gaming. However, their ability to generalize across diverse applications remains limited, hindering broader u
Externí odkaz:
http://arxiv.org/abs/2410.18963
Autor:
You, Bang, Liu, Huaping
Publikováno v:
Neural Networks, 176(2024)
Reinforcement learning has achieved promising results on robotic control tasks but struggles to leverage information effectively from multiple sensory modalities that differ in many characteristics. Recent works construct auxiliary losses based on re
Externí odkaz:
http://arxiv.org/abs/2410.17551
Autor:
XLZD Collaboration, Aalbers, J., Abe, K., Adrover, M., Maouloud, S. Ahmed, Akerib, D. S., Musalhi, A. K. Al, Alder, F., Althueser, L., Amaral, D. W. P., Amarasinghe, C. S., Ames, A., Andrieu, B., Angelides, N., Angelino, E., Antunovic, B., Aprile, E., Araújo, H. M., Armstrong, J. E., Arthurs, M., Babicz, M., Bajpai, D., Baker, A., Balzer, M., Bang, J., Barberio, E., Bargemann, J. W., Barillier, E., Basharina-Freshville, A., Baudis, L., Bauer, D., Bazyk, M., Beattie, K., Beaupere, N., Bell, N. F., Bellagamba, L., Benson, T., Bhatti, A., Biesiadzinski, T. P., Biondi, R., Biondi, Y., Birch, H. J., Bishop, E., Bismark, A., Boehm, C., Boese, K., Bolotnikov, A., Brás, P., Braun, R., Breskin, A., Brew, C. A. J., Brommer, S., Brown, A., Bruni, G., Budnik, R., Burdin, S., Cai, C., Capelli, C., Carini, G., Carmona-Benitez, M. C., Carter, M., Chauvin, A., Chawla, A., Chen, H., Cherwinka, J. J., Chin, Y. T., Chott, N. I., Chavez, A. P. Cimental, Clark, K., Colijn, A. P., Colling, D. J., Conrad, J., Converse, M. V., Coronel, R., Costanzo, D., Cottle, A., Cox, G., Cuenca-García, J. J., Curran, D., Cussans, D., D'Andrea, V., Garcia, L. C. Daniel, Darlington, I., Dave, S., David, A., Davies, G. J., Decowski, M. P., Deisting, A., Delgaudio, J., Dey, S., Di Donato, C., Di Felice, L., Di Gangi, P., Diglio, S., Ding, C., Dobson, J. E. Y., Doerenkamp, M., Drexlin, G., Druszkiewicz, E., Dunbar, C. L., Eitel, K., Elykov, A., Engel, R., Eriksen, S. R., Fayer, S., Fearon, N. M., Ferella, A. D., Ferrari, C., Fieldhouse, N., Fischer, H., Flaecher, H., Flehmke, T., Flierman, M., Fraser, E. D., Fruth, T. M. A., Fujikawa, K., Fulgione, W., Fuselli, C., Gaemers, P., Gaior, R., Gaitskell, R. J., Gallice, N., Galloway, M., Gao, F., Garroum, N., Geffre, A., Genovesi, J., Ghag, C., Ghosh, S., Giacomobono, R., Gibbons, R., Girard, F., Glade-Beucke, R., Glück, F., Gokhale, S., Grandi, L., Green, J., Grigat, J., van der Grinten, M. G. D., Größle, R., Guan, H., Guida, M., Gyorgy, P., Haiston, J. J., Hall, C. R., Hall, T., Hammann, R., Hannen, V., Hansmann-Menzemer, S., Hargittai, N., Hartigan-O'Connor, E., Haselschwardt, S. J., Hernandez, M., Hertel, S. A., Higuera, A., Hils, C., Hiraoka, K., Hoetzsch, L., Hoferichter, M., Homenides, G. J., Hood, N. F., Horn, M., Huang, D. Q., Hughes, S., Hunt, D., Iacovacci, M., Itow, Y., Jacquet, E., Jakob, J., James, R. S., Joerg, F., Jones, S., Kaboth, A. C., Kahlert, F., Kamaha, A. C., Kaminaga, Y., Kara, M., Kavrigin, P., Kazama, S., Keller, M., Kemp-Russell, P., Khaitan, D., Kharbanda, P., Kilminster, B., Kim, J., Kirk, R., Kleifges, M., Klute, M., Kobayashi, M., Kodroff, D., Koke, D., Kopec, A., Korolkova, E. V., Kraus, H., Kravitz, S., Kreczko, L., von Krosigk, B., Kudryavtsev, V. A., Kuger, F., Kurita, N., Landsman, H., Lang, R. F., Lawes, C., Lee, J., Lehnert, B., Leonard, D. S., Lesko, K. T., Levinson, L., Li, A., Li, I., Li, S., Liang, S., Liang, Z., Lin, J., Lin, Y. -T., Lindemann, S., Linden, S., Lindner, M., Lindote, A., Lippincott, W. H., Liu, K., Loizeau, J., Lombardi, F., Lopes, J. A. M., Lopes, M. I., Lorenzon, W., Loutit, M., Lu, C., Lucchetti, G. M., Luce, T., Luitz, S., Ma, Y., Macolino, C., Mahlstedt, J., Maier, B., Majewski, P. A., Manalaysay, A., Mancuso, A., Manenti, L., Mannino, R. L., Marignetti, F., Marley, T., Undagoitia, T. Marrodán, Martens, K., Masbou, J., Masson, E., Mastroianni, S., Maupin, C., McCabe, C., McCarthy, M. E., McKinsey, D. N., McLaughlin, J. B., Melchiorre, A., Menéndez, J., Messina, M., Miller, E. H., Milosovic, B., Milutinovic, S., Miuchi, K., Miyata, R., Mizrachi, E., Molinario, A., Monteiro, C. M. B., Monzani, M. E., Morå, K., Moriyama, S., Morrison, E., Morteau, E., Mosbacher, Y., Mount, B. J., Müller, J., Murdy, M., Murphy, A. St. J., Murra, M., Naylor, A., Nelson, H. N., Neves, F., Newstead, J. L., Nguyen, A., Ni, K., O'Hare, C., Oberlack, U., Obradovic, M., Olcina, I., Oliver-Mallory, K. C., Gann, G. D. Orebi, Orpwood, J., Ostrowskiy, I., Ouahada, S., Oyulmaz, K., Paetsch, B., Palladino, K. J., Palmer, J., Pan, Y., Pandurovic, M., Pannifer, N. J., Paramesvaran, S., Patton, S. J., Pellegrini, Q., Penning, B., Pereira, G., Peres, R., Perry, E., Pershing, T., Piastra, F., Pienaar, J., Piepke, A., Pierre, M., Plante, G., Pollmann, T. R., Principe, L., Qi, J., Qiao, K., Qie, Y., Qin, J., Radeka, S., Radeka, V., Rajado, M., García, D. Ramírez, Ravindran, A., Razeto, A., Reichenbacher, J., Rhyne, C. A., Richards, A., Rischbieter, G. R. C., Riyat, H. S., Rosero, R., Roy, A., Rushton, T., Rynders, D., Saakyan, R., Sanchez, L., Sanchez-Lucas, P., Santone, D., Santos, J. M. F. dos, Sartorelli, G., Sazzad, A. B. M. R., Scaffidi, A., Schnee, R. W., Schreiner, J., Schulte, P., Schulze, H., Eißing, Schumann, M., Schwenck, A., Schwenk, A., Lavina, L. Scotto, Selvi, M., Semeria, F., Shagin, P., Sharma, S., Shaw, S., Shen, W., Sherman, L., Shi, S., Shi, S. Y., Shimada, T., Shutt, T., Silk, J. J., Silva, C., Simgen, H., Sinev, G., Singh, R., Siniscalco, J., Solmaz, M., Solovov, V. N., Song, Z., Sorensen, P., Soria, J., Stanley, O., Steidl, M., Stenhouse, T., Stevens, A., Stifter, K., Sumner, T. J., Takeda, A., Tan, P. -L., Taylor, D. J., Taylor, W. C., Thers, D., Thümmler, T., Tiedt, D. R., Tönnies, F., Tong, Z., Toschi, F., Tovey, D. R., Tranter, J., Trask, M., Trinchero, G., Tripathi, M., Tronstad, D. R., Trotta, R., Tunnell, C. D., Urquijo, P., Usón, A., Utoyama, M., Vaitkus, A. C., Valentino, O., Valerius, K., Vecchi, S., Velan, V., Vetter, S., de Viveiros, L., Volta, G., Vorkapic, D., Wang, A., Wang, J. J., Wang, W., Wang, Y., Waters, D., Weerman, K. M., Weinheimer, C., Weiss, M., Wenz, D., Whitis, T. J., Wild, K., Williams, M., Wilson, M., Wilson, S. T., Wittweg, C., Wolf, J., Wolfs, F. L. H., Woodford, S., Woodward, D., Worcester, M., Wright, C. J., Wu, V. H. S., üstling, S. W, Wurm, M., Xia, Q., Xing, Y., Xu, D., Xu, J., Xu, Y., Xu, Z., Yamashita, M., Yang, L., Ye, J., Yeh, M., Yu, B., Zavattini, G., Zha, W., Zhong, M., Zuber, K.
The XLZD collaboration is developing a two-phase xenon time projection chamber with an active mass of 60 to 80 t capable of probing the remaining WIMP-nucleon interaction parameter space down to the so-called neutrino fog. In this work we show that,
Externí odkaz:
http://arxiv.org/abs/2410.19016
Lithium-ion batteries (LIBs) are utilized as a major energy source in various fields because of their high energy density and long lifespan. During repeated charging and discharging, the degradation of LIBs, which reduces their maximum power output a
Externí odkaz:
http://arxiv.org/abs/2410.16749
Autor:
Chi, Yizhou, Lin, Yizhang, Hong, Sirui, Pan, Duyi, Fei, Yaying, Mei, Guanghao, Liu, Bangbang, Pang, Tianqi, Kwok, Jacky, Zhang, Ceyao, Liu, Bang, Wu, Chenglin
Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown
Externí odkaz:
http://arxiv.org/abs/2410.17238
Autor:
XLZD Collaboration, Aalbers, J., Abe, K., Adrover, M., Maouloud, S. Ahmed, Akerib, D. S., Musalhi, A. K. Al, Alder, F., Althueser, L., Amaral, D. W. P., Amarasinghe, C. S., Ames, A., Andrieu, B., Angelides, N., Angelino, E., Antunovic, B., Aprile, E., Araújo, H. M., Armstrong, J. E., Arthurs, M., Babicz, M., Bajpai, D., Baker, A., Balzer, M., Bang, J., Barberio, E., Bargemann, J. W., Barillier, E., Basharina-Freshville, A., Baudis, L., Bauer, D., Bazyk, M., Beattie, K., Beaupere, N., Bell, N. F., Bellagamba, L., Benson, T., Bhatti, A., Biesiadzinski, T. P., Biondi, R., Biondi, Y., Birch, H. J., Bishop, E., Bismark, A., Boehm, C., Boese, K., Bolotnikov, A., Brás, P., Braun, R., Breskin, A., Brew, C. A. J., Brommer, S., Brown, A., Bruni, G., Budnik, R., Burdin, S., Cai, C., Capelli, C., Carini, G., Carmona-Benitez, M. C., Carter, M., Chauvin, A., Chawla, A., Chen, H., Cherwinka, J. J., Chin, Y. T., Chott, N. I., Chavez, A. P. Cimental, Clark, K., Colijn, A. P., Colling, D. J., Conrad, J., Converse, M. V., Coronel, R., Costanzo, D., Cottle, A., Cox, G., Cuenca-García, J. J., Curran, D., Cussans, D., D'Andrea, V., Garcia, L. C. Daniel, Darlington, I., Dave, S., David, A., Davies, G. J., Decowski, M. P., Deisting, A., Delgaudio, J., Dey, S., Di Donato, C., Di Felice, L., Di Gangi, P., Diglio, S., Ding, C., Dobson, J. E. Y., Doerenkamp, M., Drexlin, G., Druszkiewicz, E., Dunbar, C. L., Eitel, K., Elykov, A., Engel, R., Eriksen, S. R., Fayer, S., Fearon, N. M., Ferella, A. D., Ferrari, C., Fieldhouse, N., Fischer, H., Flaecher, H., Flehmke, T., Flierman, M., Fraser, E. D., Fruth, T. M. A., Fujikawa, K., Fulgione, W., Fuselli, C., Gaemers, P., Gaior, R., Gaitskell, R. J., Gallice, N., Galloway, M., Gao, F., Garroum, N., Geffre, A., Genovesi, J., Ghag, C., Ghosh, S., Giacomobono, R., Gibbons, R., Girard, F., Glade-Beucke, R., Glück, F., Gokhale, S., Grandi, L., Green, J., Grigat, J., van der Grinten, M. G. D., Größle, R., Guan, H., Guida, M., Gyorgy, P., Haiston, J. J., Hall, C. R., Hall, T., Hammann, R., Hannen, V., Hansmann-Menzemer, S., Hargittai, N., Hartigan-O'Connor, E., Haselschwardt, S. J., Hernandez, M., Hertel, S. A., Higuera, A., Hils, C., Hiraoka, K., Hoetzsch, L., Hoferichter, M., Homenides, G. J., Hood, N. F., Horn, M., Huang, D. Q., Hughes, S., Hunt, D., Iacovacci, M., Itow, Y., Jacquet, E., Jakob, J., James, R. S., Joerg, F., Jones, S., Kaboth, A. C., Kahlert, F., Kamaha, A. C., Kaminaga, Y., Kara, M., Kavrigin, P., Kazama, S., Keller, M., Kemp-Russell, P., Khaitan, D., Kharbanda, P., Kilminster, B., Kim, J., Kirk, R., Kleifges, M., Klute, M., Kobayashi, M., Kodroff, D., Koke, D., Kopec, A., Korolkova, E. V., Kraus, H., Kravitz, S., Kreczko, L., von Krosigk, B., Kudryavtsev, V. A., Kuger, F., Kurita, N., Landsman, H., Lang, R. F., Lawes, C., Lee, J., Lehnert, B., Leonard, D. S., Lesko, K. T., Levinson, L., Li, A., Li, I., Li, S., Liang, S., Liang, Z., Lin, J., Lin, Y. -T., Lindemann, S., Linden, S., Lindner, M., Lindote, A., Lippincott, W. H., Liu, K., Loizeau, J., Lombardi, F., Lopes, J. A. M., Lopes, M. I., Lorenzon, W., Loutit, M., Lu, C., Lucchetti, G. M., Luce, T., Luitz, S., Ma, Y., Macolino, C., Mahlstedt, J., Maier, B., Majewski, P. A., Manalaysay, A., Mancuso, A., Manenti, L., Mannino, R. L., Marignetti, F., Marley, T., Undagoitia, T. Marrodán, Martens, K., Masbou, J., Masson, E., Mastroianni, S., Maupin, C., McCabe, C., McCarthy, M. E., McKinsey, D. N., McLaughlin, J. B., Melchiorre, A., Menéndez, J., Messina, M., Miller, E. H., Milosovic, B., Milutinovic, S., Miuchi, K., Miyata, R., Mizrachi, E., Molinario, A., Monteiro, C. M. B., Monzani, M. E., Morå, K., Moriyama, S., Morrison, E., Morteau, E., Mosbacher, Y., Mount, B. J., Müller, J., Murdy, M., Murphy, A. St. J., Murra, M., Naylor, A., Nelson, H. N., Neves, F., Newstead, J. L., Nguyen, A., Ni, K., O'Hare, C., Oberlack, U., Obradovic, M., Olcina, I., Oliver-Mallory, K. C., Gann, G. D. Orebi, Orpwood, J., Ostrowskiy, I., Ouahada, S., Oyulmaz, K., Paetsch, B., Palladino, K. J., Palmer, J., Pan, Y., Pandurovic, M., Pannifer, N. J., Paramesvaran, S., Patton, S. J., Pellegrini, Q., Penning, B., Pereira, G., Peres, R., Perry, E., Pershing, T., Piastra, F., Pienaar, J., Piepke, A., Pierre, M., Plante, G., Pollmann, T. R., Principe, L., Qi, J., Qiao, K., Qie, Y., Qin, J., Radeka, S., Radeka, V., Rajado, M., García, D. Ramírez, Ravindran, A., Razeto, A., Reichenbacher, J., Rhyne, C. A., Richards, A., Rischbieter, G. R. C., Riyat, H. S., Rosero, R., Roy, A., Rushton, T., Rynders, D., Saakyan, R., Sanchez, L., Sanchez-Lucas, P., Santone, D., Santos, J. M. F. dos, Sartorelli, G., Sazzad, A. B. M. R., Scaffidi, A., Schnee, R. W., Schreiner, J., Schulte, P., Schulze, H., Eißing, Schumann, M., Schwenck, A., Schwenk, A., Lavina, L. Scotto, Selvi, M., Semeria, F., Shagin, P., Sharma, S., Shaw, S., Shen, W., Sherman, L., Shi, S., Shi, S. Y., Shimada, T., Shutt, T., Silk, J. J., Silva, C., Simgen, H., Sinev, G., Singh, R., Siniscalco, J., Solmaz, M., Solovov, V. N., Song, Z., Sorensen, P., Soria, J., Stanley, O., Steidl, M., Stenhouse, T., Stevens, A., Stifter, K., Sumner, T. J., Takeda, A., Tan, P. -L., Taylor, D. J., Taylor, W. C., Thers, D., Thümmler, T., Tiedt, D. R., Tönnies, F., Tong, Z., Toschi, F., Tovey, D. R., Tranter, J., Trask, M., Trinchero, G., Tripathi, M., Tronstad, D. R., Trotta, R., Tunnell, C. D., Urquijo, P., Usón, A., Utoyama, M., Vaitkus, A. C., Valentino, O., Valerius, K., Vecchi, S., Velan, V., Vetter, S., de Viveiros, L., Volta, G., Vorkapic, D., Wang, A., Wang, J. J., Wang, W., Wang, Y., Waters, D., Weerman, K. M., Weinheimer, C., Weiss, M., Wenz, D., Whitis, T. J., Wild, K., Williams, M., Wilson, M., Wilson, S. T., Wittweg, C., Wolf, J., Wolfs, F. L. H., Woodford, S., Woodward, D., Worcester, M., Wright, C. J., Wu, V. H. S., üstling, S. W, Wurm, M., Xia, Q., Xing, Y., Xu, D., Xu, J., Xu, Y., Xu, Z., Yamashita, M., Yang, L., Ye, J., Yeh, M., Yu, B., Zavattini, G., Zha, W., Zhong, M., Zuber, K.
This report describes the experimental strategy and technologies for a next-generation xenon observatory sensitive to dark matter and neutrino physics. The detector will have an active liquid xenon target mass of 60-80 tonnes and is proposed by the X
Externí odkaz:
http://arxiv.org/abs/2410.17137
Autor:
Aalbers, J., Akerib, D. S., Musalhi, A. K. Al, Alder, F., Amarasinghe, C. S., Ames, A., Anderson, T. J., Angelides, N., Araújo, H. M., Armstrong, J. E., Arthurs, M., Baker, A., Balashov, S., Bang, J., Bargemann, J. W., Barillier, E. E., Bauer, D., Beattie, K., Benson, T., Bhatti, A., Biekert, A., Biesiadzinski, T. P., Birch, H. J., Bishop, E., Blockinger, G. M., Boxer, B., Brew, C. A. J., Brás, P., Burdin, S., Buuck, M., Carmona-Benitez, M. C., Carter, M., Chawla, A., Chen, H., Cherwinka, J. J., Chin, Y. T., Chott, N. I., Converse, M. V., Coronel, R., Cottle, A., Cox, G., Curran, D., Dahl, C. E., Darlington, I., Dave, S., David, A., Delgaudio, J., Dey, S., de Viveiros, L., Di Felice, L., Ding, C., Dobson, J. E. Y., Druszkiewicz, E., Dubey, S., Eriksen, S. R., Fan, A., Fayer, S., Fearon, N. M., Fieldhouse, N., Fiorucci, S., Flaecher, H., Fraser, E. D., Fruth, T. M. A., Gaitskell, R. J., Geffre, A., Genovesi, J., Ghag, C., Ghosh, A., Gibbons, R., Gokhale, S., Green, J., van der Grinten, M. G. D., Haiston, J. J., Hall, C. R., Hall, T. J., Han, S., Hartigan-O'Connor, E., Haselschwardt, S. J., Hernandez, M. A., Hertel, S. A., Heuermann, G., Homenides, G. J., Horn, M., Huang, D. Q., Hunt, D., Jacquet, E., James, R. S., Johnson, J., Kaboth, A. C., Kamaha, A. C., K., Meghna K., Khaitan, D., Khazov, A., Khurana, I., Kim, J., Kim, Y. D., Kingston, J., Kirk, R., Kodroff, D., Korley, L., Korolkova, E. V., Kraus, H., Kravitz, S., Kreczko, L., Kudryavtsev, V. A., Lawes, C., Leonard, D. S., Lesko, K. T., Levy, C., Lin, J., Lindote, A., Lippincott, W. H., Lopes, M. I., Lorenzon, W., Lu, C., Luitz, S., Majewski, P. A., Manalaysay, A., Mannino, R. L., Maupin, C., McCarthy, M. E., McDowell, G., McKinsey, D. N., McLaughlin, J., McLaughlin, J. B., McMonigle, R., Mizrachi, E., Monte, A., Monzani, M. E., Mendoza, J. D. Morales, Morrison, E., Mount, B. J., Murdy, M., Murphy, A. St. J., Naylor, A., Nelson, H. N., Neves, F., Nguyen, A., O'Brien, C. L., Olcina, I., Oliver-Mallory, K. C., Orpwood, J., Oyulmaz, K. Y, Palladino, K. J., Palmer, J., Pannifer, N. J., Parveen, N., Patton, S. J., Penning, B., Pereira, G., Perry, E., Pershing, T., Piepke, A., Qie, Y., Reichenbacher, J., Rhyne, C. A., Richards, A., Riffard, Q., Rischbieter, G. R. C., Ritchey, E., Riyat, H. S., Rosero, R., Rushton, T., Rynders, D., Santone, D., Sazzad, A. B. M. R., Schnee, R. W., Sehr, G., Shafer, B., Shaw, S., Shutt, T., Silk, J. J., Silva, C., Sinev, G., Siniscalco, J., Smith, R., Solovov, V. N., Sorensen, P., Soria, J., Stancu, I., Stevens, A., Stifter, K., Suerfu, B., Sumner, T. J., Szydagis, M., Tiedt, D. R., Timalsina, M., Tong, Z., Tovey, D. R., Tranter, J., Trask, M., Tripathi, M., Usón, A., Vacheret, A., Vaitkus, A. C., Valentino, O., Velan, V., Wang, A., Wang, J. J., Wang, Y., Watson, J. R., Weeldreyer, L., Whitis, T. J., Wild, K., Williams, M., Wisniewski, W. J., Wolf, L., Wolfs, F. L. H., Woodford, S., Woodward, D., Wright, C. J., Xia, Q., Xu, J., Xu, Y., Yeh, M., Yeum, D., Zha, W., Zweig, E. A.
We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from
Externí odkaz:
http://arxiv.org/abs/2410.17036
Autor:
Zhang, Jiayi, Xiang, Jinyu, Yu, Zhaoyang, Teng, Fengwei, Chen, Xionghui, Chen, Jiaqi, Zhuge, Mingchen, Cheng, Xin, Hong, Sirui, Wang, Jinlin, Zheng, Bingnan, Liu, Bang, Luo, Yuyu, Wu, Chenglin
Large language models (LLMs) have demonstrated remarkable potential in solving complex tasks across diverse domains, typically by employing agentic workflows that follow detailed instructions and operational sequences. However, constructing these wor
Externí odkaz:
http://arxiv.org/abs/2410.10762
Autor:
Xu, Yuancheng, Sehwag, Udari Madhushani, Koppel, Alec, Zhu, Sicheng, An, Bang, Huang, Furong, Ganesh, Sumitra
Large Language Models (LLMs) exhibit impressive capabilities but require careful alignment with human preferences. Traditional training-time methods finetune LLMs using human preference datasets but incur significant training costs and require repeat
Externí odkaz:
http://arxiv.org/abs/2410.08193