Zobrazeno 1 - 10
of 55
pro vyhledávání: '"Massaroli, Stefano"'
Autor:
Parnichkun, Rom N., Massaroli, Stefano, Moro, Alessandro, Smith, Jimmy T. H., Hasani, Ramin, Lechner, Mathias, An, Qi, Ré, Christopher, Asama, Hajime, Ermon, Stefano, Suzuki, Taiji, Yamashita, Atsushi, Poli, Michael
We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms
Externí odkaz:
http://arxiv.org/abs/2405.06147
Autor:
Poli, Michael, Thomas, Armin W, Nguyen, Eric, Ponnusamy, Pragaash, Deiseroth, Björn, Kersting, Kristian, Suzuki, Taiji, Hie, Brian, Ermon, Stefano, Ré, Christopher, Zhang, Ce, Massaroli, Stefano
The development of deep learning architectures is a resource-demanding process, due to a vast design space, long prototyping times, and high compute costs associated with at-scale model training and evaluation. We set out to simplify this process by
Externí odkaz:
http://arxiv.org/abs/2403.17844
Autor:
Massaroli, Stefano, Poli, Michael, Fu, Daniel Y., Kumbong, Hermann, Parnichkun, Rom N., Timalsina, Aman, Romero, David W., McIntyre, Quinn, Chen, Beidi, Rudra, Atri, Zhang, Ce, Re, Christopher, Ermon, Stefano, Bengio, Yoshua
Recent advances in attention-free sequence models rely on convolutions as alternatives to the attention operator at the core of Transformers. In particular, long convolution sequence models have achieved state-of-the-art performance in many domains,
Externí odkaz:
http://arxiv.org/abs/2310.18780
While complex simulations of physical systems have been widely used in engineering and scientific computing, lowering their often prohibitive computational requirements has only recently been tackled by deep learning approaches. In this paper, we pre
Externí odkaz:
http://arxiv.org/abs/2310.16397
Autor:
Nguyen, Eric, Poli, Michael, Faizi, Marjan, Thomas, Armin, Birch-Sykes, Callum, Wornow, Michael, Patel, Aman, Rabideau, Clayton, Massaroli, Stefano, Bengio, Yoshua, Ermon, Stefano, Baccus, Stephen A., Ré, Chris
Genomic (DNA) sequences encode an enormous amount of information for gene regulation and protein synthesis. Similar to natural language models, researchers have proposed foundation models in genomics to learn generalizable features from unlabeled gen
Externí odkaz:
http://arxiv.org/abs/2306.15794
Autor:
Boussif, Oussama, Boukachab, Ghait, Assouline, Dan, Massaroli, Stefano, Yuan, Tianle, Benabbou, Loubna, Bengio, Yoshua
Solar power harbors immense potential in mitigating climate change by substantially reducing CO$_{2}$ emissions. Nonetheless, the inherent variability of solar irradiance poses a significant challenge for seamlessly integrating solar power into the e
Externí odkaz:
http://arxiv.org/abs/2306.01112
We present a methodology for formulating simplifying abstractions in machine learning systems by identifying and harnessing the utility structure of decisions. Machine learning tasks commonly involve high-dimensional output spaces (e.g., predictions
Externí odkaz:
http://arxiv.org/abs/2303.17062
Autor:
Poli, Michael, Massaroli, Stefano, Nguyen, Eric, Fu, Daniel Y., Dao, Tri, Baccus, Stephen, Bengio, Yoshua, Ermon, Stefano, Ré, Christopher
Recent advances in deep learning have relied heavily on the use of large Transformers due to their ability to learn at scale. However, the core building block of Transformers, the attention operator, exhibits quadratic cost in sequence length, limiti
Externí odkaz:
http://arxiv.org/abs/2302.10866
Methods based on ordinary differential equations (ODEs) are widely used to build generative models of time-series. In addition to high computational overhead due to explicitly computing hidden states recurrence, existing ODE-based models fall short i
Externí odkaz:
http://arxiv.org/abs/2212.12749
Autor:
Poli, Michael, Massaroli, Stefano, Berto, Federico, Park, Jinykoo, Dao, Tri, Ré, Christopher, Ermon, Stefano
Spectral analysis provides one of the most effective paradigms for information-preserving dimensionality reduction, as simple descriptions of naturally occurring signals are often obtained via few terms of periodic basis functions. In this work, we s
Externí odkaz:
http://arxiv.org/abs/2211.14453