Showing 1 - 10 of 82 for search: '"Akiba, Takuya"'
Training large language models to acquire specific skills remains a challenging endeavor. Conventional training approaches often struggle with data distribution imbalances and inadequacies in objective functions that do not align well with task-specific performance. …
External link:
http://arxiv.org/abs/2410.14735
We present a novel application of evolutionary algorithms to automate the creation of powerful foundation models. While model merging has emerged as a promising approach for LLM development due to its cost-effectiveness, it currently relies on human intuition and domain knowledge, limiting its potential. … (An illustrative weight-space merging sketch follows the link below.)
External link:
http://arxiv.org/abs/2403.13187
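A hedged sketch of the underlying idea, not the paper's method: merge two checkpoints by linear interpolation in weight space and pick the mixing coefficient with a search procedure. Plain random search stands in for the evolutionary algorithm here, and evaluate() and the two state dicts are hypothetical placeholders.

# Hedged sketch: linear weight-space merging of two models, with the mixing
# coefficient picked by random search (a stand-in for the paper's evolutionary
# search). `evaluate` and the state dicts are hypothetical placeholders.
import random

def merge_state_dicts(sd_a, sd_b, w):
    # Interpolate parameters key by key: w * A + (1 - w) * B.
    return {k: w * sd_a[k] + (1.0 - w) * sd_b[k] for k in sd_a}

def evaluate(state_dict):
    # Placeholder: load the merged weights into a model and return a
    # higher-is-better validation score.
    raise NotImplementedError

def search_merge(sd_a, sd_b, trials=20):
    best_w, best_score = 0.5, float("-inf")
    for _ in range(trials):
        w = random.uniform(0.0, 1.0)               # candidate mixing coefficient
        score = evaluate(merge_state_dicts(sd_a, sd_b, w))
        if score > best_score:
            best_w, best_score = w, score
    return merge_state_dicts(sd_a, sd_b, best_w)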
Author:
Niitani, Yusuke, Ogawa, Toru, Suzuki, Shuji, Akiba, Takuya, Kerola, Tommi, Ozaki, Kohei, Sano, Shotaro
We present the instance segmentation and the object detection method used by team PFDet for Open Images Challenge 2019. We tackle a massive dataset size, huge class imbalance and federated annotations. Using this method, the team PFDet achieved 3rd and 4th place in the instance segmentation and the object detection tracks, respectively. …
External link:
http://arxiv.org/abs/1910.11534
Author:
Tokui, Seiya, Okuta, Ryosuke, Akiba, Takuya, Niitani, Yusuke, Ogawa, Toru, Saito, Shunta, Suzuki, Shuji, Uenishi, Kota, Vogel, Brian, Vincent, Hiroyuki Yamazaki
Software frameworks for neural networks play a key role in the development and application of deep learning methods. In this paper, we introduce the Chainer framework, which intends to provide a flexible, intuitive, and high performance means of implementing the full range of deep learning models needed by researchers and practitioners. … (A minimal define-by-run example follows the link below.)
External link:
http://arxiv.org/abs/1908.00213
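Chainer's define-by-run style, which the snippet alludes to, builds the computation graph while the forward pass executes. A minimal sketch, assuming Chainer is installed; the layer sizes are arbitrary:

# Minimal Chainer define-by-run model; layer sizes are arbitrary examples.
import chainer
import chainer.functions as F
import chainer.links as L

class MLP(chainer.Chain):
    def __init__(self):
        super().__init__()
        with self.init_scope():
            self.l1 = L.Linear(None, 100)   # input size inferred on first call
            self.l2 = L.Linear(100, 10)

    def __call__(self, x):
        # The graph is recorded as this Python code runs, so ordinary
        # control flow (if/for) can change the network per input.
        h = F.relu(self.l1(x))
        return self.l2(h)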
The purpose of this study is to introduce new design-criteria for next-generation hyperparameter optimization software. The criteria we propose include (1) define-by-run API that allows users to construct the parameter search space dynamically, (2) efficient implementation of both searching and pruning strategies, and (3) easy-to-setup, versatile architecture. … (A minimal usage example follows the link below.)
External link:
http://arxiv.org/abs/1907.10902
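Criterion (1), the define-by-run API, is easiest to see in code. A minimal sketch with a toy objective, assuming a reasonably recent Optuna:

# Minimal Optuna study: the search space is declared inside the objective
# (define-by-run), so it can depend on earlier suggestions or control flow.
import optuna

def objective(trial):
    x = trial.suggest_float("x", -10.0, 10.0)
    use_offset = trial.suggest_categorical("use_offset", [True, False])
    offset = trial.suggest_float("offset", 0.0, 1.0) if use_offset else 0.0
    return (x - 2.0) ** 2 + offset       # value to minimize

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=100)
print(study.best_params)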
Recomputation algorithms collectively refer to a family of methods that aims to reduce the memory consumption of backpropagation by selectively discarding the intermediate results of the forward propagation and recomputing the discarded results as needed. … (An illustrative checkpointing example follows the link below.)
External link:
http://arxiv.org/abs/1905.11722
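As a concrete illustration of the trade-off the snippet describes (framework-level checkpointing, not the paper's own graph-theoretic algorithm), PyTorch's checkpoint utility drops a segment's activations in the forward pass and recomputes them during backward; the use_reentrant flag assumes a recent PyTorch version:

# Recomputation illustrated with PyTorch checkpointing (not the paper's
# algorithm): activations inside `block` are discarded after the forward
# pass and recomputed when backward() needs them.
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

block = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 512))
x = torch.randn(64, 512, requires_grad=True)

y = checkpoint(block, x, use_reentrant=False)  # forward without storing intermediates
loss = y.sum()
loss.backward()   # `block` runs again here to rebuild the discarded activations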
Efficient and reliable methods for training object detectors are in higher demand than ever, and more and more data relevant to the field is becoming available. However, large datasets like Open Images Dataset v4 (OID) are sparsely annotated, and …
External link:
http://arxiv.org/abs/1811.10862
We present a large-scale object detection system by team PFDet. Our system enables training with huge datasets using 512 GPUs, and handles sparsely verified classes and massive class imbalance. Using our method, we achieved 2nd place in the Google AI Open Images Object Detection Track. …
External link:
http://arxiv.org/abs/1809.00778
Author:
Kurakin, Alexey, Goodfellow, Ian, Bengio, Samy, Dong, Yinpeng, Liao, Fangzhou, Liang, Ming, Pang, Tianyu, Zhu, Jun, Hu, Xiaolin, Xie, Cihang, Wang, Jianyu, Zhang, Zhishuai, Ren, Zhou, Yuille, Alan, Huang, Sangxia, Zhao, Yao, Zhao, Yuzhe, Han, Zhonglin, Long, Junjiajia, Berdibekov, Yerkebulan, Akiba, Takuya, Tokui, Seiya, Abe, Motoki
To accelerate research on adversarial examples and robustness of machine learning classifiers, Google Brain organized a NIPS 2017 competition that encouraged researchers to develop new methods to generate adversarial examples as well as to develop new ways to defend against them. … (An illustrative baseline attack follows the link below.)
External link:
http://arxiv.org/abs/1804.00097
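The snippet assumes familiarity with how adversarial examples are generated; the fast gradient sign method (FGSM) is a minimal baseline attack of the kind such competitions build on. A generic sketch, not a competition entry; `model` is any differentiable classifier:

# FGSM: perturb the input in the direction that increases the loss.
import torch
import torch.nn.functional as F

def fgsm(model, x, y, eps=0.03):
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    x_adv = x + eps * x.grad.sign()        # one signed-gradient step
    return x_adv.clamp(0.0, 1.0).detach()  # keep pixels in a valid range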
Due to the substantial computational cost, training state-of-the-art deep neural networks for large-scale datasets often requires distributed training using multiple computation workers. However, by nature, workers need to frequently communicate gradients, causing severe bottlenecks. … (An illustrative gradient-compression sketch follows the link below.)
External link:
http://arxiv.org/abs/1802.06058
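A simple, generic form of the gradient compression the snippet motivates is top-k sparsification (illustrative only; the paper proposes a variance-based criterion instead):

# Generic top-k gradient sparsification: keep only the largest-magnitude
# entries, so workers exchange k values plus indices instead of dense tensors.
import torch

def topk_compress(grad, ratio=0.01):
    flat = grad.flatten()
    k = max(1, int(flat.numel() * ratio))
    _, indices = torch.topk(flat.abs(), k)
    sparse = torch.zeros_like(flat)
    sparse[indices] = flat[indices]
    return sparse.view_as(grad)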