Výsledky vyhledávání

Report

Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry

Autor: Kurt, Yunus Bilge, Akman, Ahmet, Alatan, A. Aydın

In recent years, transformer-based architectures become the de facto standard for sequence modeling in deep learning frameworks. Inspired by the successful examples, we propose a causal visual-inertial fusion transformer (VIFT) for pose estimation in

Externí odkaz: http://arxiv.org/abs/2409.08769

Zobrazit plný text záznamu

Report

IG-SLAM: Instant Gaussian SLAM

Autor: Sarikamis, F. Aykut, Alatan, A. Aydin

3D Gaussian Splatting has recently shown promising results as an alternative scene representation in SLAM systems to neural implicit representations. However, current methods either lack dense depth maps to supervise the mapping process or detailed t

Externí odkaz: http://arxiv.org/abs/2408.01126

Zobrazit plný text záznamu

Report

XoFTR: Cross-modal Feature Matching Transformer

Autor: Tuzcuoğlu, Önder, Köksal, Aybora, Sofu, Buğra, Kalkan, Sinan, Alatan, A. Aydın

We introduce, XoFTR, a cross-modal cross-view method for local feature matching between thermal infrared (TIR) and visible images. Unlike visible images, TIR images are less susceptible to adverse lighting and weather conditions but present difficult

Externí odkaz: http://arxiv.org/abs/2404.09692

Zobrazit plný text záznamu

Akademický článek

A geospatial dataset of lichen key attributes in the Earth’s three poles

Autor: Zhula Alatan, Wenjin Wu, Xinwu Li, Liqing Zhao, Huadong Guo, Jinfeng Li, Chengzhi Hao

Publikováno v: Scientific Data, Vol 11, Iss 1, Pp 1-11 (2024)

Abstract In the Antarctic, Arctic, and Tibetan Plateau—recognized as the Earth’s three poles characterized by extremely harsh environments—lichens prevail in the ecosystem and play crucial roles as pioneer species. Despite their importance, stu

Externí odkaz: https://doaj.org/article/17a22d247166476eb33a36d09ec4e648

Zobrazit plný text záznamu

Report

Knowledge Distillation Layer that Lets the Student Decide

Autor: Gorgun, Ada, Gurbuz, Yeti Z., Alatan, A. Aydin

Typical technique in knowledge distillation (KD) is regularizing the learning of a limited capacity model (student) by pushing its responses to match a powerful model's (teacher). Albeit useful especially in the penultimate layer and beyond, its acti

Externí odkaz: http://arxiv.org/abs/2309.02843

Zobrazit plný text záznamu

Report

Generalized Sum Pooling for Metric Learning

Autor: Gurbuz, Yeti Z., Sener, Ozan, Alatan, A. Aydın

A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness o

Externí odkaz: http://arxiv.org/abs/2308.09228

Zobrazit plný text záznamu

Report

Generalizable Embeddings with Cross-batch Metric Learning

Autor: Gurbuz, Yeti Z., Alatan, A. Aydin

Global average pooling (GAP) is a popular component in deep metric learning (DML) for aggregating features. Its effectiveness is often attributed to treating each feature vector as a distinct semantic entity and GAP as a combination of them. Albeit s

Externí odkaz: http://arxiv.org/abs/2307.07620

Zobrazit plný text záznamu

Report

MAEVI: Motion Aware Event-Based Video Frame Interpolation

Autor: Akman, Ahmet, Kılıç, Onur Selim, Alatan, A. Aydın

Utilization of event-based cameras is expected to improve the visual quality of video frame interpolation solutions. We introduce a learning-based method to exploit moving region boundaries in a video sequence to increase the overall interpolation qu

Externí odkaz: http://arxiv.org/abs/2303.02025

Zobrazit plný text záznamu

Report

Feature Embedding by Template Matching as a ResNet Block

Autor: Gorgun, Ada, Gurbuz, Yeti Z., Alatan, A. Aydin

Convolution blocks serve as local feature extractors and are the key to success of the neural networks. To make local semantic feature embedding rather explicit, we reformulate convolution blocks as feature selection according to the best matching ke

Externí odkaz: http://arxiv.org/abs/2210.00992

Zobrazit plný text záznamu

Report

E-VFIA : Event-Based Video Frame Interpolation with Attention

Autor: Kılıç, Onur Selim, Akman, Ahmet, Alatan, A. Aydın

Video frame interpolation (VFI) is a fundamental vision task that aims to synthesize several frames between two consecutive original video images. Most algorithms aim to accomplish VFI by using only keyframes, which is an ill-posed problem since the

Externí odkaz: http://arxiv.org/abs/2209.09359

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání