MLinfo | Machine Learning and AI Paper Digest

Perturbative Contrastive Physical Learning

Responses to perturbations are key to understanding physical systems. The ability to contrast such responses b

Use: Classification
Difficulty: Hard
Cost: Low

Hybrid Robustness Verification for Spatio-Temporal Neural Networks

With AI increasingly deployed in safety-critical systems, providing formal robustness guarantees for the under

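Deep learning, Transformer, Classification, Video, 3D

Use: Classification
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Generative AI, Diffusion models, Classification, Generation, Supervised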

Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles

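Proposes a framework for evaluating strength inference in diffusion generative models. To assess the representational and generative capabilities of diffusion models, it splits features into invariant and extraneous components and introduces the concept of invariance contamination.

Use: Strength inference
Difficulty: Hard
Cost: High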

What the Eyes See, the LLMs Miss: Exploiting Human Perception for Adversarial Text Attacks

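Content moderation systems that run on large language models (LLMs) play an important role in preventing harmful online content. However, the main goal of these systems is simply to operate on tokenized text

Natural language processing, Large language models, Classification, Detection, Image

Use: Document classification
Difficulty: Hard
Cost: High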

Transition-Based Digital Twin Modelling for Alzheimer's Disease under Sparse Longitudinal Data

Alzheimer's disease (AD) progression is highly heterogeneous and is typically observed through sparse and irre

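Explainable, Deep learning, Model compression and quantization, Classification, Generation, Prediction

Use: Classification
Difficulty: Hard
Cost: High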

Muon Learns More Robust and Transferable Features than Adam

Muon has recently emerged as a state-of-the-art optimizer for pretraining Large Language Models (LLMs) and vis

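Deep learning, Transformer, Classification, Image, Text

Use: Classification
Difficulty: Hard
Cost: High

Explainable, Deep learning, Transformer, Classification, Supervised, Self-supervised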

Integrating gene regulatory priors into Transformer attention with scTransformer for interpretable scRNA-seq analysis

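Proposes a Transformer that improves the interpretation of scRNA-seq data. Shows that incorporating gene regulatory information into the model improves both the interpretation of gene expression and prediction accuracy.

Use: A Transformer that improves the interpretation of scRNA-seq data
Difficulty: Hard
Cost: High

Explainable, Mathematics and theory, Interpretability (XAI), Classification, Detection, Image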

SAILS: Surrogate-based Analysis of Interactions via Local Effect Smooths

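This study presents Surrogate-based Analysis of Interactions via Local Effect Smooths (SAILS), which detects interactions between structures and estimates functional interactions

Use: Detecting functional interactions between structures
Difficulty: Hard
Cost: Low

Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Prediction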

A Universal Dense Football Event Representation Based on TabTransformer

Football event data constitute a rich spatiotemporal source for quantitative analysis of player actions in tea

Use: Football event
Difficulty: Hard
Cost: High

Explainable, Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Segmentation, Text

Intention Driven Identification of In-Possession Match Phases in Association Football through Temporal Graph Learning

Understanding tactical organisation of association football, hereafter referred to as football, requires ident

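Use: Classification
Difficulty: Hard
Cost: Low

Quality prediction/anomaly detection, Deep learning, Graph neural networks, Classification, Detection, Anomaly detection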

Beyond Convolution: Advancing Hypergraph Neural Networks with Hypergraph U-Nets

Convolutions have successfully transitioned from image processing to the complex realm of non-Euclidean higher

Use: Classification
Difficulty: Hard
Cost: Low

Beyond Neural Collapse: Task-Intrinsic Geometry Governs Neural Representations in Modular Arithmetic

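Using modular arithmetic makes it possible to use memory optimally, which leads to improved performance.

Deep learning, Normalization and optimization methods, Classification

Use: Machine learning of modular arithmetic for memory optimization
Difficulty: Hard
Cost: High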

A systematic investigation of molecular encoding methods for drug property predictions across neural network and Transformer encoder-based model

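Building machine learning models for molecular design enables efficient synthesis and can contribute substantially to fields such as drug development.

Explainable, Deep learning, Transformer, Classification

Use: Machine learning models for molecular design
Difficulty: Hard
Cost: Low

Explainable, Quality prediction/anomaly detection, Deep learning, Transformer, Classification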

Backward Coherence and Hidden-State Stability in Recurrent Neural Networks: A Quasi-Reverse-Martingale Theory

Estimating the stability of hidden states in recurrent neural networks makes it possible to advance network inference.

Use: Estimating hidden-state stability in recurrent neural networks
Difficulty: Hard
Cost: Low

Oversight Has a Capacity: Calibrating Agent Guards to a Subjective, Fatiguing Human

As LLM agents begin to take real, irreversible actions (shell commands, file edits, deploys), the standard saf

Natural language processing, Large language models, Classification

Use: Agent
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-08

Few-shot Class-variable Incremental Audio Classification via Prototype Adaptation and Pseudo Class-variable Training

In the task of few-shot class-incremental audio classification, the number of classes is assumed to always inc

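Suited to small data, Natural language processing, RAG, Classification, Audio

Use: Classification
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Detection, Text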

ATN3D: Density-Aware LiDAR-Radar Early 3D Object Detection Under Extreme Sparsity

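Perception for automated vehicles, such as self-driving cars and intelligent transportation systems, requires 3D object detection. Long-range detection on roads is difficult, yet at this "long range" the time available for perception and decision-making is about 1-2 seconds. Two main challenges

Use: 3D object detection for long-range perception in vehicles
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Embeddings, Self-supervised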

Next-Token Prediction Learns Generalisable Representations of Sleep Physiology

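By compressing multimodal physiological signals into compact representations of human health, foundation models are opening the way to broad applications in sleep medicine, cardiology, and neurology. Existing models are typically trained with masked reconstruction or contrastive objectives.

Use: Learning the physiological characteristics of sleep
Difficulty: Hard
Cost: High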

SecureClaw: Clawing Back Control of LLM Agents

Tool-using large language model (LLM) agents face two distinct security failures: unauthorized external action

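Natural language processing, Large language models, Classification, Text

Use: Classification
Difficulty: Hard
Cost: High

Suited to MI, Quality prediction/anomaly detection, Computer vision, Multimodal, Classification, Detection, Image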

Context-Aware Deep Learning for Defect Classification in Atomic-Resolution STEM

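Proposes context-aware deep learning for non-destructive inspection of materials, detecting defects in atomic-resolution STEM images.

Use: Non-destructive inspection of materials
Difficulty: Hard
Cost: High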

Real-time body pose non-verbal communication with a consistency-based reliability measure

Body movement communicates intent at distances and in conditions where neither the face, nor speech can be cap

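Machine learning, Unsupervised learning, Classification, Prediction, Text

Use: Classification
Difficulty: Hard
Cost: Low

Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Image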

Beyond Humans: Multispecies Animal Face Recognition Using Transfer Learning

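Proposes a method for identifying animals from face images across different species.

Use: Recognizing individual lost pets and members of protected species
Difficulty: Hard
Cost: Low

Suited to small data, Deep learning, Transformer, Classification, Detection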

Proposal Refinement for Few-Shot Object Detection

In few-shot object detection, the accuracy of object proposals can be improved.

Use: Proposals for the few-shot problem in object detection
Difficulty: Hard
Cost: High

End-to-End Training for Discrete Token LLM based TTS System

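Proposes a TTS system trained end to end and confirms the benefits of end-to-end training.

Natural language processing, Large language models, Classification, Generation, Text

Use: Proposing a TTS system with end-to-end training
Difficulty: Hard
Cost: High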

Reliable to Expressive: A Curriculum for Rubric-Following Safety Judges

Safety judges are increasingly deployed to evaluate model outputs against evolving criteria, yet recent meta-e

Natural language processing, Large language models, Classification

Use: Classification
Difficulty: Hard
Cost: High

Vision Language Model Helps Private Information De-Identification in Vision Data

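Vision-language models (VLMs) are highly effective at privacy protection. However, the privacy risks of handling visual data had previously received little attention. Using VLMs to ensure privacy protection

Computer vision, Object detection, Classification, Detection, Image

Use: Privacy protection for visual data using vision-language models
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-08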

An Enhanced Geometric-Spectral Feature Learning Framework for Airborne Multispectral Point Cloud Classification

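Airborne multispectral point clouds (MPCs) provide data that combines 3D spatial and spectral information, but classifying such point clouds has been a difficult problem, so a new learning framework is proposed.

Deep learning, Transformer, Classification, 3D

Use: Classification of airborne multispectral point clouds
Difficulty: Hard
Cost: High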

REFLECT: Intervention-Supported Error Attribution for Silent Failures in LLM Agent Traces

Large language model (LLM) agents now solve complex tasks through long plan-and-execution traces, yet the abil

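Natural language processing, Large language models, Classification, Detection, Text

Use: Classification
Difficulty: Hard
Cost: High

Suited to small data, Suited to tabular data, Natural language processing, Large language models, Classification, Generation, Regression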

LATTEArena: An Evaluation Framework for LLM-powered Tabular Feature Engineering (Extended Version)

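LLMs have made it possible to automate feature engineering in tabular data analysis. However, the lack of a standardized platform makes comparison and cost evaluation difficult. Because of complex method designs, the specific contribution of each component cannot be clearly

Use: Comparative evaluation of LLM paradigms for tabular data analysis
Difficulty: Hard
Cost: High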

A multi-agent system for spine MRI report generation from multi-sequence imaging

Spinal pathology is a leading cause of pain and disability worldwide. Spine MRI is central to clinical evaluat

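Explainable, Natural language processing, Embeddings and retrieval, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: High

Suited to MI, Quality prediction/anomaly detection, Natural language processing, Fine-tuning, Classification, Generation, Text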

Quality-Diversity Search in Sound Generation: Investigating Innovation Engines for Audio Exploration

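This study develops an open-source framework for promoting diversity in music generation. To support that goal, the framework combines evolutionary processes with diversity-promoting algorithms

Use: Promoting diversity in music generation
Difficulty: Hard
Cost: Low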

DECSELFMASK: Leveraging Unlabeled Text via Self-Relevance-Guided Masking for Decoder-Only Classification

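Classification becomes difficult when little prior information is available or in specialized domains such as medicine. This study proposes DecSelfMask, a method in which the model works with unlabeled data to improve classifier performance.

Natural language processing, RAG, Classification, Generation, Text

Use: Improving performance on classification tasks
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Natural language processing, Large language models, Classification, Segmentation, Text

arxiv, GitHub available, 2026-06-08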

MUDIDI: A Two-Stage Framework for Multilingual Dictionary Digitization with Language Models

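Digitizing dictionaries of low-resource and extinct languages is important, but digitizing multimodal dictionaries has so far been difficult. This study uses recent vision-language models to make dictionary digitization easier, and the text in the dictionary

Use: Multilingual dictionary digitization
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Computer vision, Multimodal, Classification, Image, Text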

Guide Me Out: A Framework to Benchmark VLM Operators Communication in Crisis Scenarios

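In crisis management, communication and geographic

Use: Evaluating communication in crisis management
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Text, Audio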

Is Text All You Need? Text as a Universal Information Bottleneck for Speech LLMs

Large language models (LLMs) provide a powerful reasoning backbone for speech understanding, but integrating c

Use: Classification
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Image, Text

NüshuVoice: Reviving the Voice of Endangered Nüshu with Pitch-Aware Text-to-Speech

Nüshu is an endangered phonetic script historically used by women in Jiangyong County, southern Hunan, China.

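Use: Classification
Difficulty: Hard
Cost: Low

Suited to tabular data, Quality prediction/anomaly detection, Natural language processing, RAG, Classification, QA, Image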

ChinaHeritaQA: A Culturally-Grounded Visual Question Answering Dataset for World Heritage Sites in China

We introduce ChinaHeritaQA, a multimodal benchmark dataset for evaluating the cultural reasoning abilities of

Use: Classification
Difficulty: Hard
Cost: High

End-to-End Optimization of Incoherent Imaging for Classification Under Detector-Limited Readout

End-to-end co-optimization of optical front-ends (e.g. metasurfaces) and neural network back-ends has been wid

Computer vision, Segmentation, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: Low

GenEyePose: Patient-Free, Knowledge-Based Saccadic Eye Movement Modeling for Digital Neurophysiologic Biomarker Development

Eye movements, including saccades, are widely regarded as highly sensitive and objective biomarkers of neuroph

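Deep learning, Transformer, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Normalization and optimization methods, Classification, Detection, Segmentation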

Adversarial Attack and Disturbance Detection by Hadamard-Coded Output Representations for Object Detection and Semantic Segmentation

Conventional one-hot encodings often yield poorly calibrated models, being overconfident under attack, and let

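Use: Classification
Difficulty: Hard
Cost: Low

Suited to tabular data, Easy to try on CPU, Natural language processing, RAG, Classification, Detection, Anomaly detection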

Securing Self-supervised Data Curation for Foundation Models Robustness

Self-supervised data curation provides a pathway to scaling and improving the generalization capabilities of m

Use: Classification
Difficulty: Hard
Cost: High

Optical Music Recognition for Real-World Manuscripts with Synthetic Data

Optical Music Recognition (OMR) has seen major progress in model design, with end-to-end methods now capable o

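MLOps, Model deployment, Classification, Generation, Image

Use: Classification
Difficulty: Hard
Cost: High

Suited to small data, Natural language processing, Prompt engineering, Classification, Segmentation, Image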

Training-Free Generalized Few-Shot Segmentation through Open-Vocabulary Semantic Arbitration

Generalized Few-Shot Semantic Segmentation (GFSS) has traditionally been approached as a representation-learni

Use: Classification
Difficulty: Hard
Cost: High

Explainable, Deep learning, Transformer, Classification, Detection, Image

Leveraging Morphology for Historical Script Metrological Analysis

Advances in handwritten text recognition have enabled large-scale transcription of historical documents, but s

Use: Classification
Difficulty: Hard
Cost: High

vesselFM-CT: Segmenting All Blood Vessels in CT Images for System-Level Cardiovascular Analysis

The vascular network in the human body is characterized by blood vessels exhibiting drastic structural variati

Computer vision, 3D and point clouds, Classification, Generation, Image

Use: Classification
Difficulty: Hard
Cost: High

ExDet: Open-Domain Open-Vocabulary Detection with Cross-modal Extrapolation and Rectification

Open-domain open-vocabulary detection (ODOVD) requires detectors to generalize to both novel categories and un

Deep learning, Model compression and quantization, Classification, Detection, Image

Use: Classification
Difficulty: Hard
Cost: High

Taming Perception Jitter: Uncertainty-Aware LiDAR Object Detection for Reliable Motion Classification

Reliable motion classification is critical for autonomous driving, as false dynamic predictions of static obje

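Deep learning, Model compression and quantization, Classification, Detection, 3D

Use: Classification
Difficulty: Hard
Cost: High

Deep learning, Normalization and optimization methods, Classification, Generation, Segmentation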

Reason Twice: Segmentation via Candidate Discovery and Comparative Reasoning

The rapid development of pretrained foundation models has enabled more general image segmentation. Multimodal

Use: Classification
Difficulty: Hard
Cost: High

Self-supervised Learning Matters: A Simple Ensemble Solution for Micro-Gesture Recognition

In this paper, we present XInsight Lab's solution to the micro-gesture classification track of the 4th MiGA Ch

Natural language processing, Fine-tuning, Classification, Embeddings, Video

Use: Classification
Difficulty: Hard
Cost: High

TeamHerald@CHIPSAL 2026: Hate Speech Detection and Sentiment Analysis of Nepali Memes using Transformer-based Architectures and Ensemble Learning

The analysis of internet memes in the Nepali language is complicated by frequent code-mixing and a lack of est

Deep learning, Transformer, Classification, Detection, Image

Use: Classification
Difficulty: Hard
Cost: Low

A Geometric Measure of Linear Separability for Neural Representations

Modern neural classifiers commonly rely on linear readouts, yet predictive metrics alone do not characterize t

Natural language processing, Embeddings and retrieval, Classification

Use: Classification
Difficulty: Hard
Cost: Low

SNR-ST-Mix: Sample-specific Neighborhood Regression Mixup for Augmented Spatial Transcriptomics Imputation with Deep Neural Network

Purpose: Spatial transcriptomics (ST) enables gene expression measurements within the tissue context. However,

Deep learning, Model compression and quantization, Classification, Regression, Text

Use: Classification
Difficulty: Hard
Cost: High

Speaker-Invariant Representation Learning for Spoofing Detection via Gradient Reversal and A Variational Information Bottleneck

Sophisticated generative speech technology can undermine the reliability of voice biometrics. While spoofing

Suited to tabular data, Natural language processing, RAG, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: Low

A Comparison of SSL-Based Feature Extractors and Back-End Classifiers for Spoofing Detection: A Multi-Corpus Training and Cross-Linguistic Analysis

Voice biometric systems face growing threats from spoofing attacks, yet the evaluation of detection models rem

Deep learning, CNN, Classification, Detection, Text

Use: Classification
Difficulty: Hard
Cost: High

How Much Capacity Does EEG Denoising Need? Ultra-Compact Networks reveal Benchmark Saturation and Metric-Utility Gap

Deep learning EEG denoising architectures have scaled from tens of thousands to tens of millions of parameters

Use: Classification
Difficulty: Hard
Cost: High

Explainable, Sensor/time series, Machine learning, Supervised learning, Classification, Detection, Image

A spectral audit framework reveals task-dependent aperiodic reliance across EEG and ECG deep learning

Deep learning on physiological time series is interpreted through domain-specific features -- oscillatory rhyt

Use: Classification
Difficulty: Hard
Cost: Low

Sensor/time series, Natural language processing, Large language models, Classification, Text, Audio

Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

Speech emotion recognition (SER) is commonly formulated as utterance-level classification, although conversati

Use: Classification
Difficulty: Hard
Cost: High

Intelligent Character Recognition of Handwritten Forms with Deep Neural Networks

The automatic processing of handwritten forms remains a challenging task, wherein detection and subsequent cla

Machine learning, Supervised learning, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: High

Hybrid E-Assessment in Higher Education: Semi-Automated Grading of Paper-Based Written Examinations

This paper examines the limitations of fully digital and partially digital e-assessment approaches in summativ

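Natural language processing, Large language models, Classification, Text

Use: Classification
Difficulty: Hard
Cost: High

Suited to tabular data, Explainable, Natural language processing, Large language models, Classification, Detection, Generation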

Bridging Expert Knowledge and Automated Feature Engineering via Self-Evolution

In high-stakes settings such as brand compliance, clinical care, and content moderation, machine learning cann

Use: Classification
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Generation, Text

arxiv, GitHub available, 2026-06-07

Can LLMs understand LilyPond? A benchmark for symbolic music generation and understanding

Symbolic music evaluation for large language models remains fragmented across representations, datasets, and m

Use: Classification
Difficulty: Hard
Cost: High

Operationalizing Linguistic Methods through Prompt-Engineering Skills: An Automatic Chinese Web Neologism Detection Pipeline

We present a method for automatic Chinese web neologism detection that operationalizes traditional linguistic

Natural language processing, Large language models, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: High

Lost in the Flow with Code Talkers: Unveiling the Instruction-Tuning Tax of Large Language Models in Code Tasks

AI coding assistants have significantly improved developer productivity by automatically suggesting code that

Deep learning, Transformer, Classification, Generation, Text

Use: Classification
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-07

Multilingual Fact-Checking at Scale: Fine-Tuned Compact Models vs LLMs

We present a multilingual fact-checking system deployed at Factiverse, designed for high-throughput and low-la

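Deep learning, Transformer, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Detection, Generation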

TRADE: Transducer-Augmented Decoder for Speech LLM

Speech Large Language Models (Speech LLMs) lack a principled mechanism for streaming inference: their label-sy

Use: Classification
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-07

Vision-Language Work Zone Intelligence for Safety-Critical Speed Regulation of Mixed-Autonomy Vehicles in Dynamic Environments

Temporary work-zone speed limits are communicated through visually inconsistent signage and are often missing

Computer vision, Object detection, Classification, Detection, Image

Use: Classification
Difficulty: Hard
Cost: High

Classifying galaxies in the Galaxy10 DECals dataset using Inception and Residual CNNs

Image data regarding galactic morphology is expected to increase both in quantity and quality for the next for

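Quality prediction/anomaly detection, Deep learning, CNN, Classification, Image

Use: Classification
Difficulty: Hard
Cost: Low

Quality prediction/anomaly detection, Natural language processing, RAG, Classification, Detection, Segmentation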

PairWise Image Finder: An Open-source Tool for Finding Visually Aligned Street-Level Image Pairs for Urban Perception Studies

Change detection and scene recognition techniques have been widely applied to Street View Imagery (SVI) to und

Use: Classification
Difficulty: Hard
Cost: Low

SSAFE: Simple and Strong AI-Generated Image Detection via Frozen Vision Encoders

The rapid advancement of generative models has blurred the boundary between synthetic and real imagery, creati

Natural language processing, Fine-tuning, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: High

Facial Expression Recognition in the Deep Learning Era: A Systematic Multi-Criteria Review of Methods, Models, Datasets, Performance, Challenges, and Future Research Directions

Facial Expression Recognition (FER) has advanced rapidly over the last decade, driven by the shift from handcr

Deep learning, CNN, Classification, Multimodal

Use: Classification
Difficulty: Hard
Cost: High

When Video Misreads: Closed-Loop Distillation of Reading Heuristics for Exploratory Manipulation Trace QA

Exploratory manipulation often turns an apparent failed attempt into the key evidence for what to do next. For

Deep learning, Model compression and quantization, Classification, Video, Multimodal

Use: Classification
Difficulty: Hard
Cost: High

Chiaroscuro Attention: Spending Compute in the Dark

Standard transformers apply self-attention uniformly at every layer and token, regardless of whether the input

Deep learning, Transformer, Classification, Text

Use: Classification
Difficulty: Hard
Cost: Low

Cross Paraphrastic Invariance Learning for Hallucination Detection

Large language models (LLMs) frequently generate hallucinations, which are unsupported by a source document. T

Deep learning, Model compression and quantization, Classification, Detection, Text

Use: Classification
Difficulty: Hard
Cost: High

What's the Point? Spatial Grammar & Index Resolution for Sign Language Processing

Sign language models are predominantly trained with gloss-sequence or text supervision, thereby under-modeling

Sensor/time series, Machine learning, Time series, Classification, Detection, Text

Use: Classification
Difficulty: Hard
Cost: High

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

Understanding where LLMs store factual knowledge is critical for hallucination mitigation. We systematically q

Natural language processing, Large language models, Classification, Text

Use: Classification
Difficulty: Hard
Cost: High

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs

Backdoor attacks in large language models (LLMs) are often treated as isolated trigger-response failures, moti

Deep learning, Model compression and quantization, Classification, Detection, Text

Use: Classification
Difficulty: Hard
Cost: High

Self-Supervised Vision Transformers for CBCT-Based Detection of Temporomandibular Joint Osteoarthritis

Temporomandibular joint osteoarthritis (TMJ OA) is a prevalent degenerative condition whose osseous changes ar

Deep learning, Transformer, Classification, Detection, Generation

Use: Classification
Difficulty: Hard
Cost: High

Beyond Raw Signals: Undecoded Generative Latents as Privileged Synthetic Data

While multimodal integration significantly improves computer vision models, deploying them incurs prohibitive

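Deep learning, Model compression and quantization, Classification, Generation, Image

Use: Classification
Difficulty: Hard
Cost: High

Computer vision, Segmentation, Classification, Image, 3D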

MS-COOT: Comparing Morse-Smale Complexes with Co-Optimal Transport

Understanding and comparing structures in scalar fields is a central challenge in scientific visualization, wi

Use: Classification
Difficulty: Hard
Cost: High

Deep learning, Transformer, Classification, Segmentation, Regression

arxiv, GitHub available, 2026-06-06

How Much MRI Preprocessing Is Enough? A Cost-Utility Study for Brain MRI Foundation Models

MRI preprocessing defines the input distribution seen by brain MRI foundation models, yet it is usually treate

Use: Classification
Difficulty: Hard
Cost: High

RAPID: Layer-Wise Redundancy-Aware Pruning and Importance-Driven Token Merging for Efficient ViT

Vision Transformers (ViTs) achieve strong performance but suffer from high computational costs due to quadrati

Use: Classification
Difficulty: Hard
Cost: High

Sensor/time series, Computer vision, Segmentation, Classification, Image, Text

One Stone, Three Birds: Self-adaptive Optimal Transport for Multi-VLM Selection, Adaptation, and Ensembling

Vision-language models (VLMs) enable visual recognition from semantic class descriptions, which makes them att

Use: Classification
Difficulty: Hard
Cost: High

Human-Centered Benchmarking of Driver Monitoring Models

Vision-based driver monitoring systems are increasingly deployed in safety-critical intelligent transportation

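Sensor/time series, Deep learning, Transformer, Classification

Use: Classification
Difficulty: Hard
Cost: Low

Suited to MI, Deep learning, Transformer, Classification, Regression, Prediction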

OSMGraphCLIP: Learning Global Location Representations from OpenStreetMap Graphs

We present OSMGraphCLIP, a CLIP-style geospatial representation model that learns global location embeddings f

Use: Classification
Difficulty: Hard
Cost: Low

Uncertainty-Aware Intention Prediction for Human-to-Robot Assembly Teleoperation

In assisted teleoperation for human-robot collaboration, accurate intention prediction is critical for enablin

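Natural language processing, RAG, Classification, Detection, Segmentation

Use: Classification
Difficulty: Hard
Cost: High

Explainable, Natural language processing, Large language models, Classification, Image, Text

arxiv, GitHub available, 2026-06-05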

LLM-Guided Evolution for Medical Decision Pipelines

Adapting large language models (LLMs) to clinical workflows often requires costly fine-tuning or manual prompt

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-05

End-to-End Control of a Powered Knee-Ankle Prosthesis Towards Unified, Tuning-Free Assistance

Powered prostheses conventionally rely on impedance controllers that require extensive manual tuning and expli

Deep learning, CNN, Classification

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-05

Learning All-Terrain Locomotion for a Planetary Rover with Actively Articulated Suspension

This paper presents ERNEST, a four-wheeled planetary rover concept equipped with a two-degree-of-freedom Activ

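Sensor/time series, Natural language processing, Prompt engineering, Classification, Reinforcement learning

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-04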

PAC-Bayesian Adversarially Robust Generalization for Message Passing Graph Neural Networks: A Sensitivity Analysis

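To keep graph neural network (GNN) inference stable under strong attacks, this study uses PAC-Bayesian analysis to analyze the generalization of GNNs. By proposing a new analysis method, unreliable GNN inference results

Deep learning, Transformer, Classification, Embeddings

Use: Developing a way to prevent unreliable inference results
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-04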

T-FunS3D: Task-Driven Hierarchical Open-Vocabulary 3D Functionality Segmentation

Open-vocabulary 3D functionality segmentation enables robots to localize functional object components in 3D sc

Natural language processing, RAG, Classification, Segmentation, Image

Use: Classification
Difficulty: Hard
Cost: High

Identifying Gems from Roman RAPIDly

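This study proposes an automated pipeline for transformation detection and transformation error detection on data to be collected by the future Roman observatory. Transformation detection is an especially important capability for Roman data, and to detect astronomical phenomena, rapid

Machine learning, Supervised learning, Classification, Detection, Image

Use: Automated error detection and transformation detection for promising celestial objects
Difficulty: Hard
Cost: High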

Graph Cascades: Contagion-Based Mesoscopic Rewiring for Structure-Aware Graph Machine Learning

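This study proposes a graph machine learning algorithm that accounts for graph structure, in particular a rewiring technique that takes graph structure and multi-hop support into account.

Use: Structure-aware graph machine learning
Difficulty: Hard
Cost: High

Suited to tabular data, Easy to try on CPU, Computer vision, Multimodal, Classification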

Worker Utility as Hysteresis: A Preisach Model of Transaction Acceptance in Gig Labour Markets

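This study proposes an efficient analysis of individual decision-making (Worker Utility), analyzing individual decisions efficiently and putting the results to use.

Use: Efficient analysis of individual decision-making
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, RNN / LSTM, Classification, Text, Time series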

Seq103: A Unified Neuroevolution Framework for Compact Sequence Architecture Discovery

Neuroevolution is a representative neural architecture search paradigm that evolves both network topology and

Use: Classification
Difficulty: Hard
Cost: Low

QDS-SNN: Energy-efficient Quantum Deeply-Supervised Spiking Neural Network Algorithm for Traffic Sign Recognition

Traffic sign recognition is crucial for intelligent transportation and autonomous driving, as it can improve d

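Deep learning, CNN, Classification

Use: Classification
Difficulty: Hard
Cost: High

Explainable, Condition optimization, Deep learning, Transformer, Classification, Generation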

ParetoPilot: Zero-Surrogate Offline Multi-Objective Optimization via Infer-Perturb-Guide Diffusion

Proposes an offline MOO algorithm aimed at parameter optimization.

Use: Parameter optimization
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-02

Combining Statistical Features and Deep Encodings for Rehearsal-Based Class-Incremental Time Series Classification

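Discusses the development of a class-incremental model for time series classification that makes adding new classes easy, along with experimental results obtained with it.

Sensor/time series, Natural language processing, RAG, Classification, Time series

Use: Developing a class-incremental model for time series classification that makes adding new classes easy
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-02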

Hierarchies of Calibration: Classification meets Regression

Concepts of calibration formalize the compatibility between probabilistic predictions and the respective outco

Natural language processing, RAG, Classification, Regression

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-02

Training a Predictive Coding Network on ImageNet using Equilibrium Propagation

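Equilibrium Propagation (EP) is a framework that can be used to train energy-based models, in particular Predictive Coding Networks (PCNs). In the course of training, EP

Deep learning, CNN, Classification, Image

Use: Training PCNs with EP for image recognition
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-01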

When Tabular Foundation Models Transfer Across Modalities: A Systematic Evaluation Across 95 Datasets, 7 Modalities, and Two Regimes

We present a single classification pipeline that combines an Equiangular Tight Frame (ETF) preprocessing stage

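Suited to tabular data, Sensor/time series, Quality prediction/anomaly detection, Deep learning, Model compression and quantization, Classification, Text, Audio

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-01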

Convex Distance Operator Transport: A Convex and Geometry-Preserving Formulation

We introduce Convex Distance Operator Transport (CDOT), the first convex optimal transport framework that alig

Computer vision, Segmentation, Classification, 3D

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-01

Welfare-Optimal Classification with Accuracy Auctions

Prediction algorithms are increasingly used to inform decisions about humans, but maximizing accuracy$\rule[0.

Deep learning, Model compression and quantization, Classification

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-31

Spiking and Event-driven Neuromorphic Mamba Models for Efficient Speech Recognition

Deep learning has greatly advanced automatic speech recognition (ASR), enabling widespread deployment on edge

Deep learning, Model compression and quantization, Classification, Audio

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-30

Statistical Analysis of using the Shapley Value for Sensor Anomaly Localization with Accurate Classifiers

Recent publications have suggested using the Shapley value for sensor anomaly/attack localization. We study

Explainable, Sensor/time series, Natural language processing, RAG, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-28

Deep Binarized Photonic Reservoir Computing for Ultrafast Multimedia Signal Processing

We present a deep photonic neural network architecture based on ultrafast binary optical modulation from a dig

Sensor/time series, Computer vision, Video recognition, Classification, Detection, Image

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-28

Evolutionary Rule Extraction from Corporate Default Prediction Models

Small and medium-sized enterprises (SMEs) represent the majority of firms in most economies and often face fin

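Explainable, Condition optimization, Natural language processing, RAG, Classification, Generation, Regression

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-27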

CLANE: Continual Learning of Actions on Neuromorphic Hardware from Event Cameras

Recognizing and continuously learning novel human actions without forgetting prior classes is a requirement fo

Sensor/time series, Deep learning, CNN, Classification, Image, Video

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-27

Learning to Assess the Reliability of Number-of-Runs Estimation in Stochastic Optimization

In large-scale benchmarking of stochastic optimization algorithms, the key challenge is no longer whether repe

Computer vision, Segmentation, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-26

Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

Neural networks are known to develop latent representations that are $aligned$, namely structurally similar ac

Quality prediction/anomaly detection, Computer vision, Segmentation, Classification, Regression

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-24

Growing a Neural Network in Breadth, Depth, and Time

Spatial and temporal resource constraints are critical for both biological and artificial intelligent systems.

Deep learning, CNN, Classification

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-22

Planktonzilla: Multimodal dataset and models for understanding plankton ecosystems

Marine plankton underpin aquatic food webs and play a key role in global CO2 sequestration, making reliable sp

Suited to small data, Deep learning, Transformer, Classification, Image, Text

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-22

SpikingMoE: SDPrompt-Guided Dynamic Expert Fusion in Spiking Neural Networks

Proposes SpikingMoE to speed up spiking neural networks. To reduce spike communication, the framework uses SDPrompt-Guided Dynamic Expert Fusion

Use: A module for improving spike-based intelligence
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-21

Temporal Coding as a Substrate for Sensorimotor Object Inference: A Spiking Reinterpretation of Thousand Brains Architecture

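To develop a preferred substrate for spatiotemporal object recognition, this study proposes a method called Spiking Reinterpretation of Thousand Brains Theory. This

Sensor/time series, Computer vision, Video recognition, Classification

Use: Developing a preferred substrate for spatiotemporal object recognition
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-21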

Smoothed Elicitation Complexity for Approximate $Γ$-calibration of Discrete Classification Tasks

One prominent method of evaluating machine learning model trustworthiness is the notion of calibration. In the

Machine learning, Supervised learning, Classification

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-19

Training Neural Networks with Optimal Double-Bayesian Learning

Backpropagation with gradient descent is a common optimization strategy employed by most neural network archit

Computer vision, Segmentation, Classification, Detection

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-17

Von Economo neurons enable reliable social skill acquisition in recurrent spiking neural networks: a computational account with clinical predictions

Von Economo neurons (VENs) are selectively lost in behavioural-variant frontotemporal dementia (bvFTD) and red

Deep learning, RNN / LSTM, Classification

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-16

Classification aggregation: a quantitative impossibility theorem

A group of individuals wishes to classify $m$ objects into $n$ categories in such a way that no class is left

Use: Classification
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-15

Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip

We propose a scalable neuromorphic architecture based on spiking dynamics emerging from the autonomous time-co

Deep learning, Model compression and quantization, Classification, Audio

Use: Improving spike computation accuracy
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-05-15

Thermodynamic Networks: Harnessing Non-Equilibrium Steady States for Computation

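This study proposes a way to use physical systems for computation. According to the study, the method sped up computation.

Use: Using physical systems for computation
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-15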

The Geometry of Cooperative Game Solutions: Stratified Egalitarian Shapley Values

The space L of linear value maps on a finite-player cooperative game G^N is finite-dimensional, and admits a c

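Quality prediction/anomaly detection, Deep learning, Model compression and quantization, Classification, Regression, 3D

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-14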

On the Stability of Growth in Structural Plasticity

Standard deep-learning pipelines usually choose the network architecture before training and keep it fixed thr

Deep learning, CNN, Classification, Image, Text

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-13

Genetic algorithm vs. gradient descent for training a neural network architecture dedicated to low data regimes in small medical datasets

Aim/Introduction: Distance-encoding biomorphic-informational neural network (DEBI-NN) is a recently proposed a

Suited to MI, Computer vision, Segmentation, Classification

Use: Classification
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-05-12

Breaking Global Self-Attention Bottlenecks in Transformer-based Spiking Neural Networks with Local Structure-Aware Self-Attention

Transformer-based Spiking Neural Networks (SNNs) integrate SNNs with global self-attention and have demonstrat

Use: Classification
Difficulty: Hard
Cost: Low