MLinfo | 機械学習・AI論文まとめ

コンピュータビジョン物体検出分類検出セグメンテーション

ultralytics — Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimation, object tracking

ultralyticsはYOLO(You Only Look Once)の技術を使用したオブジェクト検出ライブラリで、高い精度を提供している。

用途: オブジェクト検出
難易度: Easy
コスト: Low

コンピュータビジョン物体検出分類検出セグメンテーション

yolov5 — Ultralytics YOLOv5 in PyTorch for object detection, instance segmentation, classification, training, and export.

YOLOv5という物体検出アルゴリズムをPyTorchから他の言語に変換できるライブラリ。

用途: 物体検出
難易度: Easy
コスト: High

コンピュータビジョン物体検出分類セグメンテーション画像

label-studio — Label Studio is a multi-type data labeling and annotation tool with standardized output format

データラベル化と注釈化を行うためのツールです。

用途: データラベル化ツール
難易度: Easy
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション分類検出画像

cvat — Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.

CVATは、機械学習用の業界標準のデータエンジンです。さまざまなスケールのチームが使用し、さまざまなスケールのデータに対応しています。

用途: データのラベル付けと管理
難易度: Easy
コスト: High

品質予測/異常検知機械学習教師あり学習分類検出画像

fiftyone — Refine high-quality datasets and visual AI models

FiftyOneは、データセットの精査とAIモデル可視化を支援するライブラリです。このライブラリは、データセットの品質を高め、AIモデルを可視化するのを支援するために使用できます。

用途: データセットの精査とAIモデル可視化
難易度: Easy
コスト: Low

FunASR — Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenAI-compatible/MCP serving.

電気生理信号から表現を学習し、脳コンピューターインターフェースの開発を支援する。

深層学習Transformer分類検出テキスト

用途: 電気生理信号から表現を学習する
難易度: Easy
コスト: High

表形式向き深層学習Transformer分類検出画像

presidio — An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

presidioは、テキスト、画像、構造化データを含む敏感データを検出、削除、マスク、アノニマイズするオープンソースフレームワークです。自然言語処理、パターンマッチング、カスタマイズ可能なパイプラインをサポートします。

用途: データのプライバシーを保護する
難易度: Easy
コスト: Low

3D-Aware VLMs with Implicit and Explicit Geometries

3次元空間理解技術のための新しいアプローチであるVLM-IE3D（Vision-Language Models with Implicit and Explicit 3D geometry）を提案しました。VLM-IE3

コンピュータビジョン3D・点群検出画像テキスト

用途: 3次元空間理解技術の開発
難易度: Hard
コスト: High

品質予測/異常検知画像検査深層学習Transformer検出生成画像

Synthetic data generation framework for quality control automation in gravure printing

印刷品質管理技術のための新しいアプローチであるシンセティックデータ生成フレームワークを提案しました。このフレームワークは、ロトグラビューグラビング技術における品質管理のためのシンセティックデータを生成することで、印刷

用途: 印刷品質管理技術の開発
難易度: Hard
コスト: High

センサ/時系列自然言語処理大規模言語モデル分類検出埋め込み

Toward Generalizable Cognitive Impairment Detection with Speech-Based Multimodal Large Language Models

Cognitive impairment (CI) is a growing public health concern. Early and accurate diagnosis is critical for ena

用途: 分類
難易度: Hard
コスト: High

Test-Time Scaling via Error Localization

Scaling inference-time computation has emerged as a reliable method to improve the performance of large langua

自然言語処理大規模言語モデル検出生成テキスト

用途: 検出
難易度: Hard
コスト: High

Token Budget Saturation and Mechanistic Early Detection of Reasoning Non-Convergence in Chain-of-Thought Models

チェーン・オブ・サウト reasoning モデルの収束不明確さを解決する研究。このモデルの不完全収束は、生成するトークンの数に依存し、モデルには収束しない限り問題を解決する能力がない。これを解決するための予測を終了する

自然言語処理プロンプトエンジニアリング検出生成

用途: チェーン・オブ・サウト reasoning モデルに適切に予測を終了する方法を検討する
難易度: Hard
コスト: High

自然言語処理大規模言語モデル異常検知テキスト強化学習

Training Large Language Models for Self-Explanation Faithfulness

この研究では、自己説明の信頼性を検証するためのRL方法を提案し、自己説明の信頼性を直接最適化するための新しいアプローチを検討します。

用途: 自己説明の信頼性
難易度: Hard
コスト: High

Nipping the Butterfly Effect in the Bud: Self-Output Fine-Tuning for Autoregressive Weather Prediction

この研究では、長期

自然言語処理RAG異常検知予測テキスト

用途: 天気予報
難易度: Hard
コスト: Low

Counterfactual Explainability Framework With CycleGAN And Counterfactual-Classifier Alignnment Score for Retinal Disease Classification

Automated detection of vision impairing retina-based ocular conditions from fundus images is important for ear

説明可能深層学習CNN分類検出画像

用途: 分類
難易度: Hard
コスト: Low

Regularized Optimization on Grassmann Manifold: Theory, Algorithm and Applications

Spectral methods are among the most widely used techniques for community detection, clustering, and graph lear

説明可能深層学習軽量化・量子化検出

用途: 検出
難易度: Hard
コスト: Medium

品質予測/異常検知自然言語処理ファインチューニング検出生成

RadioTrace: Transmitter-Aware Diffusion for Radio Map Estimation without Deployment-Time Fine-Tuning

RFマップ（無線周波数マップ）を推定するためのTransmitter-Aware Diffusion（送信機認識拡張）を提案した研究で、この方法によりRFマップを効率的に推定できる。

用途: RFマップの推定を支援する
難易度: Hard
コスト: High

Position Bias is Hidden Behind Ceiling Effects: A Permutation Diagnostic for LLM Benchmarks

LLM（言語モデル）の評価における位置バイアスを分析するための方法を提案した研究で、この方法により、位置バイアスが評価結果にどのような影響を与えるかが明らかにできる。

自然言語処理大規模言語モデル検出生成

用途: LLMの評価における位置バイアスを分析する
難易度: Hard
コスト: High

センサ/時系列深層学習Transformer検出テキスト時系列

Beyond Heavy Log Curation: Perplexity-Based APT Detection via Unsupervised, Context-Augmented Language Models

Advanced Persistent Threats (APTs) remain difficult to detect because only a small fraction of events in large

用途: 検出
難易度: Hard
コスト: High

品質予測/異常検知深層学習RNN / LSTM検出異常検知教師なし

Unsupervised Consensus-Based Anomaly Detection for Spatiotemporal Malaria Incidence in Ghana

A consensus anomaly detection framework was applied to monthly malaria surveillance data from Ghana (2014-2023

用途: 検出
難易度: Hard
コスト: Medium

Detecting LLM-Generated Tokens in Human--LLM Coauthored Text

The rise of human-AI collaborative writing has created a growing need for fine-grained detection methods that

自然言語処理大規模言語モデル分類検出テキスト

用途: 分類
難易度: Hard
コスト: High

When Are Reasoning-Based Guardrails Not Efficient? ResponseGuard: A Fast Vision-Language Guard for Real-Time Moderation

A vision-language AI assistant returns its answer as a stream of generated tokens. Therefore, a safety guard t

深層学習軽量化・量子化検出画像テキスト

用途: 検出
難易度: Hard
コスト: High

説明可能深層学習Transformer検出埋め込みテキスト

Multimodal Pretraining for Generalizable EEG Representation Learning

Electroencephalography (EEG) models used for epilepsy are often limited to specific datasets and tasks. This l

用途: 検出
難易度: Hard
コスト: High

From Static Bibliometrics to Dynamic Knowledge Graphs: An LLM-Powered Framework for Modernizing Science, Technology, and Innovation (STI) Analytics

Bibliometric indicators - citation counts, h-indexes, co-authorship networks - have long anchored science, tec

自然言語処理大規模言語モデル検出テキスト

用途: 検出
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション検出画像テキスト

PC-Edit: Prompt-Contrastive Region Discovery and Region-Guided Editing

Replacing an object with one that differs in category or shape requires complete source removal, natural targe

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション分類検出動画

BasketEvent: Understanding Who Did What and When in Basketball Videos

この研究では、大規模言語モデルを使用して、basketボールの動的理解に基づいて、プレイヤーへの関わりや時間境界を推測するモデルを開発しました。

用途: basketボールの動的理解
難易度: Hard
コスト: High

Explainable Belief Harmonization under Dynamic Epistemic Partitions

この研究では、大規模言語モデルを活用して、信念の共有を組み合わせるモデルを開発しました。大規模言語モデルを活用することで、信念の共有を推測することができました。

説明可能自然言語処理RAG検出

用途: 共有された信念を組み合わせるモデル
難易度: Hard
コスト: Low

Auditing Provenance Sensitivity in LLM Agent Action Selection

LLM agents choose tools and arguments from context that mixes user requests, tool outputs, retrieved records,

深層学習Transformer検出テキスト

用途: 検出
難易度: Hard
コスト: High

CSPF: A Constrained Shared-Private Fusion Method for Non-Verifiable Preference Evaluation

非真実性の評価において、評価手法が多様な評価基準を捕捉する能力に乏しく、評価者間の偏見が存在する問題を解決するために、CSPF (Constrained Shared-private Fusion) を提案している。

自然言語処理RAG異常検知

用途: 非真実性の評価
難易度: Hard
コスト: Low

MI向き深層学習軽量化・量子化セグメンテーション異常検知画像

Unified Video Dense Prediction from Disjoint Data

ビデオ内の物体の空間推論を同時に行うことで、現存するタスク固有の注釈を超えた統一的なビデオ推論システムを構築した。

用途: ビデオの分割推論
難易度: Hard
コスト: High

GLAM-SLAM: Real-time Gaussian Large-scale Mapping via Flow Densification and Spatial Decomposition

一部のGaussianスプレイティングを利用したSL

品質予測/異常検知深層学習軽量化・量子化検出3D

用途: シンプルで実用的なSLAM
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション検出テキスト動画

Incremental Optimal Assignment for Real-Time Crowd Tracking

Multi-object tracking in dense crowds requires solving a bipartite assignment problem between detections and t

用途: 検出
難易度: Hard
コスト: High

HGeo-TopoMap: Boosting Topological Mapping with Hierarchical Geometric Priors

Topological maps are key outputs of autonomous driving perception systems, delivering essential road informati

用途: 検出
難易度: Easy
コスト: Low

品質予測/異常検知自然言語処理RAG検出画像テキスト

Detectors Learn the Wrong Thing: Shortcut-Resistant Adversarial Training Against Physically Realizable Attacks

AI-enabled visual perception systems are increasingly deployed in intelligent transportation infrastructure an

用途: 検出
難易度: Hard
コスト: High

Stokes-Informed Diffusion for Robust Linear Polarization Estimation

Polarization cues benefit applications such as material detection and de-reflection, yet acquiring them typica

深層学習軽量化・量子化検出画像

用途: 検出
難易度: Hard
コスト: High

DTIF: Robust Loop Closure Detection via Delaunay Triangle Topology in Complex Forests

Accurate forest inventory and large-scale mapping are essential for ecosystem monitoring and sustainable fores

深層学習Transformer検出3D

用途: 検出
難易度: Hard
コスト: High

HalluScope: Fine-grained Hallucination Diagnosis for Multimodal Large Language Models

大規模言語モデルはさまざまな画像をテキストに変換する上で優れた性能を示しているが、発生するホログラフィックな診断にはまだ解決策が必要です。この研究では、主流の粗い検出方法の欠点を補うため、細部の診断方法を提案しています。

説明可能自然言語処理大規模言語モデル分類検出生成

用途: ホログラフィックハロウィーンの診断
難易度: Hard
コスト: High

Spectral-Spatial Synergistic Guided Network for Hyperspectral Salient Object Detection

Hyperspectral salient object detection aims to identify visually salient regions from hyperspectral images. Ex

深層学習軽量化・量子化検出画像

用途: 検出
難易度: Hard
コスト: Low

品質予測/異常検知深層学習Transformer検出生成画像

GroupVideo: Multi-Identity Customized Text-to-Video Generation

Current identity customized video generation methodologies are predominantly limited to single-identity scenar

用途: 検出
難易度: Hard
コスト: High

Explainable Deepfake Detection Challenge

Deepfake detection is moving beyond binary classification decisions toward systems that can also explain the v

説明可能コンピュータビジョン画像分類分類検出生成

用途: 分類
難易度: Easy
コスト: Low

深層学習Attention機構検出セグメンテーション

FSB-Net: Frequency-Spatial Boundary Network for Brain Stroke Lesion Segmentation in Non-Contrast CT

この論文では、非コントラストCT（NCCT）スキャン中の脳梗塞領域を正確に分割するために、周囲境界を特徴としているFrequency-Spatial Boundary Network（FSB-Net）を開発しました。

用途: 脳梗塞領域の分割
難易度: Hard
コスト: Low

RECO: Region-Aware Compensation for Extrinsic Perturbations in Roadside 3D Detection

この研究では、路上の3Dオブジェクト検出を改善するために、外部性を考慮した地域認識のアラーカンシーを提案します。

深層学習Transformer検出3D

用途: 鉄道沿いのオブジェクト検出
難易度: Hard
コスト: High

Engine-Native Editable 3D World Reconstruction with Objects and Lighting

この論文では、Lumeraという手法を提案します。Lumeraは、Engine-Native 3D World ReconstructionとLightsを検出するために使用します。

自然言語処理大規模言語モデル検出生成画像

用途: 3D世界の再構成
難易度: Hard
コスト: High

CPUで試しやすい深層学習軽量化・量子化検出3Dマルチモーダル

Factorized Spatio-Temporal Convolutions for Human Pose Estimation from Planar Lidar

この論文では、安全な人とロボット間の対話を目的とした、人間の姿勢推定とロボットの動作制御の一連のネットワークが提案されます。

用途: 人間とロボット間の安全な交互作用
難易度: Hard
コスト: High

センサ/時系列深層学習Transformer検出画像音声

Human-Inspired Framework for Robotic Craniotomy: Integrating Multimodal Fusion and Adaptive Trajectory Adjustment

人間の知能を模倣するクロアニオトミー手術のフレームワークを提案します。このフレームワークは、前方計画と後方実行を組み合わせて、手術中に手術台の位置を自動的に調整することで、人間と同様の安全で効率的な手順を実現します。

用途: クロアニオトミー手術の自動化
難易度: Hard
コスト: High

GuidedAttention: Interpretable and Correctable Visual Attention for OOD-Robust Robot Manipulation via Imitation Learning

視覚モータリティポリシーを学習する際、人間が視覚アタッチメントを理解し、修正できるようにするため、視覚アタッチメントを明示的にしたフレームワークを提案します。

説明可能生成AI拡散モデル異常検知画像

用途: ロボットマニュピュレーションの視覺アタッチメント
難易度: Hard
コスト: High

CPUで試しやすい機械学習教師あり学習分類検出回帰

githubGitHubあり2026-07-23

pycaret — Open-source, low-code AutoML platform for Python. PyCaret 4.0: sklearn-native engine + React control plane.

pycaretは、Pythonによるオープンソースの低コストオートMLプラットフォームで、Reactコントロールプレーンを備えたsklearnネイティブエンジンを搭載しています。

用途: オートMLプラットフォーム
難易度: Easy
コスト: Low

When Does Recurrence Become an Algorithm? Convergence Selection in Weight-Tied Looped Transformers

When does a weight-tied looped transformer -- one block applied T times -- implement an actual algorithm? We a

深層学習Transformer異常検知

用途: 異常検知
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出生成異常検知

Classical Hardware Acceleration of Quantum Autoencoders for Real-Time Anomaly Detection in Collider Experiments

この研究では、クラスター検出アナライザーにおける量子力学の応用を研究し、精度を向上させた。

用途: クラスター検出アナライザーにおける量子力学の応用
難易度: Hard
コスト: Low

Detecting Neural Network Failures through Spectral Analysis of Internal Activations

Neural network misclassifications exhibit characteristic spectral instability in internal activations that is

深層学習軽量化・量子化分類検出

用途: 分類
難易度: Hard
コスト: Low

Hard Guarantees at a Measured Price: Entropy-Stable Learned Finite Volumes for Compressible Flow

圧縮流体の解析を目的とした新しいアプローチ、Entropy-Stable Learned Finite Volumes を提案する。

深層学習Transformer異常検知

用途: 圧縮流体の解析
難易度: Hard
コスト: High

CURED: Creating, Understanding, and Repairing Errors Demonstrator

データクリーンシングを扱う研究、CURED を用いてデータクリーンシングを提案する。

表形式向き機械学習表形式データ検出テキスト表形式

用途: データクリーンシング
難易度: Hard
コスト: Low

Bayesian uncertainty estimation improves clinical decision making in medical AI agents

Machine learning models for medical image analysis typically lack a reliable measure of confidence, limiting t

深層学習正規化・最適化手法分類検出画像

用途: 分類
難易度: Hard
コスト: High

CPUで試しやすいMI向き深層学習軽量化・量子化分類検出

Taming the Security-Energy Paradox: A Green AI Approach to Optimized Android Malware Detection

この研究では、Androidマルウェアの検出に使用されるデープラーニングモデルをOptimizeする方法を提案しました。

用途: Androidマルウェアの検出
難易度: Hard
コスト: Low

Diffusion ReRoll: Revisable Denoising for Robotic Sequential Prediction

この研究では、実世界ロボットのシーケンシャル予測に使用できる、diffusion-based frameworkを提案しました。

自然言語処理RAG生成異常検知テキスト

用途: 実世界ロボットのシーケンシャル予測
難易度: Hard
コスト: High

表形式向きCPUで試しやすいコンピュータビジョンセグメンテーション検出

Harnessing Disagreement: Detecting Correlated Agreement Blindness in Multi-Agent Triage

この研究では、マルチエージェントによるトリージュア

用途: マルチエージェントによるトリेजュアの安全性の評価
難易度: Hard
コスト: Medium

TriAgent: Divergence-Aware Multi-Agent Committees for Cost-Efficient Financial Sentiment Analysis

生産的言語モデルの利用による金銭的感情分析に対処するための方法を提案している。複数のエージェントを活用したコミティー方式を使用し、さまざまな粒度のテキストデータに対応できるように、単語レベルのルールベースアプローチ、句節

深層学習Transformer検出テキスト

用途: 金融分野の感情分析
難易度: Hard
コスト: High

Learning the Arabic Dialect Continuum as a Continuous Space: A Regression Approach to Speaker Origin Prediction

We present a regression-based approach to Arabic dialect geolocation that models dialectal variation as a cont

深層学習Transformer検出回帰

用途: 検出
難易度: Hard
コスト: High

Transition-Related Potentials as Markers of Narrative Comprehension in Continuous EEG

Harnessing the potential of electroencephalography (EEG) for brain research is fundamentally limited by intrin

深層学習Transformer検出テキスト

用途: 検出
難易度: Hard
コスト: Low

品質予測/異常検知生成AI拡散モデル検出生成テキスト

Generative AI floods and dilutes the market for books

Generative AI can produce book-length works of fiction at near-zero cost. These books are often dismissed as l

用途: 検出
難易度: Hard
コスト: High

Closing the Lab-to-Store Gap: A Data-Efficient Post-Training and Experience-Driven Learning VLA Framework for Retail Humanoids

Closing the gap between benchmark performance and reliable real-world operation remains a central challenge fo

深層学習軽量化・量子化異常検知画像テキスト

用途: 異常検知
難易度: Hard
コスト: High

CUSUM-Shaped Inference-Time Monitoring and Targeted Re-Decoding for Quantized Small Language Model Reasoning

Quantized small autoregressive reasoning models can enter long, repetitive, or unproductive trajectories, yet

自然言語処理RAG検出生成回帰

用途: 検出
難易度: Hard
コスト: Low

Drift-Aware RL-based Wavelet Denoising for Network-Traffic Anomaly Detection

回線流量データに対するノイズと漂移を考慮した波列減少アルゴリズムを実装し、静的な波列減少法が漂移のあるシナリオでは効果を低下していると指摘する。

品質予測/異常検知自然言語処理RAG検出異常検知

用途: 回線流量異常検出システムの精度向上
難易度: Hard
コスト: Low

HalluTruthQA: A Fine-Grained Benchmark for Hallucination Detection, Localization, and Explanation in Arabic Question Answering

大きな言語モデルは真実の情報を提供できるように見えますが、実際は虚偽情報を提供することが多く、これを検知、検出、および検証するための基準を作成するため、HalluTruthQAが開発されました。

自然言語処理大規模言語モデル検出QAテキスト

用途: 仮想の答えを検知、検出、および検証するための基準を作成する
難易度: Hard
コスト: High

TalentCLEF at CLEF2026: Skill and Job Title Intelligence for Human Capital Management

This paper presents the second edition of the TalentCLEF Challenge, which will run as an evaluation lab as par

深層学習Transformer分類検出テキスト

用途: 分類
難易度: Hard
コスト: Low

A Multi-Dimensional Evaluation of Explainability in Media Bias Detection

Detecting media bias automatically is difficult because biased framing is often subtle, yet in domains such as

深層学習Transformer分類検出

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション検出異常検知テキスト

Rethinking Open-World Video Anomaly Detection: Diagnosing Definition Blindness

Open-world video anomaly detection (OWVAD) is expected to detect events that match a user-specified definition

用途: 検出
難易度: Hard
コスト: High

表形式向き説明可能CPUで試しやすい品質予測/異常検知コンピュータビジョン物体検出分類検出画像

How Does Urban Context Relate to Residential Building Health? A Vision-POI Fusion Framework for Building-Level Housing Inspection

Housing-level urban physical examination is essential for identifying residential building problems and suppor

用途: 分類
難易度: Hard
コスト: Low

センサ/時系列深層学習軽量化・量子化検出セグメンテーション埋め込み

arxivGitHubあり2026-07-22

Not All Patches are Equal: Sampling Matters for Visible-Infrared Pre-Training

Visible-infrared (VIS-IR) alignment is a key pre-training task for robust multi-sensor perception. Most existi

用途: 検出
難易度: Hard
コスト: High

CPUで試しやすいコンピュータビジョン物体検出検出

Real-Time EEG Cap Electrode Detection for Guided Point-of-Care Placement

We present a two-stage vision system that detects EEG cap electrodes in a live webcam stream and validates the

用途: 検出
難易度: Hard
コスト: Medium

RIM: A Retrieval-In-Matching Framework for Cross-Domain Global Visual Localization of UAVs

Global visual localization of unmanned aerial vehicles (UAVs) using remote-sensing reference maps has attracte

センサ/時系列深層学習軽量化・量子化検出画像3D

用途: 検出
難易度: Hard
コスト: High

Toward Seasonal Guidelines for Robust Deep-Learning Sentinel-2 Building Detection in Different Area Types

OffNadirLocは地学化におけるオフナジアムの視点を考慮するための基準セットを提案します。これにより、ドローンと衛星画像の交差視点地学化プロセスでは重要な構造的シーン理解と内部ドメイン間の関係的制約に重点を置くこと

深層学習CNN分類検出セグメンテーション

用途: ドローンから衛星画像への地学化の改善
難易度: Hard
コスト: High

自然言語処理プロンプトエンジニアリング検出画像テキスト

OffNadirLoc: Benchmark and Framework for Challenging UAV-to-Satellite Geo-Localization under Large Off-Nadir Views

OffNadirLocは交差視点地理位置を推定するための基準セットを提案します。これにより、ドローンと衛星画像の交差視点地理位置推定プロセスでは重要な構造的シーン理解と内部ドメイン間の関係制約に焦点を当てることができます

用途: ユーザー間の地理的位置の推定改善
難易度: Hard
コスト: High

G-MAD: A Game-Based Data Generation Framework for Multi-View RGB-T Aerial Object Detection

This work introduces G-MAD, an open-source framework that uses Arma3 to generate synchronized multi-view RGB-T

コンピュータビジョン物体検出検出生成

用途: 検出
難易度: Hard
コスト: Medium

LoRFT: Benchmarking Long-Range Vehicle Trajectory Reconstruction from Fixed Highway Cameras

Long-range vehicle trajectories provide important spatio-temporal evidence for traffic safety analysis, autono

自然言語処理RAG検出動画

用途: 検出
難易度: Hard
コスト: High

LAVIFT: Latent-Action-Guided Vision Fine-Tuning for Surgical Interaction Recognition

Understanding instrument-tissue interactions is essential for context-aware surgical AI and autonomous robotic

自然言語処理ファインチューニング分類検出画像

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出セグメンテーション画像

Current Injection Spiking Neural Network for Infrared and Visible Image Fusion

Infrared and visible image fusion (IVIF) integrates the complementary information of two modalities into a sin

用途: 検出
難易度: Hard
コスト: High

KineBench: Benchmarking Embodied World Models via IDM-Free Kinematic Grounding

Evaluating the physical consistency of embodied world models(EWMs) is a critical open challenge. While closed-

コンピュータビジョン3D・点群生成異常検知画像

用途: 生成
難易度: Hard
コスト: High

少数データ向きCPUで試しやすい条件最適化自然言語処理ファインチューニング検出生成画像

PRISM-DR: Per-lesion Retinal Inference with Specialist Models for Diabetic Retinopathy

この研究では、糖尿病性黄斑病変の検出を目的としたPRISM-DRシステムを開発しました。このシステムは、医師が見逃す可能性がある小さな低コントラストな病変を見つけるのに役立ちます。

用途: 糖尿病性黄斑病変を検出する
難易度: Hard
コスト: High

WASABI: Whole-graph Assignment-based Stabilizer for lAne topology By Inter-frame tracking

マグネティックリゾナンスイメージング (MRI) のデータ収集には、多くのエネルギーと時間が必要です。アクティブサンプリングは MRI の速度を増加させる技術ですが、現在のアプローチでは、低周波数部分（解像度）と高

用途: MRIデータを効率よく収集する
難易度: Hard
コスト: Medium

センサ/時系列品質予測/異常検知コンピュータビジョン物体検出検出マルチモーダル

DRGBT-1K: A Large-scale High-quality Benchmark for Dynamic RGBT Tracking

地上を表す重力式マップの高解像度版が、多くの用途で役立ちます。たとえば、市区町村の変化を監視したり、エネルギー対策を向上させたり、温室効果ガスの排出量を追跡したりすることができます。4つの主要な全世界建物Rasterデー

用途: 宇宙に分布する建物の面積を正確に推定する
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出セグメンテーション動画

arxivGitHubあり2026-07-22

Efficient Tracking and Understanding Object Transformations

Tracking objects through state transformations is essential for understanding real-world dynamics. However, ex

用途: 疼痛位置
難易度: Hard
コスト: High

説明可能深層学習Transformer検出マルチモーダル

An Exploratory Analysis of Pain Localization via Explainable Computational Modeling

Automatic pain localization, which involves identifying the anatomical origin of pain from peripheral physiolo

用途: 検出
難易度: Hard
コスト: High

センサ/時系列強化学習方策勾配 (PPO / A3C)検出音声

Distributed Acoustic Localization Array Deployed Using a Soft Everting Vine Robot

Soft robot exteroception is increasingly being explored for a variety of field applications. In this work, we

用途: 検出
難易度: Hard
コスト: Medium

自然言語処理プロンプトエンジニアリング検出画像テキスト

arxivGitHubあり2026-07-22

ReferTrack: Referring Then Tracking for Embodied Visual Tracking

ReferTrack は、自然言語で対象の車両に付近する自動車を追従させるシステムである。このシステムでは、対象の車両に付近する自動車を認識する後、自動車の動きを予測する。

用途: 自動車が対象の車両に付きそわせるシステム
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョン動画認識検出異常検知マルチモーダル

Clinical Pathways as Safety Specifications for Physical AI in Hospital Wards

Clinical Pathways は、ロボットが実際の環境で安全に動作するためのシステムである。これは、ロボットが病室で安全に作業し、医療スタッフや患者を守る。

用途: 医療機関で使うロボットの安全性を確保するためのシステム
難易度: Hard
コスト: High

コンピュータビジョン物体検出分類検出セグメンテーション

githubGitHubあり2026-07-22

supervision — We write your reusable computer vision tools. 💜

supervisionは、機械学習技術を活用して、ユーザー独自のコンピュータビジョンツールを作成することができる。

用途: オリジナルコンピュータビジョンツール
難易度: Easy
コスト: High

githubGitHubあり2026-07-22

insightface — State-of-the-art 2D and 3D Face Analysis Project

このプロジェクトは２Ｄおよび３Ｄ顔の分析を実現するための基盤プロジェクトであり、最先端の技術を導入して顔の分析を実現します。

コンピュータビジョン3D・点群分類検出3D

用途: 面量認証
難易度: Easy
コスト: High

Scaling Laws for Hypernetwork-Based Knowledge Injection in Large Language Models

ハイパーネットワークを用いた知識付与法を提案し、大規模言語モデルに確実に知識を付与する方法について検討した。

自然言語処理大規模言語モデル異常検知テキスト

用途: LLMに知識を付与
難易度: Hard
コスト: High

MIRA-Ev:A Benchmark for Granular Evidence Detection and Relational Reasoning in Clinical Exams

Clinical NLP evaluation remains dominated by multiple-choice question answering (MCQA), which scores only fina

生成AIGAN分類検出QA

用途: 分類
難易度: Hard
コスト: Low

品質予測/異常検知自然言語処理大規模言語モデル分類検出生成

AutoJourn: Multi-Perspective Summarisation, Bias Detection and Bias Neutralisation for LLM-Generated News in Automated Journalism

We present AutoJourn, a demonstration system for multi-perspective news generation and bias-aware evaluation u

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知深層学習Attention機構検出音声

Transcription Policy as a Latent Variable: Activating Controllable Verbatim ASR with Word-Level Timing

記号化の種類 (verbatim vs. intended) は、現在の音声認識モデルの評価に影響を与えるが、このような制約はモデルのトレーニングに影響しないことが多い。しかし、ここでは、制約はモデルのトレーニングに影響

用途: 記号化の制約付き復元
難易度: Hard
コスト: High

Bounding Boxes to Improve Small Language Model Performance on Vision-Based Grading Tasks

The deployment of Small Language Models (SLMs) in educational settings offers significant advantages in terms

コンピュータビジョン物体検出検出画像テキスト

用途: 検出
難易度: Hard
コスト: Medium

Rationale-Guided Knowledge Distillation for Cross-Lingual Stance Detection

Stance detection aims to identify whether a text expresses a favorable or opposing attitude toward a given tar

深層学習軽量化・量子化検出テキスト

用途: 検出
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化検出セグメンテーションテキスト

EGRNet: A Lightweight Semantic Segmentation Network with Edge-Gated Refinement and Adversarial Sensing

As autonomous systems and smart cities continue to evolve, the demand for efficient and robust scene understan

用途: 検出
難易度: Hard
コスト: Medium

Synthetic and Derived Training Images for Campus Waste Detection: A Multi-Seed Evaluation with YOLOv8n

Incorrect disposal can contaminate campus recycling streams, and a bin-mounted camera could provide feedback a

コンピュータビジョン物体検出検出画像

用途: 検出
難易度: Hard
コスト: High

arxivGitHubあり2026-07-21

Detect Early, Escalate Rarely: Anytime Detection of AI-Generated Video from the Compressed Bitstream

Detectors for AI-generated video are evaluated offline. A clip is decoded to pixels and scored once, increasin

CPUで試しやすい深層学習CNN検出画像テキスト

用途: 検出
難易度: Hard
コスト: High

InstructMixup: Instruction-Guided Salient Patch Editing for Robust Data Augmentation

記述情報に従って画像や動画データを混ぜ合わせる「対数混合法」を拡張する方法、InstructMixupを提案する。これにより、データを拡張しながらデータの内容とラベルが維持される。

深層学習Transformer分類検出生成

用途: データ拡張のための対数混合法を拡張する
難易度: Hard
コスト: High

From Distances to Trajectories: Real-Time Signed Distance Function Mapping and Distance-Accelerated Motion Planning for UAVs

難しい環境で運用するためには、自動空飛ブイロード（UAV）が実際に障害物に存在する距離を判断し、安全な軌跡を計画することが求められる。これを行うために、複数のステージ（マッピングと計画）を連続化した、サイン・ディスタン

コンピュータビジョンセグメンテーション検出3D

用途: UAVの安全な運用
難易度: Hard
コスト: High

PathAgentBench: Benchmarking Evidence-Seeking Vision-Language Models on Whole-Slide Pathology Image

Whole-slide image (WSI) diagnosis requires identifying diagnostically relevant regions, examining them across

自然言語処理ファインチューニング検出生成画像

用途: 検出
難易度: Hard
コスト: High

Milo, a Fully Autonomous Indoor/Outdoor Robotic Guide Dog

Many Blind and Low-Vision (BLV) people rely on guide dogs for moment-to-moment navigation, such as staying on

コンピュータビジョン3D・点群検出3D

用途: 検出
難易度: Hard
コスト: High

Computing on the Fly: Navigating a Vision for the Future of Drone Computing

The report envisions a decade in which drones move goods, medical supplies, and information at a scale compara

強化学習検出生成

用途: 検出
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン動画認識検出生成画像

arxivGitHubあり2026-07-21

NGPS: GPS-Denied Aerial Geo-Localization and 2.5D Reconstruction via Deep Satellite Image Matching and Multi-Rate Sensor Fusion

この研究では、高空飛行の無信号位置指示のNGPS (Next-Generation Positioning System)というフレームワークを提案しました。NGPSは、GPSの信号を利用せずに位置推定を可能にします。N

用途: 高空飛行の無信号位置指示
難易度: Hard
コスト: High

品質予測/異常検知自然言語処理大規模言語モデル検出

LLM Detection as an Intervention: Downstream Impact under Strategic User Behavior

LLMが広く使用されるようになり、LLMを識別するツールが開発されている。しかし、識別システムは、使用者の行動に影響を与えている。つまり、識別システムが機能しないと、ユーザが別のシステムを使用することに関連し、最終的な

用途: LLMを識別
難易度: Hard
コスト: High

How Fast Do Signatures Learn? Statistical Theory and Applications for Path Regression

この論文では、パス値学習を行うためにpath signaturesという

MI向き生成AI拡散モデル検出回帰予測

用途: パス値学習を行う
難易度: Hard
コスト: High

Scalable and Efficient Joint Spiking Embedding Predictive Architecture for Large-Scale Dynamic Graphs

Dynamic graph learning aims to capture evolving structural and semantic patterns in real-world systems, such a

深層学習軽量化・量子化分類検出生成

用途: 分類
難易度: Hard
コスト: High

説明可能品質予測/異常検知自然言語処理ファインチューニング検出異常検知テキスト

O-VAD: Industrial Video Anomaly Detection through Object-Centric Tracking and Reasoning

工場の中の異常が検出されるように設計された機械学習モデルを提案しています。通常の方法では、モデルはビデオ内のすべての内容を考慮し、複雑な問題を解決することは困難です。提案されたモデルのアプローチは、オブジェクトを検出して

用途: 産業ビデオの異常発生検出
難易度: Hard
コスト: High

Integrity-Gated Eco-CACC: Epistemic Admissibility for Cooperative Driving at Signalized Intersections

Eco-Cooperative Adaptive Cruise Control (Eco-CACC) systems rely on accurate localization, signal timing, and i

センサ/時系列強化学習モデルベース検出

用途: 検出
難易度: Hard
コスト: Medium

センサ/時系列深層学習Transformer分類異常検知画像

Recti-Q: Feature-Space Rectification for Out-of-Distribution-Robust Quantized Perception in Edge Robotics

エッジロボチクスでの画像認識精度を安定させ、その安定性を確保するために、量化後のパフォーマンスを向上させ、分散型データ量化を実現し、分布シフトの影響を緩和する、新しい機械学習アプローチを提案します。

用途: エッジロボチクスでの画像認識の安定性
難易度: Hard
コスト: High

Optimization of sim-to-real transfer in the humanoid robot NICO

existing robotic grasping methodの限界を解決するためのsim-to-real transfer methodを提案し、成功率を向上させる。

コンピュータビジョン物体検出検出画像

用途: ロボットの手順を解決する
難易度: Hard
コスト: Medium

Importance Sampling and PCA for Finding Failures in Commercial Autonomous Vehicles

existing fault detection methodの限界を解決するためのadaptive stress testing methodを提案し、商用自動運転システムの故障率を減らす。

コンピュータビジョンセグメンテーション強化学習

用途: 自動運転システムの故障検出を解決する
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン物体検出検出音声

Technical Design Review of Duke Robotics Club's Oogway & Crush: AUVs for RoboSub 2026

existing AUV development methodの制約を解決するためのrobustなオートニモティクス基盤と機械学習アライアンスを開発する。

用途: ROBOCUPのAUV開発を推進する
難易度: Hard
コスト: Medium

RoboHarness: Memory-Driven Orchestration of Heterogeneous Robot Policies for Long-Horizon Planning

existing robot control methodの限界を解決するためのmemory-driven orchestration method、RoboHarnessを提案し、長期計画を実現する。

自然言語処理プロンプトエンジニアリング異常検知

用途: ロボットの長期計画を解決する
難易度: Hard
コスト: High

RynnBrain 1.1: Towards More Capable and Generalizable Embodied Foundation Model

existing Embodied Foundation Modelの制限を解決するためのcontact-point prediction とnative 3D grounding methodを提案し、更に能力と

コンピュータビジョンセグメンテーション検出3D

用途: Embodied Foundation Modelの制限を解決する
難易度: Hard
コスト: High

UMCP: A Unified Multi-Task Collaborative Perception Network for Luggage Trolley Pose Estimation

ロボット車の視覚システムは、高精度でリアルタイム性能を持つロジスティクス車両の位置検出を実現する必要があります。従来の手法では、複数のモデルが連続してインフェレンズされ、インフェレンスラティシーが増加し、高規模デプロイメ

コンピュータビジョン物体検出検出画像

用途: luggage trolleyの位置推定
難易度: Hard
コスト: Medium

Lifelong Localization in Dynamic Indoor Environments Combining Odometry with Sparse Distance Sampling

自律ロボットの位置決めは、ロボットナビゲーションの主要なタスクです。ロボットが予測できない、非静的な障害物、またはロボットが未知の環境に入ることが多い。この研究では、ロボットのオドメトリと距離サンプリングを組み合わせて、

センサ/時系列自然言語処理RAG検出3D

用途: 自律ロボットの位置推定
難易度: Hard
コスト: High

A2RL V\textsubscript{max}: The A2RL autonomous racing dataset for long-range, high-speed perception and multi-vehicle interaction

In autonomous driving development, a perception dataset is crucial, as it provides fundamental data for traini

コンピュータビジョン3D・点群検出テキスト3D

用途: 検出
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知自然言語処理ファインチューニング検出画像

arxivGitHubあり2026-07-20

Polar Coordinate-based Differential Evolution for Moving Target Search Using Vision Sensor on Unmanned Aerial Vehicles

In search and rescue operations, there is a period known as the "golden time" during which the probability of

用途: 検出
難易度: Easy
コスト: Medium

深層学習Transformer分類検出セグメンテーション

Seg2Grasp: A Robust Modular Suction Grasping in Bin Picking

採掘ロボットの性能向上を目指したSeg2Graspを構築し、セグメンテーション、グレイシング、クラスフィルタリングの3つのモジュールで構成されます。セグメンテーションモジュールではTransformerを利用したオブジェ

用途: 採掘ロボットがオブジェクトを取り上げる能力の向上
難易度: Hard
コスト: Low

SLAM in Low-Light Environments: Project Report

この論文では、低照明状況のためのSLAM実現を目標とし、LiDAR、深さ、または熱センサなどの補助的なセンサを取り入れることでSLAMを改良します。

センサ/時系列機械学習時系列検出3D

用途: 低照明状況のためのSLAM実現
難易度: Hard
コスト: High

MI向きセンサ/時系列強化学習マルチエージェント異常検知

Compositional Semantic Communication for Physical AI: Category Theory Meets Game Theory

Physical artificial intelligence (AI) systems involve distributed sensing agents with embedded AI models that

用途: 異常検知
難易度: Hard
コスト: Medium

huggingfaceGitHubありHugging Faceあり2026-07-20

Differentiable Logic Gate Networks for Low-Latency EEG Classification on Edge Devices

Real-time EEG classification on edge devices is bottlenecked by the floating-point arithmetic of conventional

CPUで試しやすい強化学習マルチエージェント分類検出

用途: 分類
難易度: Easy
コスト: Low

huggingfaceHugging Faceあり2026-07-20

FlowMimic: Mask-free Visual Editing and Generation with Pixel-pair Warped Flow Field for Online Video Editing Data Generation and Modality Mimicry

In line with the prevailing direction of vision research, we explore the integration of both generation and ed

品質予測/異常検知自然言語処理大規模言語モデル検出生成セグメンテーション

用途: 検出
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-20

Token-Level Off-Policy Learning for Faithful Generation Under Distribution Shift

We propose Token-Level Off-Policy Labeling (TOPL), an off-policy training paradigm that reframes post-training

説明可能自然言語処理ファインチューニング分類生成異常検知

用途: 分類
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-20

Self-State Attacks on Self-Hosted AI Agents: How Far Can OS Defenses Go?

Self-hosted AI agents read and write their own memory and configuration files to function. An agent may get co

用途: 検出
難易度: Easy
コスト: Medium

arxivPaper only2026-07-19

Kernelized Linear Attention: Breaking the Capacity Wall with Symmetric Cones

Linear attention promises constant-time recurrent inference but degrades sharply on associative recall. We for

深層学習RNN / LSTM異常検知

用途: 異常検知
難易度: Hard
コスト: High

arxivPaper only2026-07-19

DeeperRadar: End-to-End MIMO Radar Design and Multi-Modal Fusion for Autonomous Vehicle Perception

DeeperRadar is a radar-centric, sensor-stack-conditioned framework that co-designs radar sensing and multi-mod

センサ/時系列コンピュータビジョンセグメンテーション検出画像3D

用途: 検出
難易度: Hard
コスト: High

huggingfaceHugging Faceあり2026-07-19

TimeLens2: Generalist Video Temporal Grounding with Multimodal LLMs

Video multimodal large language models (MLLMs) can describe what happens in a video, but rarely identify when

自然言語処理大規模言語モデル検出テキスト動画

用途: 検出
難易度: Easy
コスト: High

A BIM-enabled, Agent-based Discrete-event Simulation Platform for Robotic Studies: A Method based on Graph Theory

Indoor robots are increasingly employed for facility management tasks such as cleaning and inspection. These a

品質予測/異常検知深層学習軽量化・量子化検出

用途: 検出
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション分類検出3D

InLiER: Learning-Free Heterogeneous LiDAR Place Recognition via Intermediate Mixed-Radix Structural Keypoint Tokenization

LiDAR place recognition supports loop closure, relocalization, and multi-agent map management. As robotic plat

用途: 分類
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン物体検出検出画像

Hybrid Machine Learning for Articulation Angle Estimation of Truck-Semitrailer Combinations

Accurate articulation angle estimation of trucks with trailers is critical for autonomous driving and advanced

用途: 検出
難易度: Hard
コスト: Medium

An Indoor Navigation System for the Visually Impaired based on UWB Positioning and D* Lite Path Planning Algorithm

This paper proposes an indoor navigation system for the visually impaired, leveraging Ultra-Wideband (UWB) pos

自然言語処理RAG検出画像

用途: 検出
難易度: Hard
コスト: Low

自然言語処理プロンプトエンジニアリング分類検出テキスト

Hazard or Anomaly? Evaluating VLMs for Understanding Dangers and Discrepancies

Modern safety-critical systems increasingly rely on human-robot interaction to reduce disaster risk and suppor

用途: 分類
難易度: Hard
コスト: High

コンピュータビジョンマルチモーダル検出画像テキスト

Autonomous VR-Based Risk Detection for Situational Awareness in Dangerous Settings

In high-risk environments such as disaster response, situational awareness depends not only on detecting hazar

用途: 検出
難易度: Hard
コスト: High

An Efficient Likelihood Ratio Test for Online Changepoint Detection in the Presence of Autocorrelation

オフラインのデータ流れの中で、時間序列のデータに基づいてデータ変化を検知することができる方法が必要。この問題を解決するために、オンラインのデータ流れで検知した変更点をオフラインのデータ流れに適用する方法を提案。

用途: オフラインの変更点検知
難易度: Hard
コスト: Low

ASK-NN: An Asymmetric Nearest-Neighbor Test that detects Distribution Drifts in Natural Language

Hallucinations and artificial text in LLM-generated outputs often appear as distributional deviations between

自然言語処理大規模言語モデル検出テキスト

用途: 検出
難易度: Hard
コスト: High

Transient State Reorganization and Cell Differentiation in the Developmental Dynamics of Growing Neural Cellular Automata

Neural Cellular Automataが複雑な形状を形成するプロセスを研究しました。

コンピュータビジョン動画認識検出

用途: 画像認識
難易度: Hard
コスト: High

VTLoc: Learning-based Tactile Contact Localization in Visual Point Clouds

VTLocフレームワークは、視覚情報と触覚情報を統合し、ロボットハンドの位置を推定することで、ロボットハンドの位置推定と動作操作を実現します。

コンピュータビジョン3D・点群検出画像テキスト

用途: ロボットハンドの位置推定
難易度: Hard
コスト: High

Data and Learning Where it Matters for Contact-Rich Manipulation

この研究では、接触の豊富なマニピュレーションを実現するための、データの収集と学習を改良した方法を提案し、ロボットの制御の精度を

自然言語処理RAG異常検知強化学習

用途: 接触の豊富なマニピュレーションのためのデータ収集と学習
難易度: Hard
コスト: Low

少数データ向き条件最適化自然言語処理RAG検出画像

Embodied Active Learning under Limited Annotation and Navigation Budget for Object Detection

この研究では、ロボットのナビゲーション時間と注釈時間の制約を考慮したオブジェクト検出フレームワークを提案します。

用途: オブジェクト検出を適応化
難易度: Hard
コスト: Low

Prediction-Only Distillation in Linear and Logistic Regression

distillationにおける予測のみを扱う学習アプローチを提案し、それをテストした。

深層学習軽量化・量子化分類回帰異常検知

用途: distillationにおける予測のみ
難易度: Hard
コスト: High

Delocalization of bias in unadjusted Hamiltonian Monte Carlo and underdamped Langevin

この研究では、調整されていない HMC と Langevin Sampler の偏りの解消について議論しました。調整されていないサンプラーは、通常、偏りのあるものであることが知られています。この研究

コンピュータビジョンセグメンテーション検出

用途: HMC と Langevin Sampler の偏りの解消
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション検出時系列

Post Hoc Inference for Component Attribution in Multivariate Change-Point Detection

時系列データを観察し、その中に分割が起こっているかどうかを検知する方法はある。時間系列変化点を検知することができ、変化点の位置がどこにあるかを推定することができる。

用途: 時系列データの分割探知
難易度: Hard
コスト: Low

MI向き品質予測/異常検知深層学習Transformer分類検出

Toward Energy-Efficient and Low-Power Arrhythmia Detection for Wearable Devices

この研究では、ウェアラブルデバイスで電気生理学記録（ECG）を分析するために使用される深層学習アルゴリズムを開発することを目的としています。このアルゴリズムは、エネルギー効率が高く、小型化が可能であるため、心臓の病気の検

用途: 心臓の病気の検出
難易度: Hard
コスト: Low

huggingfaceHugging Faceあり2026-07-16

Trajectory-aware Cross-view Geo-localization with Sequential Observations

Cross-view geo-localization matches ground-level observations against geo-tagged satellite imagery. Recent met

品質予測/異常検知深層学習軽量化・量子化検出画像テキスト

用途: 検出
難易度: Easy
コスト: High

arxivPaper only2026-07-15

PiVoT: A Variational Solution for Real-time Large-scale Multi-object Detection and Tracking under Heavy Clutter

難しい環境でマルチオブジェクトの検知と追跡が可能なPiVoTを開発、実用的なソリューションを提案した。

深層学習軽量化・量子化検出画像3D

用途: マルチオブジェクトの検知と追跡
難易度: Hard
コスト: High

arxivPaper only2026-07-14

MixCIT: A Kernel Based Local-Polynomial Debiased Test for Conditional Independence on Mixed-Type Data

多種多様なデータに対して条件的独立性の検定を行う方法です。混合タイプデータに対して統一的な、効率的な、あるいは統計的に有効な解決策は存在しませんでしたが、グラフ上のノード間の距離を比较する方法を提案しています。

用途: 複合データの条件的独立性検定
難易度: Hard
コスト: Medium

arxivPaper only2026-07-14

Statistical Properties and Power Analysis of Divergence Measures for Credit Risk Model Monitoring

金融データの分布の変化を検出し、信用リスクモデルの監視を行う方法です。Jensen-Shannon-DivergenceやKullback-Leibler-Divergenceなどの分散量は異なる種類の変化を検出できます

用途: 信用リスクモデルの監視
難易度: Hard
コスト: Medium

arxivPaper only2026-07-14

Stability Buys Time: A Re-Keying Game for Encrypted Multi-Agent Control

暗号化された制御システムでは、クラウドがホモモルフィック暗号化された状態を操作し、動物達の動作をプライバシーで管理することができる。安全を確保するために、サイドチャネル攻撃のリスクを考慮しながら、制御機器が信頼できると仮

用途: 暗号化された制御
難易度: Hard
コスト: Low

Disentangling Forced and Internal Climate Variability in Single Realizations using Dynamic Mode Decomposition with Control

We show that a single climate realization can be decomposed into forced and internal components by treating ex

説明可能強化学習モデルベース検出回帰

用途: 検出
難易度: Hard
コスト: Medium

Bet on Features: Anytime-Valid and Feature-Aware Auditing of Conditional Quantile Forecasters

分類Forecasterの監視と評価を容易にするために、可変期間のバックテストを導入し、情報依存性を考慮したCalibrationを可能にしました。

説明可能センサ/時系列機械学習時系列検出テキスト

用途: 分類Forecasterの監視と評価
難易度: Hard
コスト: Low

品質予測/異常検知強化学習方策勾配 (PPO / A3C)検出

Removable Defects: The Economics and Limits of Deliberate Deficiency

A specialist tolerates blind spots that a generalist does not. Usually this is treated as a cost to be minimiz

用途: 検出
難易度: Hard
コスト: High

表形式向き品質予測/異常検知深層学習Transformer検出表形式強化学習

Transformer-Guided Swarm Intelligence for Frugal Neural Architecture Search

この研究では、従来のNAS方法のコストを抑えるための方法を開発します。この方法では、NASをトランスフォーマーを使用して実行します。

用途: NAS (Neural Architecture Search) のコストを抑えるための方法を開発
難易度: Hard
コスト: Low

Efficient and Robust Spiking Neural Networks for sEMG-Based Muscle Fatigue Detection

Detecting muscle fatigue via surface electromyography (sEMG) is essential for applications in sports, rehabili

用途: 検出
難易度: Hard
コスト: High

huggingfaceGitHubありHugging Faceあり2026-07-13

RAGU: A Multi-Step GraphRAG Engine with a Compact Domain-Adapted LLM

Graph retrieval-augmented generation (GraphRAG) enhances large language models with structured knowledge, yet

自然言語処理大規模言語モデル検出生成要約

用途: 検出
難易度: Easy
コスト: High

arxivPaper only2026-07-12

Did We Actually Fix It? An Independent Adversarial Stress-Test of Post-Point-Adjustment Evaluation Metrics for Time-Series Anomaly Detection

Point-adjustment (PA), for years the default scoring protocol in time-series anomaly detection (TSAD), was sho

センサ/時系列品質予測/異常検知自然言語処理RAG検出異常検知時系列

用途: 検出
難易度: Hard
コスト: Low

arxivPaper only2026-07-11

Emergent Generalization by Representation Learning in Artificial Neural Networks

Dimensionality reduction has proven powerful for identifying neural manifolds, which are low-dimensional struc

説明可能センサ/時系列深層学習Transformer異常検知埋め込み時系列

用途: 異常検知
難易度: Hard
コスト: Low

huggingfaceHugging Faceあり2026-07-10

REBASE: Reference-Background Subspace Elimination for Training-Free In-Context Segmentation

Training-free in-context segmentation enables new object categories to be introduced at inference time from a

品質予測/異常検知自然言語処理プロンプトエンジニアリング検出セグメンテーション画像

用途: 検出
難易度: Easy
コスト: High

深層学習Transformer分類検出セグメンテーション

githubGitHubあり2026-07-10

pytorch-grad-cam — Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

このライブラリは、コンピュータービジョンのための高度なAI解釈と可視化ソリューションです。このライブラリは、CNN、ビジョントランスフォーム、分類、物体検出、分割、画像類似度など、さまざまなコンピュータービジョンの

用途: AIの解釈と可視化ソリューション
難易度: Easy
コスト: Low

arxivPaper only2026-07-07

6G Sensing Security: Distributed Game-Theoretic RL for Urban Beamforming and Attacker Detection

Next-generation wireless networksにおける分散型ゲーム理論を用いた6Gのセキュリティを研究します。分散型ゲーム理論は、6Gの通信システムが環境の認識とデータの伝送両方を実現するために必要な

センサ/時系列深層学習軽量化・量子化検出生成強化学習

用途: 6Gにおける分散型ゲーム理論
難易度: Hard
コスト: Medium

arxivPaper only2026-07-06

An event-driven framework for fly-inspired visual motion detection

イベントベースセンシングの活用と生物学的インスピレーションを利用した障害物検出を実現するために、飛行経路を用いた新しいアプローチが提案される。このアプローチは、イベントベースセンシングの活用と生物学的インスピレーションを

説明可能センサ/時系列深層学習Transformer検出画像

用途: イベントベースセンシングと飛行経路を用いた動的環境での障害物検出
難易度: Hard
コスト: High

huggingfaceGitHubありHugging Faceあり2026-07-05

Benchmarking Sensor Robustness in Plasma Diagnostic Models: A Systematic Evaluation on TokaMark

Plasma diagnostic models for tokamak fusion devices are almost universally evaluated on clean, complete sensor

表形式向きCPUで試しやすいセンサ/時系列深層学習Transformer検出

用途: 検出
難易度: Easy
コスト: Medium

arxivPaper only2026-07-02

Predicting Early Stages Of Alzheimer's Disease And Identifying Key Biomarkers Using Deep Artificial Neural Network And Ensemble Of Machine Learning Methodologies

この研究では、アルツハイマー病の前期診断と生物学的マーカーの検出にAI技術を適用します。AIモデルをトレーニングするために、電気エイセフィログラム（EEG）データを使用し、精度を高めます。また、AIモデルが得た情報を分析

表形式向きCPUで試しやすい深層学習Transformer分類検出回帰

用途: アルツハイマー病の前期診断と生物学的マーカーの検出
難易度: Hard
コスト: High

arxivPaper only2026-07-01

BFF: Simple explanations for complex phenomena

「計算的生命」論文は、ペアが相互作用する複雑なシステムにおいて、自己複製体を容易に発見できることを示しました。ここでは、逆説的には、単純な遺伝子突然変異ウォークを用いた自己複製体の検出に新しいアプローチを提案し、この方法

深層学習検出

用途: 自己複製体の検出
難易度: Hard
コスト: Medium

arxivPaper only2026-06-30

Distributed Hierarchical Temporal Memory with Shared Associative Memory for Cross-Entity Preemptive Warning

分散型時間関数記憶体を用いた異常検知システムを開発しました。このシステムは、関連のあるエンティティの予兆行動を共有メモリ空間に保存し、異常検知に役立ちます。このシステムは、異常検知に役立つ新しい方法を提供します。

センサ/時系列品質予測/異常検知自然言語処理RAG検出生成異常検知

用途: 分散型時間関数記憶体を用いた異常検知
難易度: Hard
コスト: Low

arxivPaper only2026-06-27

LLM Semantic Signaling Game and Mechanism Design: Systematic Blindness, Awareness Shaping, and Mindset Dynamics

Large language models (LLMs) increasingly mediate strategic interactions through natural language, making sema

自然言語処理大規模言語モデル検出テキスト

用途: 検出
難易度: Hard
コスト: High

githubGitHubあり2026-06-25

face_recognition — The world's simplest facial recognition api for Python and the command line

facial_recognitionライブラリはPythonとコマンドラインでface_recognition APIを提供します。ライブラリはOpenCVのdlibライブラリを利用し、顔認識を単純に扱います。

機械学習教師あり学習分類検出

用途: 面貌認識システムを構築する
難易度: Easy
コスト: Low

arxivPaper only2026-06-20

Learning a Normal World Model for Few-Shot Boundary-Calibrated Abnormality Detection

Abnormality detection in complex systems faces two practical barriers: abnormal labels are scarce, and binary

少数データ向きセンサ/時系列自然言語処理プロンプトエンジニアリング検出テキスト

用途: 検出
難易度: Hard
コスト: Medium

arxivPaper only2026-06-19

Gradient-Free Warm-Start Library Recovery: an Amortized-Regret Separation

Continual learning that is gradient-free, local, online, and append-only is attractive for edge and streaming

コンピュータビジョンセグメンテーション分類検出

用途: 分類
難易度: Hard
コスト: Low

arxivPaper only2026-06-17

FPGA-Accelerated Neuromorphic Vision System for Real-Time Orbital Object Detection

The escalating congestion in orbital space demands advanced monitoring solutions. This work presents a compreh

用途: 検出
難易度: Hard
コスト: Medium

arxivPaper only2026-06-16

A Neuromorphic Trigger for Efficient Audio Event Detection

Efficient processing of continuous audio streams remains a key challenge for real-time and resource-constraine

深層学習軽量化・量子化分類検出音声

用途: オーディオイベント検出の
難易度: Hard
コスト: Low

arxivPaper only2026-06-13

Controlled Dynamics Attractor Transformer

この研究では、Controlled Dynamics Attractor Transformer (CDAT)を提案しました。このTransformerは、Self-Attention MechanismとAssocia

説明可能品質予測/異常検知深層学習Transformer分類検出異常検知

用途: Controlled Dynamics Attractor Transformer (CDAT)を提案すること。
難易度: Hard
コスト: Low