MLinfo | 機械学習・AI論文まとめ

コンピュータビジョン物体検出分類検出セグメンテーション

ultralytics — Ultralytics YOLO 🚀

ultralyticsはYOLO(You Only Look Once)の技術を使用したオブジェクト検出ライブラリで、高い精度を提供している。

用途: オブジェクト検出
難易度: Easy
コスト: Low

コンピュータビジョン物体検出分類検出セグメンテーション

supervision — We write your reusable computer vision tools. 💜

supervisionは、機械学習技術を活用して、ユーザー独自のコンピュータビジョンツールを作成することができる。

用途: オリジナルコンピュータビジョンツール
難易度: Easy
コスト: High

コンピュータビジョン物体検出分類セグメンテーション画像

label-studio — Label Studio is a multi-type data labeling and annotation tool with standardized output format

データラベル化と注釈化を行うためのツールです。

用途: データラベル化ツール
難易度: Easy
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション分類検出画像

cvat — Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.

CVATは、機械学習用の業界標準のデータエンジンです。さまざまなスケールのチームが使用し、さまざまなスケールのデータに対応しています。

用途: データのラベル付けと管理
難易度: Easy
コスト: High

コンピュータビジョンセグメンテーション分類画像動画

labelme — Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.

イメージを注釈するツール。ポリゴン、長方形、円、線、点などを注釈することができる。

用途: イメージ注釈
難易度: Easy
コスト: High

stanza — Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

stanzaは、さまざまな言語を処理するための言語処理用ライブラリです。

コンピュータビジョンセグメンテーション分類

用途: 言語処理用ライブラリを提供する
難易度: Easy
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション生成画像テキスト

Echo-Memory: A Controlled Study of Memory in Action World Models

この研究では、エピソード記憶を制御するために、エピソード記憶モデルを設計および評価しました。エピソード記憶モデルは、エピソード内の重要な情報を記憶し、エピソード間の相関関係を特定することができます。

用途: エピソード記憶
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション生成教師あり教師なし

Disentanglement with Holographic Reduced Representations

画像分割を目的としたDeep learningモデルを提案した論文です。Deep learningモデルが画像を構成するオブジェクトに適切に分割できるようにするために、画像を分割したときの画像の特徴量を用いて学習します。

用途: 画像分割
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション教師なし

A Unifying Framework for Concept-Based Representational Similarity

Learned representations across models and modalities often exhibit striking structural similarities, suggestin

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Safe-RULE: Safe Reinforcement UnLEarning

安全な強化学習のためのデータの削除を提案。データポイズニング攻撃からデータを保護するために、データを削除する方法を提案した。

コンピュータビジョンセグメンテーション強化学習

用途: 安全な強化学習のためのデータの削除
難易度: Hard
コスト: High

Machine-Learning Emulation of Satellite Greenhouse Gas Retrievals: Stability over Time

Retrieval algorithms are used to estimate atmospheric concentrations of greenhouse gases (GHGs), such as carbo

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

説明可能品質予測/異常検知深層学習Transformer分類セグメンテーションテキスト

Intention Driven Identification of In-Possession Match Phases in Association Football through Temporal Graph Learning

Understanding tactical organisation of association football, hereafter referred to as football, requires ident

用途: 分類
難易度: Hard
コスト: Low

ERBench: A Benchmark and Testsuite for Equation Discovery Algorithms

方程式生成は、数値データから数学的方程式を生成することを目的としたものです。方程式生成を実現するためには、記号的回帰アルゴリズム（Symbolic Regression、SR）が使用されます。SRの実行のパフォーマンスは

用途: 方程式生成
難易度: Hard
コスト: High

表形式向きコンピュータビジョンセグメンテーション生成表形式

BSTabDiff: Block-Subunit Diffusion Priors for High-Dimensional Tabular Data Generation

高次元表形式データでは、数値サンプル（n）が特徴数（m）を上回ることが多いです。つまりこれらのドメインでは、$\mathbb{R}^m$ で直接密度関数を表現することは非実際である。私たちは、BSTabDiff：ブロック

用途: 高次元表形式データの生成
難易度: Hard
コスト: High

Asymptotic Optimality of Thompson Sampling for Risk-Averse Bandits with Sub-Gaussian Rewards

これは、不確実性やリスクを減らすために、$\rho$-NPTS (Nonparametric Thompson Sampling) というアレイフリーの非パラメトリックベースのThompson Samplingで、リスク

用途: リスク厳格なマルチ腕バンディットの最適化
難易度: Hard
コスト: Medium

説明可能コンピュータビジョンセグメンテーション生成

RAM: Reachability Across Morphologies

Many stages of the robotic lifecycle, from morphology synthesis to operation, rely fundamentally on the reacha

用途: 生成
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション動画マルチモーダル

C$^3$ache: Accelerating World Action Models with Cross Inference Chunk Cache

ワールドアクションモデルを高速化するために、情報のキャッシュと伝達を提案します。

用途: ワールドアクションモデルを高速化するためのキャッシュと伝達
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション生成

Synthetic but Not Realistic: The Evaluation Challenge in Generative Modelling for Structured Electronic Medical Records

Synthetic healthcare data are widely proposed as privacy-preserving substitutes for real patient data, yet the

用途: 生成
難易度: Hard
コスト: High

説明可能コンピュータビジョンセグメンテーションテキスト

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

この論文では、AI エヴァルレーション結果をより効果的に解釈するために、新しいフレームワークを提案する。

用途: AI エヴァルレーション結果の解釈
難易度: Hard
コスト: Medium

品質予測/異常検知自然言語処理RAG検出セグメンテーション異常検知

Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

Recent Anomaly Detection methods achieve perfect detection and segmentation scores on well-established dataset

用途: ア
難易度: Hard
コスト: High

Frequency-based Constrained Sampling for Interval Patterns

Output space pattern sampling is a powerful alternative to exhaustive pattern mining for exploring large patte

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション生成テキスト

The Token Not Taken: Sampling, State, and the Variability of AI Agent Outputs

Agentic AIシステムの不確実性が、同じ要求から異なる計画、ツールの呼び出しなどが生成されることを示唆している。このようにしてシステムの信頼性を確保するには、AIエージェントのパラメータを確立することが重要となる。

用途: AIエージェントのパラメータの確立に寄与する
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知深層学習Transformer検出生成セグメンテーション

PolyBuild: An End-to-End Method for Polygonal Building Contour Extraction from High-Resolution Remote Sensing Images

Extracting building polygon contours from high-resolution remote sensing images is a fundamental task for vari

用途: 検出
難易度: Hard
コスト: Low

条件最適化コンピュータビジョンセグメンテーションテキスト

Quantitative Performance Analysis of Stopping Criteria for CMA-ES

この研究では、CMA-ESアルゴリズムの停止条件を評価します。この研究では、CMA-ESアルゴリズムの停止条件が機能するかどうかを調べ、アルゴリズムを改良するための情報を提供します。

用途: 最適化アルゴリズムの評価
難易度: Hard
コスト: Medium

Causally Evaluating the Learnability of Formal Language Tasks

この研究では、形式言語の学習性を評価するための方法を開発します。この方法は、形式言語の学習性がどれだけのデータを必要とするかを評価することができます。

用途: 形式言語の学習性評価
難易度: Hard
コスト: High

When Built-in Thinking Helps and Hurts: Constraint-Level Error Shifts in Instruction Following

この研究では、指示のフォローにおける思考の役割を評価します。この研究では、指示のフォローにおける思考の役割がどれだけの影響を与えるかを調べ、指示のフォローを改良

用途: 指示のフォローにおける思考の役割
難易度: Hard
コスト: Medium

説明可能品質予測/異常検知コンピュータビジョンセグメンテーションマルチモーダル

Interpretable Crisis Behavior Analysis Using Mobility and Social Media Data

人間は危機時に移動パターンやメディアの投稿のパターンが変化し、分析が難しいようになった。この研究では、運動データやメディアデータの統合を用いて危機時の行動パターンを分析し、危機の状況における行動を予測した。

用途: クライシス時の行動分析
難易度: Hard
コスト: High

品質予測/異常検知自然言語処理大規模言語モデル分類セグメンテーションテキスト

arxivGitHubあり2026-06-08

MUDIDI: A Two-Stage Framework for Multilingual Dictionary Digitization with Language Models

この研究では、低リソース言語や絶滅言語の辞書のデジタル化が重要であるが、マルチモーダル辞書をデジタル化する方法は今まで難しかったが、この研究では、最近のビジョン言語モデルを用いて辞書のデジタル化が容易になり、辞書内の文字

用途: ムルティリンガル辞書のデジタル化
難易度: Hard
コスト: High

End-to-End Optimization of Incoherent Imaging for Classification Under Detector-Limited Readout

End-to-end co-optimization of optical front-ends (e.g. metasurfaces) and neural network back-ends has been wid

コンピュータビジョンセグメンテーション分類検出

用途: 分類
難易度: Hard
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション生成画像テキスト

Cranio-Diff: Diffusion-based Cross-domain Craniofacial Reconstruction with 2D X-ray Skull Guidance and Structural Identity Constraints

The state-of-the-art generative models, such as CycleGAN, Pix2Pix, and diffusion models have demonstrated rema

用途: 生成
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知コンピュータビジョンセグメンテーション生成画像

arxivGitHubあり2026-06-08

TUDSR: Twice Upsampling-Diffusion for Higher Super-Resolution

Diffusion-based generative models have achieved remarkable success in real-world image super-resolution (SR).

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習正規化・最適化手法分類検出セグメンテーション

Adversarial Attack and Disturbance Detection by Hadamard-Coded Output Representations for Object Detection and Semantic Segmentation

Conventional one-hot encodings often yield poorly calibrated models, being overconfident under attack, and let

用途: 分類
難易度: Hard
コスト: Low

コンピュータビジョンセグメンテーション生成画像動画

Prisma-World: Camera-Controllable Multi-Agent Video World Model

Video world models have made rapid progress in generating controllable visual experiences, but most of them st

用途: 生成
難易度: Hard
コスト: High

少数データ向き自然言語処理プロンプトエンジニアリング分類セグメンテーション画像

Training-Free Generalized Few-Shot Segmentation through Open-Vocabulary Semantic Arbitration

Generalized Few-Shot Semantic Segmentation (GFSS) has traditionally been approached as a representation-learni

用途: 分類
難易度: Hard
コスト: High

深層学習正規化・最適化手法分類生成セグメンテーション

Reason Twice: Segmentation via Candidate Discovery and Comparative Reasoning

The rapid development of pretrained foundation models has enabled more general image segmentation. Multimodal

用途: 分類
難易度: Hard
コスト: High

Zero-Parameter Geometric Gating for Temporally Stable Low-Altitude UAV Video Semantic Segmentation

Video semantic segmentation for low-altitude UAVs requires temporal consistency, yet dense optical flow introd

コンピュータビジョンセグメンテーション画像動画

用途: セグメンテーション
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成画像テキスト

OmniGen-AR: AutoRegressive Any-to-Image Generation

Autoregressive (AR) models have demonstrated strong potential in visual generation, offering superior performa

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション検出異常検知3D

Illumination-Invariant Anomaly Detection for Sub-Canopy UAV Multispectral Point Clouds

Unmanned Aerial Vehicle (UAV) multispectral point clouds (MPC) provide high-dimensional spatial-spectral data

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション異常検知テキストマルチモーダル

Scaling by Diversified Experience for Vision-Language-Action Models

Vision-Language-Action models face significant challenges in real-world deployment due to the entanglement of

用途: 異常検知
難易度: Hard
コスト: High

EPS3D: End-to-End Feed-Forward 3D Panoptic Segmentation

This paper introduces EPS3D, a new end-to-end feed-forward framework for open-vocabulary 3D panoptic segmentat

深層学習軽量化・量子化セグメンテーション画像3D

用途: セグメンテーション
難易度: Hard
コスト: High

深層学習軽量化・量子化セグメンテーションマルチモーダル

DifferSeg: Towards Diverse Multimodal Binary Segmentation via Differential Perception and Frequency Guidance

In many binary segmentation tasks, most multimodal methods rely on fixed feature concatenation for cross-modal

用途: セグメンテーション
難易度: Hard
コスト: High

SynManDex: Synthesizing Human-like Dexterous Grasps from Synthetic Human Pre-Grasps

Human hand-object interactions encode functional intent, but direct transfer to robotic hands often fails unde

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション検出3D

Safe Polytope-in-Polytope Motion Planning and Control with Control Barrier Functions

Autonomous mobile robots operating in tight environments require motion planning frameworks that account for t

用途: 検出
難易度: Hard
コスト: High

Deterministic Execution of ROS~2 Applications via Lingua Franca

The Robot Operating System~2 (ROS 2) is a widely used middleware for robotic systems, characterized by a publi

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

深層学習Transformerセグメンテーション画像

githubGitHubあり2026-06-08

segmentation_models.pytorch — Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

セマンティックシーケンス分割モデルのライブラリです。

用途: セマンティックシーケンス分割モデル
難易度: Easy
コスト: High

表形式向きコンピュータビジョンセグメンテーション生成表形式

Declarative Outcome-Conformant Synthesis: Exact, Closed-Form Specification Satisfaction and a Conformance Benchmark

We study a capability the dominant paradigm in synthetic tabular data does not provide: exact satisfaction of

用途: 生成
難易度: Hard
コスト: High

Discovering and decoding latent mean-field structure with variational autoencoders

Generative models are increasingly used to capture correlations in many-body systems, but the representations

コンピュータビジョンセグメンテーション生成

用途: 生成
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション回帰時系列

When Are Neural Interaction Discoveries Real? Identifiability, Recoverability, and a Pre-Fit Diagnostic

When a neural time-series model reports that one variable modulates another's effect on a target, is the disco

用途: 回帰
難易度: Hard
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション強化学習

Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning

Quality-diversity reinforcement learning (QD-RL) aims to construct policy repertoires that contain both high-p

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

A Joint Finite-Sample Certificate for Adaptive Selective Conformal Risk Control

Selective predictors answer on confident inputs and abstain elsewhere; deploying one safely needs a single fin

深層学習CNNセグメンテーション画像

用途: セグメンテーション
難易度: Hard
コスト: Medium

センサ/時系列深層学習Transformer分類検出生成

TRADE: Transducer-Augmented Decoder for Speech LLM

Speech Large Language Models (Speech LLMs) lack a principled mechanism for streaming inference: their label-sy

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーションテキスト

More Yap Less Meaning: Uncovering Self-Improvement Behavior in SLMs

Recently, language models have made rapid progress across various domains and applications. However, their cap

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Generalizing Geometry-Guided Mamba as a Plug-and-Play Context Module for CNN-based Semantic Segmentation

CNN-based semantic segmentation networks usually rely on context heads such as ASPP, PPM, or attention modules

深層学習CNNセグメンテーションテキスト

用途: セグメンテーション
難易度: Hard
コスト: Medium

品質予測/異常検知自然言語処理RAG分類検出セグメンテーション

PairWise Image Finder: An Open-source Tool for Finding Visually Aligned Street-Level Image Pairs for Urban Perception Studies

Change detection and scene recognition techniques have been widely applied to Street View Imagery (SVI) to und

用途: 分類
難易度: Hard
コスト: Low

品質予測/異常検知コンピュータビジョンセグメンテーション画像3D

Less Is More: Training-Free Acceleration Framework of 3D Diffusion Models for Low-Count PET Denoising via Global-Local Trajectory Reduction

Accurate quantification and uptake measurement in PET are critical for assessing disease progression and suppo

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション検出画像教師なし

arxivGitHubあり2026-06-07

AUCp: Pseudo-AUC for Inference Model Selection with Unlabeled Validation Data in Abnormality Detection

Abnormality detection is a crucial yet challenging task in medical image analysis. Distinguishing abnormalitie

用途: 検出
難易度: Hard
コスト: High

Shift-Dependent Asymmetry: Orthogonal Inverse Low-Rank Adaptation for Federated Medical Segmentation

Low-Rank Adaptation (LoRA) enables efficient federated fine-tuning of segmentation foundation models for medic

深層学習軽量化・量子化セグメンテーション

用途: セグメンテーション
難易度: Hard
コスト: Medium

MI向きコンピュータビジョンセグメンテーション画像3D

PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning

To perform a wide range of daily tasks, robots need to construct a 3D representation that is semantically rich

用途: セグメンテーション
難易度: Hard
コスト: High

センサ/時系列深層学習Transformer検出セグメンテーション異常検知

NGram-MoSE: Efficient Remote Sensing Super-Resolution via N-Gram Context and Mixture-of-Experts

Remote sensing applications for environmental monitoring and disaster management are frequently constrained by

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成予測画像

EgoPriMo: Egocentric Motion Generation for Interactive Humanoid Control

Humanoid robots require whole-body motions that adapt to scene context, task requirements, and user intent. Mo

用途: 生成
難易度: Hard
コスト: High

表形式向きコンピュータビジョンセグメンテーション生成画像テキスト

arxivGitHubあり2026-06-07

Segmentation-Assisted Brain MRI Synthesis with Cross-Image Multi-Contrast Feature Memory Bank Retrieval Augmentation

Multi-contrast brain MRI provide complementary soft-tissue characteristics that aid in the screening and diagn

用途: 生成
難易度: Easy
コスト: Low

CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs

Vision-language models (VLMs) pretrained on large-scale image-text pairs demonstrate strong image-level unders

深層学習CNN検出生成セグメンテーション

用途: 検出
難易度: Hard
コスト: High

MI向き自然言語処理RAG生成セグメンテーション画像

SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object rela

用途: 生成
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成マルチモーダル強化学習

Guided Discovery of New Behaviors using Diffusion Policies

Diffusion models have become a powerful tool for generative modeling in robotics, with diffusion policies exce

用途: 生成
難易度: Hard
コスト: High

Assessing model calibration with boosting trees

The main goal in regression modelling consists in approximating the conditional mean of a response given a set

用途: 回帰
難易度: Hard
コスト: Medium

Identifiability and Estimation for Unlabeled Finite Mixtures under Marginal Independence

We study component recovery and mixing-matrix estimation from unlabeled finite mixtures whose observable distr

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Arabic Sentence Segmentation Across Genres and Punctuation Conditions

Sentence segmentation in Arabic is challenging due to ambiguous and inconsistent punctuation, with many texts

深層学習軽量化・量子化セグメンテーションテキスト

用途: セグメンテーション
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション分類画像3D

MS-COOT: Comparing Morse-Smale Complexes with Co-Optimal Transport

Understanding and comparing structures in scalar fields is a central challenge in scientific visualization, wi

用途: 分類
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成マルチモーダル

Test-Time Scaling in Multimodal Foundation Models: A Comprehensive Survey of Generation and Reasoning

Test-time Scaling (TTS) has emerged as a pivotal research direction for enhancing model performance by dynamic

用途: 生成
難易度: Hard
コスト: High

センサ/時系列深層学習Transformer検出セグメンテーション3D

SegmentAnyTreeV2: Scaling Transformer-Based Tree Instance Segmentation Across Sensors, Platforms, and Forests

We present SegmentAnyTreeV2, a sensor- and platform-agnostic framework for semantic and instance segmentation

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成埋め込み画像

Neural Field Tokenizations with Hierarchy and Spatial Locality Priors

Neural fields parameterize data as functions from coordinates to values, providing a unified framework for rep

用途: 生成
難易度: Hard
コスト: High

深層学習Transformer分類セグメンテーション回帰

arxivGitHubあり2026-06-06

How Much MRI Preprocessing Is Enough? A Cost-Utility Study for Brain MRI Foundation Models

MRI preprocessing defines the input distribution seen by brain MRI foundation models, yet it is usually treate

用途: 分類
難易度: Hard
コスト: High

深層学習Transformerセグメンテーション画像

Phase Marginalization for Patch-Grid Instability in Vision Transformers

Vision Transformers operate on fixed patch grids, which can introduce phase-dependent instability for dense pr

用途: セグメンテーション
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション分類画像テキスト

One Stone, Three Birds: Self-adaptive Optimal Transport for Multi-VLM Selection, Adaptation, and Ensembling

Vision-language models (VLMs) enable visual recognition from semantic class descriptions, which makes them att

用途: 分類
難易度: Hard
コスト: High

深層学習Transformer検出セグメンテーションテキスト

OmniFaceRig: Fully Automatic Inner-Mouth-Aware Face Rigging Across Diverse 3D Character Topologies

Facial rigging - creating FACS-based blendshapes together with inner-mouth geometry (teeth, gums, and tongue)

用途: 検出
難易度: Hard
コスト: High

Uncertainty-Aware Intention Prediction for Human-to-Robot Assembly Teleoperation

In assisted teleoperation for human-robot collaboration, accurate intention prediction is critical for enablin

自然言語処理RAG分類検出セグメンテーション

用途: 分類
難易度: Hard
コスト: High

MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

Robotic simulators are a cornerstone of modern research in aerial robotics, serving both as a vehicle for the

自然言語処理RAGセグメンテーション強化学習

用途: セグメンテーション
難易度: Hard
コスト: High

Entanglement in the Quantum Volunteer's Dilemma

A well-known model in game theory, the Volunteer's Dilemma describes a group of $n$ players who decide whether

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Empirical Transfer Operators and Finite-Sample Change Detection for Noisy Expanding Interval Maps

We study finite-sample change detection for one-dimensional noisy dynamical systems using partition-based empi

コンピュータビジョンセグメンテーション検出

用途: 検出
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション生成回帰テキスト

TBD-VLA: Temporal Block Diffusion Vision Language Action Model

Discrete Vision-Language-Action (VLA) models typically formulate action generation as next-token prediction ov

用途: 生成
難易度: Hard
コスト: High

Rapid co-design of Buoyancy-assisted robots for Challenging Locomotion using Gaussian Evolutionary Specialists

この論文では、水上ロボットの設計の高速化のための新しい方法を提案した。Gaussian Evolutionary Specialists（GES）を用いた設計システムを用い、ロボットの形状と制御を同時に最適化することがで

コンピュータビジョンセグメンテーション強化学習

用途: 水上ロボットの設計の高速化
難易度: Hard
コスト: High

Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering

この論文では、Reinforcement Learning（RL）ポリシーを用いて安全ロボット制御を実現した。Shield-Locoは、安全な制御を提供するための予防的安全フィルタリングを実装し、ロボットの安全な行動を導

コンピュータビジョンセグメンテーション強化学習

用途: 安全ロボット制御
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション

Compliance-Based Sensor Placement for Force Sensing on a Sensorized Prostate Phantom

This work presents a compliance-based sensor placement method for force sensing on a sensorized prostate phant

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Causal Atlases from Entropic Inference: Bayesian Networks beyond Optimal DAGs

Deep Learningモデルのパラメータの最適化には、テスト時パフォーマンスを最適化することが重要なステップで、しかし、従来のアルゴリズムでは、テスト時パフォーマンスを最適化することは困難である。Double Pre

用途: パラメータの最適化
難易度: Hard
コスト: Medium

少数データ向きコンピュータビジョンセグメンテーション

Function-Space Priors for Bayesian Neural ODEs with Application to Vessel Trajectory Prediction

自動識別装置（AIS）データから船舶の軌跡を予測するためのオードインアリティー方程式（ODE）にベイズ推論を用いたモデルを開発しました。このモデルは、軌跡の不確実性も同様に予測されるため、安心して判断・意思決定が可能にな

用途: 船舶の軌跡予測
難易度: Hard
コスト: Medium

DAS-PINNs for high-dimensional partial differential equations: extending deep adaptive sampling to spacetime domains

時計関係の高次元偏微分方程式の解には、空間上では局所化し、動的にも変化する解を見つけなければならないが、これは、物理的に導かれたニューラルネットワーク（PINNs）を用いることで解くことができる。ただし、PINNsの単純

用途: 物理的に導かれたニューラルネットワークを用いてパラメトリック偏微分方程式を解く
難易度: Hard
コスト: Medium

説明可能コンピュータビジョンセグメンテーション生成埋め込みマルチモーダル

Discrete Causal Representations from Heterogeneous Domains: A Bayesian Approach with Social Survey Applications

この研究では、複数のドメインの複雑なデータを分析するために、Bayesian モデルを使用して因果関係を分析するツールを開発します。主に社会調査に使用できるツールです。

用途: 複数のドメインの因果関係を分析するツールを開発
難易度: Hard
コスト: High

Diffusion Models Observe Only Gradients: A Geometric Perspective on Score Matching Errors

本論文では、ディフュージョンモデルの解釈方法を提唱します。この方法は、ディフュージョンモデルによる標的分布の解釈を可能にすることを目的としており、モデルが標的分布に近づく速度を正確に評価し、解釈可能な結果を得ることができ

用途: ディフュージョンモデルの正確性と解釈
難易度: Hard
コスト: High

Conformal Risk-Averse Decision Making with Action Conditional Guarantee

この研究では、安全な決定を取るための方法を提案します。機械学習モデルを使った安全な決定は、不確実性の量化とUQメソッドが必要です。Conformal predictionは、予測結果を予

用途: 安全な決定を取るための方法
難易度: Hard
コスト: Medium

On the Hardness of Optimal Motion on Trees

This paper presents a simple framework that settles the complexity of Multi-Agent Path Finding (MAPF) on trees

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Waypoints Matter: A Systematic Study for Sampling-Based Trajectory Planning

この研究では、フェスタースター自動運転用の軌跡計画を改善するための新しい方法を提案します。 Waypoints Matter は、ロボットが目標地に向かって進むための最適なルートを決定します。

用途: フェスタースター自動運転用の軌跡計画
難易度: Hard
コスト: Medium

品質予測/異常検知自然言語処理RAGセグメンテーションテキスト動画

VOLT: Vision and Language Trajectory Segmentation for Faster-than-Demonstration Policies

この研究では、フェスタースター自動運

用途: フェスタースター自動運転用の高速動作
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出セグメンテーション3D

RadiusFPS: Efficient Farthest Point Sampling on CPUs and GPUs via Spherical Voxel Pruning

ポイントサンプリングを高速化する方法を開発しました。この方法は、ポイントサンプリングを高速化できます。

用途: ポイントサンプリングを高速化
難易度: Hard
コスト: High

T-FunS3D: Task-Driven Hierarchical Open-Vocabulary 3D Functionality Segmentation

Open-vocabulary 3D functionality segmentation enables robots to localize functional object components in 3D sc

自然言語処理RAG分類セグメンテーション画像

用途: 分類
難易度: Hard
コスト: High

The Economics of Proof-of-Useful-Work

Proof-of-work (PoW) blockchains rely on computational expenditure to secure a ledger supporting a native crypt

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Competing Auctions in Intermediated Markets

We analyze competing auctions in intermediated markets, where a seller selects among parallel mechanisms for t

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

DNQ: Deep Nash Q-Network for Partially Observable n-Player Games

DNQは、部分観測可能なnプレーヤーの競争ゲームを解決するためのフレームワークです。このフレームワークは、ゲームの最終結果を予測するために使用されます。

用途: 部分観測可能なnプレーヤーの競争ゲームを解決
難易度: Hard
コスト: High

表形式向き説明可能コンピュータビジョンセグメンテーション表形式

GOTabPFN: From Feature Ordering to Compact Tokenization for Tabular Foundation Models on High-Dimensional Data

大きな言語モデルをデータ効率的に訓練するための、新しい方法、 GOTabPFNを提案した。

用途: 大きな言語モデルのデータ効率化
難易度: Hard
コスト: High

表形式向きコンピュータビジョンセグメンテーション検出テキスト表形式

TabSODA: Tabular Diffusion based Imputation with Skip Pattern Detection and Ordinal Awareness

本論文では、欠損値がある表格型データの欠損補完に関して取り組み、欠損値がないセルと同様に動作するSkipパターン検出と順序性意識のあるdiffusionベースの欠損補完アルゴリズムを提案しました。

用途: 表格型データの欠損補完
難易度: Hard
コスト: High

説明可能コンピュータビジョンセグメンテーション予測

ReSGA: A Large Tail Risk Model for Learning Value-at-Risk and Expected Shortfall

この研究では、値または期待短期的なリスク管理 (Value-at-Risk and Expected Shortfall) を提案しており、短期的なリスクを効率的に管理する。

用途: 値または期待短期的なリスク管理
難易度: Hard
コスト: Low

Deterministic Envelopes for Tamed SGLD: Decoupling Stochastic-Gradient Noise and Localizing Taming

この研究では、ストッキング勾配降下法 (Stochastic-Gradient Langevin algorithms) の安定性を確保する方法を提案しており、

用途: ストッキング勾配降下法の安定性
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション検出生成テキスト

Global Sketch-Based Watermarking for Diffusion Language Models

Watermarking methods for language models have been studied extensively in the autoregressive setting, where to

用途: 検出
難易度: Hard
コスト: High

When Both Layers Learn: Training Dynamics of Representing Linear Models via ReLU Networks

In this paper, we study the gradient descent dynamics for jointly training both layers of a one-hidden-layer R

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

The price of multi-group transductive learning

We show every multi-group learner in the transductive setting may incur a multiplicative penalty in its error

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

説明可能品質予測/異常検知コンピュータビジョンセグメンテーション

U-Net-Accelerated Quality-Diversity Optimization for Climate-Adaptive Urban Layouts

都市計画を最適化するために、クオリティ-ダイバーシティ最適化を使用する方法を提案する。

用途: 都市計画の最適化
難易度: Hard
コスト: High

Improved Approximation Guarantees for Groupwise Maximin Share Fairness

We study the problem of fairly allocating a set of indivisible goods to a set of $n$ agents with additive valu

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

githubGitHubあり2026-06-03

CoreNLP — CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

CoreNLPはJavaで開発されたNLPツールのセットであり、分割、文分割、名詞認識、パーシング、コorefence、感情分析などを行える。

コンピュータビジョンセグメンテーション分類

用途: 分析
難易度: Easy
コスト: Low

Edge of Stability Selectively Shapes Learning Across the Data Distribution

Existing analyses of the edge of stability (EoS) treat it as a global property of optimization. We show that i

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Privacy-Robust Incrementality Measurement for Advertising Systems under Signal Loss

広告プラットフォームはランダム化されたLIFTテストを使用してインクレメントを評価しますが、これがプライバシを保障するためのレポートシステムを損なう可能性があります。プライバシーを保護しながら広告のインクレメント測定を可

用途: ランダム化されたLIFTテストを用いた広告のインクレメントの評価問題解決
難易度: Hard
コスト: Medium

Calibrating Urban Traffic Simulation from Sparse Road Observations via Genetic Optimization

Urban traffic simulation is a critical tool for infrastructure planning, including the placement of electric v

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Optimizing Explicit Unit-Distance Lower-Bound Certificates

The 2026 disproof of Erdős's unit-distance conjecture and Sawin's quantitative refinement show that the maximu

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-06-02

MAOAM: Unified Object and Material Selection with Vision-Language Models

Selection is a core operation in interactive image editing. To be practical, a user should be able to specify

MI向き自然言語処理RAG生成セグメンテーション画像

用途: 生成
難易度: Easy
コスト: High

Recovering Direct Price Effects of Environmental Amenities in Housing Markets: Regression and Causal Machine Learning Model Assessment with Empirical Monte Carlo Simulation

Hedonic price models are widely used to assess how environmental amenities affect property values, yet methodo

用途: 回帰
難易度: Hard
コスト: Medium

説明可能コンピュータビジョンセグメンテーション回帰

ScoreStop: Gradient-based early stopping using functional score tests

Gradient boosted decision trees require a stopping rule to avoid overfitting. The standard rule monitors a val

用途: 回帰
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション予測時系列

ProbRes: Volatility Learning for Probabilistic Time-Series Forecasting

Probabilistic time series forecasting has attracted increasing attention in financial applications due to the

用途: 予測
難易度: Hard
コスト: High

Convex Distance Operator Transport: A Convex and Geometry-Preserving Formulation

We introduce Convex Distance Operator Transport (CDOT), the first convex optimal transport framework that alig

コンピュータビジョンセグメンテーション分類3D

用途: 分類
難易度: Hard
コスト: High

Tree-Guided Identify-Then-Exploit: A Unified Framework of Best Arm Identification and Regret Minimization for Dueling Bandits

We study $N$-armed stochastic dueling bandits under the Condorcet-winner assumption, where three widely adopte

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

MINTS: Minimalist Thompson Sampling

The Bayesian paradigm offers principled tools for sequential decision-making under uncertainty, but its relian

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Fast Generalization after Interpolation via Critically Damped Momentum Optimization

A central problem in machine learning is that models can achieve near-perfect training performance while gener

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション回帰テキスト

Simultaneous Model-Based Evolution of Constants and Expression Structure in GP-GOMEA for Symbolic Regression

Genetic programming (GP) approaches are among the state-of-the-art for symbolic regression, the task of constr

用途: 回帰
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-06-01

Quality-Guided Semi-Supervised Learning for Medical Image Segmentation

Training accurate medical image segmentation models requires large amounts of densely annotated data, which is

品質予測/異常検知コンピュータビジョンセグメンテーション画像教師あり半教師あり

用途: セグメンテーション
難易度: Easy
コスト: High

arxivPaper only2026-05-31

Transferring Information Across Interventions in Causal Bayesian Optimization

Bayesian optimization is a popular way to optimize expensive systems, where every experiment, simulation, or i

少数データ向きCPUで試しやすい条件最適化コンピュータビジョンセグメンテーション

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-31

FlowSDR: Sufficient Dimension Reduction via Conditional Normalizing Flows

Sufficient dimension reduction (SDR) seeks a low-dimensional linear projection of predictors that preserves th

用途: 回帰
難易度: Hard
コスト: Medium

arxivPaper only2026-05-31

Repeated Descent: A Framework for Online Budget-Feasible Auctions

We study budget feasible procurement auctions, in which $n$ agents, each with a privately held service cost, o

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-30

Robust inference for risk heterogeneity under group imbalance

Population-level heterogeneity is ubiquitous in biomedical data, where differences across demographic or clini

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-29

Institutions and the transmission of upper-tail human capital: scientific lineages across a millennium

What made useful knowledge cumulative was not discovery alone but the institutions that transmitted it. We pro

コンピュータビジョンセグメンテーション生成テキスト

用途: 生成
難易度: Hard
コスト: High

huggingfaceGitHubありHugging Faceあり2026-05-29

Score-Control for Hallucination Reduction in Diffusion Models

Diffusion models have emerged as the backbone of modern generative AI, powering advances in vision, language,

コンピュータビジョンセグメンテーション生成画像音声

用途: 生成
難易度: Easy
コスト: High

arxivPaper only2026-05-27

Learning to Assess the Reliability of Number-of-Runs Estimation in Stochastic Optimization

In large-scale benchmarking of stochastic optimization algorithms, the key challenge is no longer whether repe

コンピュータビジョンセグメンテーション分類検出

用途: 分類
難易度: Hard
コスト: Low

arxivPaper only2026-05-26

Signal-to-Noise Ratio and Sample Size Govern Representational Alignment in Neural Networks

Neural networks are known to develop latent representations that are $aligned$, namely structurally similar ac

品質予測/異常検知コンピュータビジョンセグメンテーション分類回帰

用途: 分類
難易度: Hard
コスト: High

arxivPaper only2026-05-26

A Trilemma in AMM Mechanism Design

Blockchains have popularized the Automated Market Makers (AMMs), where users trade crypto-assets directly with

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-25

Coalition Free Energy and Adaptive Precision in Multi-Agent Cooperation

Cooperative multi-agent systems require robust mechanisms for credit assignment under uncertainty. Here we int

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-21

Do Not Trust The Auctioneer: Learning to Bid in Feedback-Manipulated Auctions

意図的なプレーヤーに反応するビジネス戦略を開発する方法が提案される。意図的なプレーヤーは、価格を高く抑えるためにアーティフィシャルビジネスを利用する。この方法により、ビジネスが意図的なプレーヤーに反応するビジネス戦略を開

用途: 意図的なプレーヤーに反応するビジネス戦略を開発する
難易度: Hard
コスト: Medium

深層学習Transformer分類検出セグメンテーション

githubGitHubあり2026-05-21

pytorch-grad-cam — Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

このライブラリは、コンピュータービジョンのための高度なAI解釈と可視化ソリューションです。このライブラリは、CNN、ビジョントランスフォーム、分類、物体検出、分割、画像類似度など、さまざまなコンピュータービジョンの

用途: AIの解釈と可視化ソリューション
難易度: Easy
コスト: Low

githubGitHubあり2026-05-21

notebooks — A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from f

深層学習CNN分類検出セグメンテーション

用途: 分類
難易度: Easy
コスト: Low

arxivPaper only2026-05-20

How to Build Marcus's Algebraic Mind: Algebro-Deterministic Substrate over Galois Fields

この研究では、GaryマルクスのAlgebraicMindというアイデアに基づいて、計算モジュールと表現言語の間の橋渡しとして代数的認識を構築するための提案を紹介します。このモジュールでは、registerとtreele

用途: 辺端的な思考を可能とし、変数と階層化された構造化された表現の操作を実現するために、代数的認識を導入する
難易度: Hard
コスト: Medium

arxivPaper only2026-05-20

Convergence Analysis of Evolution Strategies for Mixed-Integer Optimization

この研究では、混合整数最適化の進化戦略に基づくオブジェクト関数の近似精度を確保するためのアプローチを示します。従来の進化戦略では、選択された座標の整数変数の標準偏差に下限を設けて、整数変数の収束を防ぐことが一般的です。こ

用途: 混合整数最適化問題の解法として、進化戦略(ES)に基づくオブジェクト関数の近似精度を確保する
難易度: Hard
コスト: Medium

arxivPaper only2026-05-19

Training Neural Networks with Optimal Double-Bayesian Learning

Backpropagation with gradient descent is a common optimization strategy employed by most neural network archit

コンピュータビジョンセグメンテーション分類検出

用途: 分類
難易度: Hard
コスト: High

arxivPaper only2026-05-13

Nonsmooth Set-Gradient Ascent to the Pareto Front via Layered Hypervolume and Magnitude Indicators

パレートフロントに近づくための非-smooth な集団を使用する方法を提案し、この方法が効果的なパレートフロントの近似を実現することを実験結果により示した。

用途: パレートフロントに近づくための非-smooth な集団を使用する方法を提案する
難易度: Hard
コスト: Medium

arxivPaper only2026-05-13

Genetic algorithm vs. gradient descent for training a neural network architecture dedicated to low data regimes in small medical datasets

Aim/Introduction: Distance-encoding biomorphic-informational neural network (DEBI-NN) is a recently proposed a

MI向きコンピュータビジョンセグメンテーション分類

用途: 分類
難易度: Hard
コスト: High

arxivPaper only2026-05-13

Offline Two-Player Zero-Sum Markov Games with KL Regularization

We study the problem of learning Nash equilibria in offline two-player zero-sum Markov games. While existing a

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-12

Social Welfare under Heterogeneous Time Preferences

In several socioeconomic-critical decision-making settings, such as fair resource allocation, climate policy,

コンピュータビジョンセグメンテーション生成

用途: 生成
難易度: Hard
コスト: Medium

arxivPaper only2026-05-12

Bayesian Persuasion with a Risk-Conscious Receiver

We study Bayesian persuasion when the receiver evaluates actions by reward-side Conditional Value-at-Risk (CVa

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-05-10

RDEx-CASK: Cauchy Mutation, Archive, and Stagnation Kick for RDEx-CSOP

We extend RDEx-CSOP with 3 changes that target stagnation & late-stage variance, plus minor parameter tuning.

品質予測/異常検知コンピュータビジョンセグメンテーション生成

用途: 生成
難易度: Hard
コスト: Medium