コンピュータビジョン

ultralytics — Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimation, object tracking

ultralyticsはYOLO(You Only Look Once)の技術を使用したオブジェクト検出ライブラリで、高い精度を提供している。

yolov5 — Ultralytics YOLOv5 in PyTorch for object detection, instance segmentation, classification, training, and export.

YOLOv5という物体検出アルゴリズムをPyTorchから他の言語に変換できるライブラリ。

コンピュータビジョン物体検出分類セグメンテーション画像

label-studio — Label Studio is a multi-type data labeling and annotation tool with standardized output format

データラベル化と注釈化を行うためのツールです。

未読 245件

ultralytics — Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimation, object tracking

ultralyticsはYOLO(You Only Look Once)の技術を使用したオブジェクト検出ライブラリで、高い精度を提供している。

用途: オブジェクト検出
難易度: Easy
コスト: Low

yolov5 — Ultralytics YOLOv5 in PyTorch for object detection, instance segmentation, classification, training, and export.

YOLOv5という物体検出アルゴリズムをPyTorchから他の言語に変換できるライブラリ。

用途: 物体検出
難易度: Easy
コスト: High

コンピュータビジョン物体検出分類セグメンテーション画像

label-studio — Label Studio is a multi-type data labeling and annotation tool with standardized output format

データラベル化と注釈化を行うためのツールです。

用途: データラベル化ツール
難易度: Easy
コスト: Low

learnopencv — Learn OpenCV : C++ and Python Examples

OpenCVを用いて画像処理の学習方法を紹介している。

用途: OpenCVの学習
難易度: Easy
コスト: Medium

vision — Datasets, Transforms and Models specific to Computer Vision

コンピュータビジョンのデータセット、変換、モデルのライブラリ。

用途: コンピュータビジョン
難易度: Easy
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション分類検出画像

cvat — Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling services, for image, video, and 3D annotation with AI-assisted labeling, quality assurance, team collaboration, analytics, and developer APIs.

CVATは、機械学習用の業界標準のデータエンジンです。さまざまなスケールのチームが使用し、さまざまなスケールのデータに対応しています。

用途: データのラベル付けと管理
難易度: Easy
コスト: High

コンピュータビジョンセグメンテーション分類画像動画

labelme — Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.

イメージを注釈するツール。ポリゴン、長方形、円、線、点などを注釈することができる。

用途: イメージ注釈
難易度: Easy
コスト: High

carla — Open-source simulator for autonomous driving research.

CARLAは、オープンソースのシミュレータで、主に自動運転研究のために使われます。このシミュレータを使うことで、車両などのロボットをシミュレートし、様々なシナリオを実行できます。

用途: 自動運転研究用のオープンソースシミュレータ
難易度: Easy
コスト: Medium

Meshroom — Node-based Visual Programming Toolbox

ノードベースのビジュアルプログラミングツールです。

コンピュータビジョン3D・点群画像テキスト3D

用途: ビジュアルプログラミングツール
難易度: Easy
コスト: High

colmap — COLMAP - Structure-from-Motion and Multi-View Stereo

このライブラリは、3次元幾何学とモーションの解析のためのオープンソースライブラリです。このライブラリは、複数の視点からの画像を扱い、構造計算とマルチビューステレオの解析をサポートしています。

用途: 3次元幾何学とモーションの解析
難易度: Easy
コスト: Medium

kornia — 🐍 Geometric Computer Vision Library for Spatial AI

このリポジトリでは、金融分野に適したLarge Language Modelsを提供しています。

コンピュータビジョン画像

用途: 金融用のLarge Language Models
難易度: Easy
コスト: High

rerun — Visualize, query, and stream to train on multimodal robotics data.

データをロギング・ストーリング・クエリして視覚化できるSDKです。

コンピュータビジョンマルチモーダル画像

用途: データロギングおよび視覚化
難易度: Easy
コスト: High

stanza — Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

stanzaは、さまざまな言語を処理するための言語処理用ライブラリです。

用途: 言語処理用ライブラリを提供する
難易度: Easy
コスト: Low

3D-Aware VLMs with Implicit and Explicit Geometries

3次元空間理解技術のための新しいアプローチであるVLM-IE3D（Vision-Language Models with Implicit and Explicit 3D geometry）を提案しました。VLM-IE3

コンピュータビジョン3D・点群検出画像テキスト

用途: 3次元空間理解技術の開発
難易度: Hard
コスト: High

Barzilai-Borwein Fails Superlinear Convergence on an Open Set of Quadratics for Every Dimension $n\geq 4$

バルザリ＝ボレイン法のスーパー非線形収束問題に関する論文を発表しました。この論文では、バルザリ＝ボレイン法が非線形収束できないオープン集合のすべての二次型問題に対してスーパー非線形収束できないことを示しました。これは、強

用途: 最適化アルゴリズムの検証
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション分類生成テキスト

Beyond Sufficiency: Time Series Explanation with Counterfactual Necessity

時系列データ分析技術のための新しいアプローチであるTimePNS（Time Series Explanation with Counterfactual Necessity）を提案しました。TimePNSは、時系列データ

用途: 時系列データ分析技術の開発
難易度: Hard
コスト: Low

Zero-Flow Two-Sample Tests

We propose a new approach to two-sample testing for deciding whether two sets of samples are drawn from the sa

コンピュータビジョンセグメンテーション回帰画像

用途: 回帰
難易度: Hard
コスト: Medium

What, Where, and How: Disentangling the Roles of Task, Language, and Model in Code Model Representations

Do independently trained language models come to represent the same thing in the same way? We answer for code,

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

説明可能センサ/時系列コンピュータビジョン動画認識予測テキスト

Climate-resilient electric vehicle charging infrastructure for sustainable cities: An interpretable causal-ensemble framework for preventive maintenance and low-carbon mobility

都市の電気自動車充電インフラは、可及的速やかに故障を予測・修理することで、耐久性と低炭素化を向上させる必要がある。機械学習を用い、故障を予測するモデルの開発を研究した。

用途: 都市の電気自動車充電インフラの耐久性向上
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション生成テキスト

Context-weighted Discrete Flow Matching

ディスクリートフロー・マッチングにおけるコンテキストの正しい有用性の利用を検討した。この研究では、ディスクリートフロー・マッチングのモデルの正確さを高めるためにコンテキストの有用性を適切に利用する方法を提案した。

用途: ディスクリートフロー・マッチングにおけるコンテキストの有用性
難易度: Hard
コスト: High

Safety-oriented sidewalk and road segmentation for smartphone-based assistive navigation

この研究では、車椅子の位置情報を取得するために、安全な歩道と道路を分類するセグメントを提案し、視覚障害がある人々や盲人の移動を支援する手段になる可能性があります。

コンピュータビジョンセグメンテーション画像

用途: 車椅子の位置情報取得
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション生成マルチモーダル

Best-of-Evidence: Best-of-N Selection under Partial Verification

モデル出力の選択のためのBoN（ベストオブナ）を、部分検証が含まれるビジョン言語タスクに適用する。この方法により、モデル出力を効率化できる。

用途: 部分検証を含むビジョン言語タスクを効率化する
難易度: Hard
コスト: High

Bridging the Gap Between Plausibility and Admissibility: Constraint-Aware Flow Maps for Dynamic Graph Systems

Generative models can support decision-making under uncertainty by producing ensembles of plausible future sys

コンピュータビジョンセグメンテーション生成

用途: 生成
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション埋め込みテキスト自己教師

MSBraM: A Multi-scale Self-supervised Brain Foundation Model for Hierarchical EEG Dynamics Learning

Self-supervised foundation models have recently shown strong potential for electroencephalogram (EEG)-based an

用途: 埋め込み
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション検出画像テキスト

PC-Edit: Prompt-Contrastive Region Discovery and Region-Guided Editing

Replacing an object with one that differs in category or shape requires complete source removal, natural targe

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション分類検出動画

BasketEvent: Understanding Who Did What and When in Basketball Videos

この研究では、大規模言語モデルを使用して、basketボールの動的理解に基づいて、プレイヤーへの関わりや時間境界を推測するモデルを開発しました。

用途: basketボールの動的理解
難易度: Hard
コスト: High

Logic Programming Semantics for Causal Processes

この研究では、大規模言語モデルを使用して、因果プロセスの理解を進めました。大規模言語モデルを活用することで、因果関係を予測することができました。

用途: 因果プロセスの理解
難易度: Hard
コスト: High

How Rules Represent Causal Knowledge: Causal Modeling with Probabilistic Logic Programming

この研究では、大規模言語モデルを活用して、因果関係のモデル化を研究しました。大規模言語モデルを活用することで、因果関係を予測することができました。

用途: 因果関係のモデル化
難易度: Hard
コスト: High

Can Generative Recommendation Reach Cold Items? A Temporal Perspective on Semantic-ID Generation

Semantic-ID-based generative recommendation represents items as sequences of shared semantic tokens, enabling

コンピュータビジョン動画認識生成テキスト

用途: 生成的な推奨システムの冷たいアイテム
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション

Interaction Dynamics Modeling and Predictive Control for Safe Steerable Catheter--Tissue Interaction

Safe steerable catheter control is fundamentally a problem of interaction dynamics: the tip must follow a plan

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

説明可能品質予測/異常検知コンピュータビジョンセグメンテーション画像

Scene Parameter Saliency via Differentiable Light Transport

光の伝達の可微分化を用いて、入力が最も影響するシーン要素を特定するための方法を提案した。

用途: グラデンスの推測
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョン3D・点群生成3D

Future Rendering $\neq$ Future Surface: A Benchmark and Dataset for Dynamic Surface Reconstruction Beyond the Observed Window

Dynamic-scene reconstruction is almost always evaluated inside the observed time window, yet deployment settin

用途: 生成
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション検出テキスト動画

Incremental Optimal Assignment for Real-Time Crowd Tracking

Multi-object tracking in dense crowds requires solving a bipartite assignment problem between detections and t

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成テキスト動画

T-STAR: A Large-Scale Benchmark for Spatio-Temporal Panoptic Scene Graph Generation in Satellite Video

Structured understanding of satellite video is essential for advancing dynamic geospatial scene analysis from

用途: 生成
難易度: Hard
コスト: High

The Second LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results

LoViF の 2 回目のチャレンジでは、画像修復に新たなアプローチを提案しています。実世界の画像を修復するための包括的な評価基準を提供しており、低光照度、ハッジ、雨、雪などのさまざまな障害に対する解決策を研究者に求めて

コンピュータビジョン画像

用途: 画像修復
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーションテキスト3D

Loss Landscape Topology Reveals Why Simple Baselines are Competitive at 3D Point Cloud Segmentation Under Class Imbalance

3D点群のセグメンテーションではクラス不均衡が発生し、有効な解決策が必要です。この研究では、11 つの不均衡対策を 2D のコンピュータビジョンとは異なる 3D の上で評価し、標準的な交差エントロピーと均衡の重み付けが競

用途: 3D点群のセグメンテーション
難易度: Hard
コスト: High

The RealDefocus Benchmark for Defocus Deblurring

ドリフス脱失は画像を再構築するために不可欠ですが、再構築画像とドリフス画像のペアリングや標準化されたプロトコルなどの要件を満たすデータセットが不足しているため、評価が難しいです。この研究では、レアルワールドに基づくドリフ

品質予測/異常検知コンピュータビジョン画像

用途: ドリフス脱失の解除
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション画像3D

TransBiolab: A Real-World Multi-View Dataset of Cluttered Transparent Biomedical Objects

自動化された生理学ラボでは、透明なプラスチック製品を認識、位置付け、操作するために視覚知覚が必要ですが、対象となる高品質のリアルワールドデータセットは現在限られています。この研究では、複雑なマルチオブジェクトのシーンを扱

用途: 膚質物体の可視化
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション分類画像

HyperImageNet: A Large-Scale High-Spatial Resolution Hyperspectral Imagery Classification Benchmark

We present HyperImageNet, a large-scale benchmark for fine-grained hyperspectral land-cover understanding. The

用途: 分類
難易度: Hard
コスト: Low

説明可能センサ/時系列コンピュータビジョンマルチモーダル画像テキスト

GeoThreat: Transferable Targeted Adversarial Attacks on Large Vision-Language Models for Remote Sensing Image Interpretation

Adversarial attacks against large vision-language models (LVLMs) serve as an effective means of assessing thei

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Explainable Deepfake Detection Challenge

Deepfake detection is moving beyond binary classification decisions toward systems that can also explain the v

説明可能コンピュータビジョン画像分類分類検出生成

用途: 分類
難易度: Easy
コスト: Low

コンピュータビジョンセグメンテーション分類画像教師あり

Webly Supervised Multi-Label Recognition: Evaluation Benchmark and Dual-Branch Multi-Label Contrastive Learning

この論文では、Webly Supervised Multi-Label Recognition（WS-MLR）という手法を提案します。WS-MLRは、web画像データセットを使用して、多ラベルを解釈します。

用途: Weblyスーパーバイズ多ラベル認識
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知コンピュータビジョンマルチモーダル画像

AXIS: A Growable Community-Driven Data Engine for Scalable Robot Manipulation

Learning effective robot manipulation policies requires diverse, high-quality demonstrations, yet existing dat

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーションQA画像テキスト

Beyond Episodic Evaluation: Memory Architectural Bottlenecks in Sequential Embodied Question Answering

Embodied question answering (EQA) is traditionally evaluated under an episodic formulation, where agents solve

用途: QA
難易度: Hard
コスト: High

Grasp, Handover, Rotate: Bimanual Object Reorientation via Compositional Diffusion and Energy-Based Optimization

Bimanual object reorientation - picking an object, handing it over between two arms, and placing it in a desir

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーションテキストマルチモーダル

URF: A Unified Robot Control-Policy Framework for Stable Contact Aware Manipulation

Learning-based manipulation policies usually predict robot actions from sensory observations and leave their e

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

githubGitHubあり2026-07-23

ml-agents — The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Unityを使用してマシンラーニングエージェントを訓練して訓練できるツールです。

コンピュータビジョン3D・点群3D強化学習

用途: Unityでマシンラーニングエージェント
難易度: Easy
コスト: High

Memoir: Should a Model Write to Its Memory While It Thinks?

Memoir combines per-sample fast memory, shared slow parameters, variable-depth latent recurrence, and a future

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Synthetic minority data is redundant or invalid: a data-dependent validity theory and a de-biased test

For two decades, the standard remedy for class-imbalanced learning has been to fabricate synthetic minority ex

用途: 分類
難易度: Hard
コスト: Low

品質予測/異常検知コンピュータビジョンマルチモーダル分類

Adaptive Confidence-weighted Expansion for Trustworthy Multi-Omics Multimodal Fusion

Multimodal learning is a robust approach to improve predictive performance in applications such as medical pro

用途: 分類
難易度: Hard
コスト: High

Perspective Latents as an Architectural Condition for Causal Emergence in Active Inference Agents

A recent line of work measures causal emergence in reinforcement learning agents through Integrated Informatio

コンピュータビジョン動画認識強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョン動画認識テキスト

Attribution Markets: A Fisher-Market Formulation for Fractional Credit Assignment Between Planned Tasks and Performed Actions

Personal and organizational planning systems maintain two records that drift apart: what was planned (a task's

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Adaptive Multi-Horizon Reinforcement Learning

Effective decision-making in complex and changing environments requires balancing short-term and long-term con

コンピュータビジョン動画認識強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

SalesLoop: Reinforcement Learning from Performance Feedback for Sales Lead Ranking

Lead ranking in Customer Relationship Management (CRM) systems faces a persistent challenge: models achieving

コンピュータビジョン動画認識強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

One Round Is All You Need: Analytic Federated Learning for Task-Heterogeneous Multi-Label Medical Image Classification

Federated learning (FL) enables multiple clinical institutions to collaboratively train a shared disease class

コンピュータビジョン画像分類分類回帰画像

用途: 分類
難易度: Hard
コスト: Low

Algorithmic Approaches to Sequential Decision-Making and Social Epistemology

As humans, we face many decisions that require us to choose between sticking to something and giving up. This

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Lipschitzian SLLNs for random functions

本研究では、局所的にリップシッツ函数に対する強い大数法を証明し、モデル理論的条件や対称性条件の下でも実現するものと主張した。

用途: 強い法の数理上の問題の解決
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション分類予測

Statevector-Referenced Geometry Survival of a Four-Qubit ZZ Quantum Kernel on IBM Quantum Hardware: A Fixed-Subset Diagnostic Across Three Execution Configurations

この研究では、IBM Quantum ハードウェア上で実行される4キュビットの量子カーネルについて、スルーハードウェアで状態ベクトルを使用して、グラム行列における幾何学的情報の実行時の正確性を検証した。

用途: クラウド上の量子コンピュータの実行環境への適切化
難易度: Hard
コスト: Low

Variance-reduced Domain Adaptation using Paired Sampling

この研究では、分配マッチングにおける高変動の削減に伴い、最適化の安定化、精度の向上を実現するために、paired サンプリングという新しい手法を提案した。

コンピュータビジョンセグメンテーション教師なし

用途: 分布マッチングにおける高変動の削減
難易度: Hard
コスト: High

Decentralized Online Riemannian Optimization for Strongly Geodesically Convex Functions

この研究では、Riemann マンIFOLD上で、誤差が小さいオプティマイザを設計し、強いG-凸な関数に対応するものを実現した。

用途: 誤差が小さいオプティマイザの作成
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション

Breaking the $T^{3/4}$ Barrier for Regret Minimization With Bi-Dimensional CDFs

We study regret minimization for learning CDF-related objectives of the form \[ g(x)\cdot\mathbb{P}_{X\sim\mat

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション分類

Multi-stage Dynamic Selection for Cross-Project Defect Prediction

複数プロジェクト間の欠陥予測を扱う研究、Multi-stage Dynamic Selection を用いて複数プロジェクト間の欠陥予測を提案する。

用途: 複数プロジェクト間の欠陥予測
難易度: Hard
コスト: High

表形式向きCPUで試しやすいコンピュータビジョンセグメンテーション検出

Harnessing Disagreement: Detecting Correlated Agreement Blindness in Multi-Agent Triage

この研究では、マルチエージェントによるトリージュア

用途: マルチエージェントによるトリेजュアの安全性の評価
難易度: Hard
コスト: Medium

Local Causal Structure Learning in the Presence of Latent Variables and Selection Bias

Discovering the direct causes and effects of a target variable from observational data is a fundamental proble

コンピュータビジョンセグメンテーション音声

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Adversarial Frontiers: Minimum-Norm Attack Ensembles for Robustness Evaluation

Adversarial robustness is commonly evaluated with predefined attack ensembles, such as AutoAttack, at a single

コンピュータビジョン画像分類画像

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

A Multiclass Quantum Aligned Centroid Kernel

量子合成関数を使用した分類問題を解決するための方法を提案している。この方法は、学習可能な量子関数を使用し、訓練データサイズの線形スケーリングを実現している。

用途: 分類問題の量子合成関数
難易度: Hard
コスト: High

Domain-Adapted Power Curve for Cross-Farm Applications

The wind energy industry relies on accurate power curve models to make power forecast, evaluate turbine perfor

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション分類生成画像

Analytic Distribution of Classifier-Free Guidance for Schedule Design

Classifier-free guidance (CFG) is the default mechanism for conditional generation in diffusion models, but th

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション3D

Koopman Dreamer: Spectrally Constrained Latent Dynamics for Stable World-Model Imagination

Latent world models improve sample efficiency in continuous control by optimizing policies over imagined laten

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

CPUで試しやすいコンピュータビジョンセグメンテーション生成

How Fast Can Reward Models Score? A Systems Study of C++ and PyTorch Inference Runtimes for RLHF

In RLHF pipelines, reward scoring blocks policy updates. Slow scoring bottlenecks the entire loop, since no up

用途: 生成
難易度: Hard
コスト: Medium

Optimal Recalibration of an Online Predictor

この研究では、オンライン予測の再調整アルゴリズムを提案します。このアルゴリズムは、適切な損失に対して誤差が小さい新しい予測を生成でき、過去の予測に比べて性能が向上した結果を確保することができます。

用途: オンライン予測の再調整
難易度: Hard
コスト: Low

MI向きコンピュータビジョンセグメンテーション生成テキスト

Nuclear Quantum Effects as a Denoising Problem

この研究では、核量子効果をシミュレートするために、画像時刻パス積分を利用した分散機械学習アルゴリズムを提案します。このアルゴリズムは、分散機械学習を利用して核量子効果をシミュレートすることに成功し、核量子効果に関連する問

用途: 核量子効果のシミュレーション
難易度: Hard
コスト: High

少数データ向きコンピュータビジョンセグメンテーション

High Minima of Gaussian Processes: Overshoots and Minimizer Locations

Let $X(t)$, $t\in K$, be a centred Gaussian process with continuous sample paths on a compact metric space $K$

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション回帰テキスト

Quantum Kernels and the Cross-Section of Stock Returns: Anatomy of a Vanishing Advantage

Do quantum kernels improve cross-sectional stock return prediction? We run a controlled horse race on the Chin

用途: 回帰
難易度: Hard
コスト: High

Operational Identity: A Finite Audit of Declared and Implemented Rules of Sameness

A record system declares when two records refer to the same entity, occurrence, scope, or rule. Its disclosed

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョン3D・点群3D自己教師

PRIME-SVR: Physics-infoRmed Implicit Multi-Echo Slice-to-Volume Reconstruction for Fetal T2 mapping

Slice-to-volume reconstruction (SVR) is the standard method for obtaining high-resolution (HR) 3D fetal brain

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

ENTRAP-VL: A Taxonomic Probe for Dual Contextual Entrainment in Vision-Language Models

Contextual entrainment is the tendency of a model to let auxiliary context in its input pull its output, indep

コンピュータビジョンマルチモーダル画像テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Foundation-model-guided radiogenomic discovery linking cancer genomes to cancer scans

The function of many genes is still unknown, and conventional driver-discovery methods, which rely on how freq

用途: セグメンテーション
難易度: Hard
コスト: High

A Systematic Benchmark of Intensity Normalisation Methods for 3D Knee MRI Segmentation and Cross-Domain Generalisability

MRI画像の強度正規化方法を7つ比較し、3DUネットワークモデルでMeniscusの分割精度を評価。

コンピュータビジョンセグメンテーション画像3D

用途: MRI画像の強度正規化を解決する
難易度: Hard
コスト: High

RALS: Resources and Baselines for Romanian Automatic Lexical Simplification

文語の簡素化は、言語学習者の理解を促進するための有効な手法ですが、現在実際に有効であるかどうかが確立されていません。ルーマニア語で文語の簡素化に関する基準とリソースが作成されました。

用途: 文語の簡素化のためのルーマニア語のリソースと基準
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンマルチモーダル分類生成画像

Ocular Verification for Virtual Reality

Virtual reality (VR) headsets (e.g., Meta Quest, Apple Vision Pro) provide a seamless user experience due to t

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション検出異常検知テキスト

Rethinking Open-World Video Anomaly Detection: Diagnosing Definition Blindness

Open-world video anomaly detection (OWVAD) is expected to detect events that match a user-specified definition

用途: 検出
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成画像3D

A real-time RGB-D perception pipeline for autonomous impact hammers in mining: self-filtering, rock segmentation and rock-breaking poses generation

Impact hammers, also known as rock-breakers, are essential machines in mining operations, where they perform s

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション生成画像3D

Axolotl3D: a Unified Framework for Faithful 3D Shape Completion

Recent 3D generative models produce high-quality geometry from a single image using large-scale priors and dif

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョン3D・点群生成画像3D

ATSplat: Compact Feed-forward 3D Gaussian Splatting with Adaptive Token Expansion

Novel View Synthesisは、入力画像から新しい視点の画像を生成するタスクです。ATSplatアルゴリズムは、3次元ガウススプラッタリングを Feed-forward に適合させました。これにより、ATSp

用途: Novel View Synthesis
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成テキスト動画

Self Gradient Forcing: Native Long Video Extrapolation

長時間ビデオエクストラポレーションには、高度な視覚的知能が必要です。Self Gradient Forcingアルゴリズムは、学生モデルを教師モデルから生成される歴史の下で学習させることで、長時間ビデオエクストラポレーシ

用途: 長時間ビデオエクストラポレーションのための自力勾配強制
難易度: Hard
コスト: High

表形式向き説明可能CPUで試しやすい品質予測/異常検知コンピュータビジョン物体検出分類検出画像

How Does Urban Context Relate to Residential Building Health? A Vision-POI Fusion Framework for Building-Level Housing Inspection

Housing-level urban physical examination is essential for identifying residential building problems and suppor

用途: 分類
難易度: Hard
コスト: Low

コンピュータビジョンセグメンテーション生成画像動画

Vera: Identity-Faithful Human Subject-to-Video Generation

Subject-to-video (S2V) generation has made substantial progress in preserving reference subjects across divers

用途: 生成
難易度: Hard
コスト: High

CPUで試しやすいコンピュータビジョン物体検出検出

Real-Time EEG Cap Electrode Detection for Guided Point-of-Care Placement

We present a two-stage vision system that detects EEG cap electrodes in a live webcam stream and validates the

用途: 検出
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョン3D・点群テキスト3D

GaussianSeed: Hierarchical Gaussian Seeding for High-Resolution 3D Occupancy Prediction

Vision-centric 3D occupancy prediction provides dense scene representations essential for autonomous driving a

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンセグメンテーション3D自己教師

SIINR: Structurally Informed Implicit Neural Representations for super-resolution with uncertainty quantification of clinical quality diffusion MRI datasets

Diffusion Magnetic Resonance Imaging (dMRI) is a powerful tool for probing brain microstructure, but clinical

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

G-MAD: A Game-Based Data Generation Framework for Multi-View RGB-T Aerial Object Detection

This work introduces G-MAD, an open-source framework that uses Arma3 to generate synchronized multi-view RGB-T

コンピュータビジョン物体検出検出生成

用途: 検出
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション生成画像テキスト

OSVE: One Step Video Editing with One Step Diffusion Models

Text-guided video editing with diffusion models is impractically slow, hindered by costly multi-step sampling

用途: 生成
難易度: Hard
コスト: High

KineBench: Benchmarking Embodied World Models via IDM-Free Kinematic Grounding

Evaluating the physical consistency of embodied world models(EWMs) is a critical open challenge. While closed-

コンピュータビジョン3D・点群生成異常検知画像

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンマルチモーダルQA画像

Silent Failures in Multimodal Agentic Search:A Diagnostic Taxonomy and Cross-Judge Evaluation

この研究では、可視化された質問への対応を評価するために、新しい方法を提案しました。この方法は、質問への回答の正確性だけでなく、質問への回答のパターンや特徴も評価することができます。

用途: 可視化された質問への対応を評価する
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション分類テキスト

Frequency-Hierarchical Active k-Space Sampling for Diagnostic MRI

3D Gaussian Splatting (3DGS) は 3D セグメント間の接着を実行するために使用され、テキストドライブの 3D シーンエディットには不可欠です。現行の方法では、固定位置撮影から 2D ディ

用途: 3D Gaussian Splatting (3DGS)のエディットを改善
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知コンピュータビジョン物体検出検出マルチモーダル

DRGBT-1K: A Large-scale High-quality Benchmark for Dynamic RGBT Tracking

地上を表す重力式マップの高解像度版が、多くの用途で役立ちます。たとえば、市区町村の変化を監視したり、エネルギー対策を向上させたり、温室効果ガスの排出量を追跡したりすることができます。4つの主要な全世界建物Rasterデー

用途: 宇宙に分布する建物の面積を正確に推定する
難易度: Hard
コスト: High

Global Building Area Estimation Products: How Accurate Are They?

大規模視点合成モデルは、視点間の注意を交差させることで、未知の視点から3Dシーンを推論します。近年、そのようなモデルはRGB情報だけで3Dの空間関係を学習することができたため、近年の研究者たちは、3Dセグメンテーションに

用途: 新たな視点から3Dシーンを推論する
難易度: Hard
コスト: High

ReFace: Reorganizing Facial Spatiotemporal Representations for Improved Pain Assessment

Automatic pain assessment from facial video remains challenging due to the spatial heterogeneity of pain-relat

コンピュータビジョンセグメンテーション画像動画

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

説明可能センサ/時系列コンピュータビジョンセグメンテーション音声

Domain Shift in Echocardiography: Interpretable Quantification and Prediction of Cross-Dataset Left Ventricular Segmentation

Cross-dataset generalisation remains a major barrier to clinical deployment of echocardiographic left ventricu

用途: セグメンテーション
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョン3D・点群テキスト3D

Scalable Low-Cost Laboratory Automation: A Digital Twin-Integrated Robotic Platform for Autonomous Liquid Handling (RAINBOTTM)

Laboratory automation accelerates discovery, yet its adoption is constrained by the high cost, proprietary des

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Extreme-RGMT: Continual Learning of Highly Dynamic Skills for Robust Generalist Humanoid Control

Extreme-RGMT は、高動的運動を複数のエンバーでロボットが実行できる制御システムである。ロボットは一般目的（一般運動）と専門的な運動能力（専門的な目的）を両方持つことができ、人工的な環境で人間が実行する運動を学

用途: 人物の高動的運動を複数のエンバーするロボット制御
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョン動画認識検出異常検知マルチモーダル

Clinical Pathways as Safety Specifications for Physical AI in Hospital Wards

Clinical Pathways は、ロボットが実際の環境で安全に動作するためのシステムである。これは、ロボットが病室で安全に作業し、医療スタッフや患者を守る。

用途: 医療機関で使うロボットの安全性を確保するためのシステム
難易度: Hard
コスト: High

No Extra Signals Needed: The Uniform Price of Explainable Information Design

In information design, an informed sender aims to influence a receiver's decision by committing to a signaling

説明可能コンピュータビジョンセグメンテーション

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

supervision — We write your reusable computer vision tools. 💜

supervisionは、機械学習技術を活用して、ユーザー独自のコンピュータビジョンツールを作成することができる。

用途: オリジナルコンピュータビジョンツール
難易度: Easy
コスト: High

dlib — A toolkit for making real world machine learning and data analysis applications in C++

機械学習とデータ分析のためのC++のツールキット。

用途: 機械学習ツールの提供
難易度: Easy
コスト: Medium

OpenWorldLib — Unified Codebase for Advanced World Models.

OpenWorldLibは、進化する世界モデルを提供する統一されたコードベースです。

コンピュータビジョン3D・点群生成動画3D

用途: 世界モデルを提供する
難易度: Easy
コスト: High

Awesome-CVPR2026-CVPR2025-ICCV2025-CVPR2024-ECCV2026-ECCV2024-AIGC — A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2026/ECCV2024 AIGC

CVPRに基づくAIを取り入れるための資料集を提供します。CVPR 2026、2025、2024、およびECCV 2024に基づくAIGCに関する研究論文とソフトウェアコードを含みます。

コンピュータビジョン3D・点群生成画像動画

用途: AIをCVPRに応用する
難易度: Easy
コスト: High

insightface — State-of-the-art 2D and 3D Face Analysis Project

このプロジェクトは２Ｄおよび３Ｄ顔の分析を実現するための基盤プロジェクトであり、最先端の技術を導入して顔の分析を実現します。

コンピュータビジョン3D・点群分類検出3D

用途: 面量認証
難易度: Easy
コスト: High

Boltzmann-Expected Molecular Design with Decoupled Annealing Flows

分子設計を自動化する方法「Boltzmann-Expected Molecular Design with Decoupled Annealing Flows（DECAF）」を提案。分子設計で重要な3次元構造の特性を確率

コンピュータビジョン3D・点群生成テキスト3D

用途: 分子設計の自動化
難易度: Hard
コスト: High

The Tractability Landscape of Sampling with Inexact Scores

この研究では、サンプリングのトラクトビリタを分析する新しい方法を提案しています。この方法は、サンプリングの誤差に関係する条件を分析し、サンプリングのトラクトビリタを精度よく評価します。

用途: サンプリングのトラクトビリタ
難易度: Hard
コスト: Medium

The Price of Hidden Curvature: An $\widetildeΩ (d^{5/4} \sqrt{T})$ Lower Bound for Bandit Convex Optimization

この文書では、バンディット型凸最適化の最小公倍数期待誤差について、最初の非ゼロの誤差下限を提案しました。これは、2次元空間で構成された凸関数のハードクラスであり、ドメインのサイズdとデータ数Tの関数です。

用途: バンディット型凸最適化の下限
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション分類テキスト

Selection Shapes the Boundary: A Preregistered Replication of Monotonicity and Label Agreement in Unselected NLI Populations

人間が与えるラベル相違を研究した研究では、主に分異議のあるデータを選んで分析したところ、非上昇漸近性オペレータを持つ仮説がChaosNLIで低いラベル相互に関連性があると結論づけた。しかしながら、データが選択されていない

用途: NLIデータのラベル相違の境界の検証
難易度: Hard
コスト: Low

Disentangling Curriculum Learning in NLP: Towards a Unifying Taxonomy

課程学習 (Curriculum Learning) は、AIのトレーニングに使われる学習プロセスの一つで、学習が進むにつれてトレーニングデータを難易度順に変更することを含む。そのうちの難易度評価に関して議論が尽きない。

用途: 課程学習の分類
難易度: Hard
コスト: High

Bounding Boxes to Improve Small Language Model Performance on Vision-Based Grading Tasks

The deployment of Small Language Models (SLMs) in educational settings offers significant advantages in terms

コンピュータビジョン物体検出検出画像テキスト

用途: 検出
難易度: Hard
コスト: Medium

MI向きコンピュータビジョンセグメンテーションQA画像テキスト

ChronoStitch: Training-Free Composition of Visual KV Memories for Long-Horizon Temporal Reasoning

Long-video question answering requires a model to preserve visual evidence over time without repeatedly reproc

用途: QA
難易度: Hard
コスト: High

Synthetic and Derived Training Images for Campus Waste Detection: A Multi-Seed Evaluation with YOLOv8n

Incorrect disposal can contaminate campus recycling streams, and a bin-mounted camera could provide feedback a

コンピュータビジョン物体検出検出画像

用途: 検出
難易度: Hard
コスト: High

Masked Visual Actions for Unified World Modeling

Video models absorb rich priors over how the visual world moves, interacts, and responds to contact, making th

コンピュータビジョンセグメンテーション画像動画

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

ERank in Latent Space as an Image-Complexity and Richness Measure

計算機ビジョンと画像認識では、画像の視覚的なリッチネスを評価するために有用な指標が求められるが、これまでの指標は制限があった。この問題を解決するために、チャンネル空間の分散を利用した指標を提案する。

コンピュータビジョンセグメンテーション分類画像

用途: 画像の視覚的なリッチネスを評価するための新しい指標を提案する
難易度: Hard
コスト: High

From Distances to Trajectories: Real-Time Signed Distance Function Mapping and Distance-Accelerated Motion Planning for UAVs

難しい環境で運用するためには、自動空飛ブイロード（UAV）が実際に障害物に存在する距離を判断し、安全な軌跡を計画することが求められる。これを行うために、複数のステージ（マッピングと計画）を連続化した、サイン・ディスタン

コンピュータビジョンセグメンテーション検出3D

用途: UAVの安全な運用
難易度: Hard
コスト: High

No Training, Better Flights: Test-Time Scaled VLMs for UAV Navigation

無線無人飛行機のルートプランニングでは、視空間と言語モデルを利用して安全なルートを生成する必要がある。この問題を解決するために、テスト時にモデルをスケールアップさせる方法を提案する。

コンピュータビジョンマルチモーダルテキスト

用途: 無線無人飛行機のルートプランニングを改善する
難易度: Hard
コスト: High

Milo, a Fully Autonomous Indoor/Outdoor Robotic Guide Dog

Many Blind and Low-Vision (BLV) people rely on guide dogs for moment-to-moment navigation, such as staying on

コンピュータビジョン3D・点群検出3D

用途: 検出
難易度: Hard
コスト: High

Eversion-based robots can enable safe access,steering and endoscopic imaging within the spinal subarachnoid space

この研究では、スパイナルサブアルテラノスパース内の安全な移動、操縦、内視鏡撮影を可能にする医療用ロボットを提案します。

コンピュータビジョンマルチモーダル画像

用途: 肌下腔内の医療ロボット
難易度: Hard
コスト: High

STL-GCS: A Planner-Controller Framework for Signal Temporal Logic via Graphs of Time-varying Convex Sets

We present a unified trajectory planning and control framework for the satisfaction of Signal Temporal Logic (

用途: STL
難易度: Hard
コスト: Medium

Agentic Real2Sim: Physics-based World Modeling with Vision-Language Agents

Real-to-sim conversion for robotic interaction with objects remains labor-intensive because it requires more t

コンピュータビジョンマルチモーダル画像テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン動画認識検出生成画像

arxivGitHubあり2026-07-21

NGPS: GPS-Denied Aerial Geo-Localization and 2.5D Reconstruction via Deep Satellite Image Matching and Multi-Rate Sensor Fusion

この研究では、高空飛行の無信号位置指示のNGPS (Next-Generation Positioning System)というフレームワークを提案しました。NGPSは、GPSの信号を利用せずに位置推定を可能にします。N

用途: 高空飛行の無信号位置指示
難易度: Hard
コスト: High

Pose-Parameterized Motion Planning and CBF-QP Self-Collision Filtering for a Long-Reach Drilling Boom

Long-reach drilling booms must reach successive poses without self-collision. Moving from operator-supervised

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Motion Primitive Discovery in a Humanoid Robot via Self-Organising Maps for Phase Recognition

行動モーター特徴は、社会認知や人間ロボットインターフェースなどの行動認識の核心です。人間ロボットのNICO用に、2段階のアーキテクチャを提案します。1段階目では、腕の移動を学習するSOMと、手の移動を学習するSOMを使用

コンピュータビジョン動画認識分類テキスト動画

用途: マニピュレーターの動作モーター特徴を解決する
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン3D・点群分類画像動画

MVP-Tac: A Miniaturized Dual-Modal Vision and Photoelastic Tactile Sensor for Robot-Assisted Minimally Invasive Surgery

Robot-assisted minimally invasive surgery (RMIS) offers major benefits over open and conventional laparoscopic

用途: 分類
難易度: Hard
コスト: High

Mixing-Free and Signal-Optimal Learning of Gaussian Graphical Models from Glauber Dynamics

グラフ構造をグラウバー動力学から回復するために、ミクシングフリーや最適信号回復に重点を置く手法を研究します。

コンピュータビジョンセグメンテーション回帰

用途: データ分析におけるグラフ構造の回復
難易度: Hard
コスト: Medium

Optimizing the Preconditioner: A Black-box Online-to-Nonconvex Conversion with Static Regret Minimization Oracles

この研究では、非凸最適化をオナミ式最適化と変換する方法を提案します。この変換は、静的遺憲最適化の学習者が順列的グレードトラッカーを維持し、静的遺憲最適化では選択できるプレダクターコンパラタートを選択することで実現されます

用途: 非凸最適化のバックボックス変換
難易度: Hard
コスト: Medium

Organization of computation in reservoir computing

Reservoir computing exploits nonlinear dynamical systems to encode temporal inputs into high-dimensional state

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Logical Judgments Under Pressure: Diagnosing Syllogistic Stability with Learned Soft Prefixes

To test how correct logical judgments respond to learned context, we prepend a soft prefix to an exactly label

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョン3D・点群3D

Two-Stage Extrinsic Calibration of a Static Line-Scanning Lidar with a Rotary Platform

A line-scanning lidar yields range and azimuth values in a fixed plane. To perceive surrounding objects in 3D,

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンマルチモーダル画像

MAGE: Human-Like Macro Placement via Agentic Multimodal Reasoning

Macro placement still requires substantial manual refinement in industrial physical design flows. We present M

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Optimization of sim-to-real transfer in the humanoid robot NICO

existing robotic grasping methodの限界を解決するためのsim-to-real transfer methodを提案し、成功率を向上させる。

コンピュータビジョン物体検出検出画像

用途: ロボットの手順を解決する
難易度: Hard
コスト: Medium

Learning Adaptive Safety Margins for Visual Navigation

Robots in cluttered indoor spaces often fail not because they cannot generate collision-free paths, but becaus

コンピュータビジョン3D・点群画像テキスト3D

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Imitation of Arm Gestures by the Semi-Humanoid Robot NICO

existing HRI methodの制約を解決するためのgestures imitation methodを提案し、robustなジェスチャー認識を達成する。

コンピュータビジョン3D・点群3D

用途: 人間のジェスチャーの模倣を解決する
難易度: Hard
コスト: High

World Translation: Minimizing Sim-to-Real Gap with Backward Dynamics Extraction and Unpaired Domain Translation

existing robotic control methodの限界を解決するためのbackward dynamics extractionとunpaired domain translation methodを提案し、

用途: ロボットの実行を解決する
難易度: Hard
コスト: Medium

Importance Sampling and PCA for Finding Failures in Commercial Autonomous Vehicles

existing fault detection methodの限界を解決するためのadaptive stress testing methodを提案し、商用自動運転システムの故障率を減らす。

コンピュータビジョンセグメンテーション強化学習

用途: 自動運転システムの故障検出を解決する
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン物体検出検出音声

Technical Design Review of Duke Robotics Club's Oogway & Crush: AUVs for RoboSub 2026

existing AUV development methodの制約を解決するためのrobustなオートニモティクス基盤と機械学習アライアンスを開発する。

用途: ROBOCUPのAUV開発を推進する
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーション生成3Dマルチモーダル

Closing the Loop in Humanoid VLA: Persistent 3D Object Tokens for Verifiable Loco-Manipulation

existing VLA methodの制約を解決するためのpersistent object token methodを提案し、ロボット制御をより実用的なものにする。

用途: 人間のロボット制御を解決する
難易度: Hard
コスト: High

RynnBrain 1.1: Towards More Capable and Generalizable Embodied Foundation Model

existing Embodied Foundation Modelの制限を解決するためのcontact-point prediction とnative 3D grounding methodを提案し、更に能力と

コンピュータビジョンセグメンテーション検出3D

用途: Embodied Foundation Modelの制限を解決する
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成画像動画

Does Robust VIO Need More Learning? Geometry-Verified Visual Measurements under Distribution Shift

Learning is increasingly introduced into visual-inertial odometry (VIO), ranging from learned feature front-en

用途: 生成
難易度: Hard
コスト: High

UMCP: A Unified Multi-Task Collaborative Perception Network for Luggage Trolley Pose Estimation

ロボット車の視覚システムは、高精度でリアルタイム性能を持つロジスティクス車両の位置検出を実現する必要があります。従来の手法では、複数のモデルが連続してインフェレンズされ、インフェレンスラティシーが増加し、高規模デプロイメ

コンピュータビジョン物体検出検出画像

用途: luggage trolleyの位置推定
難易度: Hard
コスト: Medium

A2RL V\textsubscript{max}: The A2RL autonomous racing dataset for long-range, high-speed perception and multi-vehicle interaction

In autonomous driving development, a perception dataset is crucial, as it provides fundamental data for traini

コンピュータビジョン3D・点群検出テキスト3D

用途: 検出
難易度: Hard
コスト: High

From Sign Language Generation to Humanoid Execution: Vision-Language Guided Retargeting with Collision Mitigation

この論文では、ラインダブルロボットのための自発的アクション生成を実現することを目標とし、vision-language 指向性の指令によりロボットが自発的に動作することができることを示します。

コンピュータビジョン3D・点群生成画像3D

用途: ラインダブルロボットのための自発的アクション生成
難易度: Hard
コスト: High

Monotonicity and Frank-Wolfe Dynamics in Atomic Splittable Congestion Games

We study universal monotonicity and Frank--Wolfe stability properties for atomic splittable congestion games.

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-07-20

DiFA: Inference-Time Forward-Process Alignment for Diffusion Models

The prevailing inference framework for diffusion models formulates generation fundamentally as a problem of nu

コンピュータビジョン画像分類生成画像

用途: 生成
難易度: Easy
コスト: High

huggingfaceGitHubありHugging Faceあり2026-07-20

WorldCupArena: Fine-Grained Evaluation of Language Models and Deep-Research Agents on Football Forecasting

Predicting a football match before kickoff requires more than knowing past results: a model must use changing

コンピュータビジョンセグメンテーション予測テキスト

用途: 予測
難易度: Easy
コスト: Low

センサ/時系列コンピュータビジョンセグメンテーション検出画像3D

DeeperRadar: End-to-End MIMO Radar Design and Multi-Modal Fusion for Autonomous Vehicle Perception

DeeperRadar is a radar-centric, sensor-stack-conditioned framework that co-designs radar sensing and multi-mod

用途: 検出
難易度: Hard
コスト: High

Multi-Resolution Voxelized Map-Based Stereo Visual-Inertial Odometry

Incorporating prior maps significantly enhances the accuracy and robustness of pose estimation in visual-inert

コンピュータビジョン3D・点群画像3D

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーションテキスト動画マルチモーダル

From Perception to Assistance: Open-Vocabulary Shared Autonomy for Robotic Manipulation

Teleoperating a robotic manipulator in industrial environments demands precision that camera-based interfaces

用途: セグメンテーション
難易度: Hard
コスト: High

MI向きセンサ/時系列コンピュータビジョンマルチモーダル

Asynchronous Multimodal Diffusion Policy Composition via Latency-Aware Guidance Fusion

Diffusion policies have shown strong potential for robotic imitation learning, and recent extensions incorpora

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Move First, Commit Later: Selective LiDAR-to-BIM Global Initialization via Sequential Consensus with Symmetry-Aware Abstention

Global LiDAR-to-BIM initialization must place a robot within an as-designed building model without a prior pos

コンピュータビジョンセグメンテーション3D

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Temporal Fair Division of Indivisible Goods with Structured Constraints

This paper investigates temporal fair division, a setting where items are allocated over multiple rounds and a

コンピュータビジョン動画認識テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

表形式向き品質予測/異常検知コンピュータビジョンセグメンテーション生成画像表形式

Semi-Supervised Conditional Diffusion via Label Augmentation

Conditional diffusion models have become a powerful and flexible framework for learning complex conditional di

用途: 生成
難易度: Hard
コスト: High

User-Driven Learning from Demonstration: A Trajectory and Impedance Learning Method

This paper presents a method for user-driven robot Learning from Demonstration (LfD) that reduces user effort

コンピュータビジョンセグメンテーション3D

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

説明可能センサ/時系列コンピュータビジョンマルチモーダル生成画像

What Do They See? Interpreting Complex Road Scenarios Through the Eyes of Vision-Language-Action Models for Safe and Trustworthy Autonomous Vehicle Learning

End-to-end autonomous driving models are now able to navigate complex road scenarios, mapping raw sensor obser

用途: 生成
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション分類検出3D

InLiER: Learning-Free Heterogeneous LiDAR Place Recognition via Intermediate Mixed-Radix Structural Keypoint Tokenization

LiDAR place recognition supports loop closure, relocalization, and multi-agent map management. As robotic plat

用途: 分類
難易度: Hard
コスト: High

Token-Wise Latent Streaming from Slow Reasoners to Fast Planners for Dynamic Vision Language Navigation

Vision-Language Navigation in dynamic, human-centric environments exposes a fundamental tension: linguistic re

コンピュータビジョンマルチモーダル生成

用途: 生成
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョン物体検出検出画像

Hybrid Machine Learning for Articulation Angle Estimation of Truck-Semitrailer Combinations

Accurate articulation angle estimation of trucks with trailers is critical for autonomous driving and advanced

用途: 検出
難易度: Hard
コスト: Medium

PhyAgentOS: A Self-Evolving Operating System for Embodied Agents with Decoupled Cognitive Planning and Physical Execution

Vision-language-action models, world models, and agentic planners each advance physical intelligence, yet thei

MI向きコンピュータビジョンマルチモーダル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンマルチモーダル検出画像テキスト

Autonomous VR-Based Risk Detection for Situational Awareness in Dangerous Settings

In high-risk environments such as disaster response, situational awareness depends not only on detecting hazar

用途: 検出
難易度: Hard
コスト: High

githubGitHubあり2026-07-18

maths-cs-ai-compendium — Become a cracked AI/ML Research Engineer

Becoming a cracked AI/ML Research Engineerには、AI/ML研究者のスキルと知識を高めるための手法が紹介されています。

コンピュータビジョンマルチモーダルテキスト音声

用途: AI/ML研究者を育成
難易度: Easy
コスト: High

Projective Maximum Entropy: Universality and Acceptance-Region Calibration

Maximum-entropy reference distributions are usually constructed on the normalized probability simplex. This fo

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

コンピュータビジョンセグメンテーションテキスト教師あり教師なし

MTSSL: Meta-Thresholding Semi-Supervised Learning

A large body of Semi-supervised Learning~(SSL) algorithms encounter the threshold $τ$ to select pseudo-labels.

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

表形式向きコンピュータビジョンセグメンテーション生成表形式

Do Generative Models Keep Time? A Time-Aware Evaluation of Synthetic Sequential Tabular Data

Synthetic sequential tabular data are increasingly used for privacy-preserving data sharing, yet a generator c

用途: 生成
難易度: Hard
コスト: Low

説明可能センサ/時系列コンピュータビジョンセグメンテーション

On the Role of Normalization in Binary Iterative Hard Thresholding for 1-bit Compressed Sensing

1ビット圧縮センシングは、情報が圧縮された状態で保存され、データ量の最適化が必要で、この問題を解決するために、Binary Iterative Hard Thresholding（BIHT）を最適化する方法を提案。

用途: 1ビット圧縮センシングの最適化
難易度: Hard
コスト: Medium

Transient State Reorganization and Cell Differentiation in the Developmental Dynamics of Growing Neural Cellular Automata

Neural Cellular Automataが複雑な形状を形成するプロセスを研究しました。

コンピュータビジョン動画認識検出

用途: 画像認識
難易度: Hard
コスト: High

品質予測/異常検知コンピュータビジョンマルチモーダル画像強化学習

Foresight Residual RL for Long-Horizon Robot Manipulation with Vision-Language-Action Models

Vision-Language-Action (VLA) policies offer strong general-purpose manipulation priors, but often fail on tigh

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

コンピュータビジョンセグメンテーション生成テキスト

Handroid: Bridging Dexterous Hand and Humanoid

この研究では、Robotのヘッドレスアンドメインアームの両方を1台のロボットに組み込み、両機能を切り替えれるようにする技術、Handroidを開発しています。

用途: ヘッドレスアンドメインアームの両方の開発
難易度: Hard
コスト: Medium

VTLoc: Learning-based Tactile Contact Localization in Visual Point Clouds

VTLocフレームワークは、視覚情報と触覚情報を統合し、ロボットハンドの位置を推定することで、ロボットハンドの位置推定と動作操作を実現します。

コンピュータビジョン3D・点群検出画像テキスト

用途: ロボットハンドの位置推定
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーションマルチモーダル

BayesContact: Uncertain Pose Estimation via Visuo-Tactile Proposals and Simulation-based Inference

この研究では、Vision-とTactile-based ProposalとSimulation-based Inferenceを組み合わせ、物体の位置と姿勢を推定する方法、BayesContactを提案しています。

用途: 視覚情報と触覚情報の融合によるロボットの動きの推定
難易度: Hard
コスト: High

huggingfaceHugging Faceあり2026-07-17

Beyond Success Rate: Cost-Aware Evaluation of Offensive and Defensive Security Agents

Security-agent evaluations commonly measure peak offensive capability under generous inference budgets, emphas

用途: 技術検証・論文読解補助
難易度: Easy
コスト: Medium

Delocalization of bias in unadjusted Hamiltonian Monte Carlo and underdamped Langevin

この研究では、調整されていない HMC と Langevin Sampler の偏りの解消について議論しました。調整されていないサンプラーは、通常、偏りのあるものであることが知られています。この研究

コンピュータビジョンセグメンテーション検出

用途: HMC と Langevin Sampler の偏りの解消
難易度: Hard
コスト: Medium

Subjective Risk Decomposition: A New View for Uncertainty Quantification

高次元カテゴリデータを視覚化できるツール「cGAP」を開発。 heat mapsを含む視覚化フレームワークの開発。

用途: 可視化ツールが提供される高い次元数のカテゴリのデータ
難易度: Hard
コスト: Medium

センサ/時系列コンピュータビジョンセグメンテーション検出時系列

Post Hoc Inference for Component Attribution in Multivariate Change-Point Detection

時系列データを観察し、その中に分割が起こっているかどうかを検知する方法はある。時間系列変化点を検知することができ、変化点の位置がどこにあるかを推定することができる。

用途: 時系列データの分割探知
難易度: Hard
コスト: Low

Precise sample covariance spectral norm error -- an RDT view

この研究では、サンプル協方差行列の精度を向上させる方法を検討しました。特に、サンプルサイズが小さく、収集されたデータの特性から、正解率の期待値は小さい場合に、問題が最も発生しやすくなる可能性があります。この研究では、正解

用途: サンプル协方差行列の精度を向上させる
難易度: Hard
コスト: Medium

PAC Learning in Turn-Based Stochastic Games with Reachability Objectives: A Decentralized Private Approach via Expected Conditional Distance

この研究は、多人数確率ゲームへのPAC学習を研究することに関心があります。PAC学習は、機械学習モデルの確信度を高めると同時に、モデルの誤差を低下させるものです。

コンピュータビジョンセグメンテーション強化学習

用途: 多人数確率ゲームへのPAC学習を研究する
難易度: Hard
コスト: Medium

githubGitHubあり2026-07-16

pcl — Point Cloud Library (PCL)

3D点群処理のためのライブラリであるPoint Cloud Library（PCL）。

コンピュータビジョン3D・点群3D

用途: 3D点群処理
難易度: Easy
コスト: High

Minimax Theory of Likelihood-Based Deep Learning for Speckle Regression

マルチ倍雑音を考慮したスペックルの除去法を提案し、特徴間の相関と正確なモデルを考慮することで、複雑な雑音モデルに対応した。

コンピュータビジョンセグメンテーション回帰

用途: 雑音の除去
難易度: Hard
コスト: Medium

Plausible Deniability Guarantees for Whistleblowers

Whistleblowers are a key safeguard against organizational wrongdoing, but the threat of retaliation deters rep

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Cluster with Auctions for Vector Search

距離検索のためのオーソリティ検索 (Cluster with Auctions for Vector Search) を提案し、検索プロセスを効率化しやすくします。

用途: 距離検索のオーソリティ検索
難易度: Hard
コスト: Medium

Non-Expansive Two-Time-Scale Stochastic Approximation: A Fixed-Schedule One-Quarter Barrier and Bias-Corrected Acceleration

不拡張確率過程を用いた解析を行い、時刻スケールに基づいて過程を分解します。この分解により、遅いスケールに基づく収束の分析が実現されます。

用途: 多相の時刻スケールをもつ不拡張確率過程の解析
難易度: Hard
コスト: Medium

Evaluating Encoding Strategies for Closed-Loop Classification in Biological Neural Networks

Interfacing with Biological Neural Networks (BNNs) requires encoding information into stimulation patterns tha

コンピュータビジョン動画認識分類画像

用途: 分類
難易度: Hard
コスト: High

When Is Delegated Play Truthful? Within-Range Regret and the Trilemma of Aligned Delegation

Advertisers delegate bidding to autobidders; users delegate tasks to language-model agents. A person describes

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-07-15

Generalizable VLA Finetuning via Representation Anchoring and Language-Action Alignment

Finetuning a pretrained vision-language model (VLM) on robot demonstrations via behavior cloning (BC) has beco

コンピュータビジョンセグメンテーション画像テキストマルチモーダル

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-15

Open-AoE: An Open Egocentric Manipulation Dataset and Toolchain for Embodied Learning

Egocentric videos of human manipulation provide scalable supervision for embodied intelligence, yet existing r

コンピュータビジョンセグメンテーション画像テキスト動画

用途: セグメンテーション
難易度: Easy
コスト: High

説明可能コンピュータビジョンセグメンテーションマルチモーダル

Ensemble Controlled-Flow Filtering for Implicit Data Assimilation

非線形オブザーバシオンメカニズムや多次元データには適合しない伝統的なエンサンブルフィルタリングアルゴリズムを導入し、隠蔽データアシミレーションを提案

用途: 隠蔽データアシミレーション
難易度: Hard
コスト: High

CPUで試しやすいコンピュータビジョンセグメンテーション画像

LatentFlow: A General Framework for Conditioning Stochastic Processes

ストロチャスティックプロセスに観察値を組み込むことが困難であれば、単に観察値を観察できるものを学習しているという理解を拡張する新しいフレームワークを発表

用途: ストロチャスティックプロセスの調整
難易度: Hard
コスト: High

Thompson Sampling Is 2-Competitive for Mistakes

バンディット問題を解くために、Thompson sampling法を用いる。

用途: バンディット問題を解く
難易度: Hard
コスト: Medium

The Limits of Price Discrimination with a Bayesian Seller

販売者は顧客を異なるセグメントに区分し、顧客に異なる価格を設定することで売上を最大化したい。その場合、顧客の幸福と販売者の幸福がどのように関係するかを調べた。

用途: プライスディスカイミネーション
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-07-14

ReflectWorld-MM: An Entity-Oriented Multimodal Memory System for Open-Ended Video Streams

Building assistants that can continually watch the world, remember what they see, and reason over their accumu

コンピュータビジョンマルチモーダル画像テキスト音声

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション時系列

Causal Graphs, Markov Properties and Do-calculus for Stochastic Differential Equations

Stochastic differential equations (SDEs) are widely used to model continuous-time dynamical systems, but graph

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

Fundamental Limitations of Fixed-Budget Best-Arm Identification

ベストアームのアイデンティファイメントについては、固定予算の限界があり、高い精度を獲得するのは困難です。

用途: ベストアームのアイデンティファイメント
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーションテキスト

Trustworthy synthetic data for campaign decision support: strategy simulation fidelity and the PolicySynth framework

Decision support systems (DSS) increasingly run retention what-if analysis on synthetic customer populations,

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

センサ/時系列コンピュータビジョンセグメンテーション画像音声

Difference-Driven Gating: Adaptive Feature Fusion for U-Net Decoder

この研究では、新しい特徴融合手法を提案した。この手法は、上からの特徴と下からの特徴の関係性を考慮することで、特徴を効率的に融合し、三次元データを2次元サムライグラフにコンパクトに表現する機能をもたらせる。

用途: 特徴融合
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョンセグメンテーション生成

Representing the Non-dominated Set of Multi-objective Network Problems by Supported Non-dominated Points

In multi-objective combinatorial optimization, unsupported non-dominated points typically outnumber supported

用途: 生成
難易度: Hard
コスト: Medium

APMM: Automated Parlay Market Maker

Parlays - joint contracts on the simultaneous resolution of several events - are among the most heavily traded

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

MI向きコンピュータビジョンセグメンテーションテキスト

One Vote, Several Parliaments: An Empirical Analysis of the Algorithmic Ambiguity of the Italian Electoral Law on the 2022 General Election Data

Crafa's algorithmic analysis of the Italian electoral law (the "Rosatellum") showed that the statutory text de

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

From PBS to ePBS: the Microstructure of Block Building

Ethereum's Glamsterdam upgrade introduces enshrined proposer-builder separation (ePBS), replacing relay-centri

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-07-13

See like a Robot: Robot-Centric Pointmaps for Vision-Language-Action Models

Vision-language-action (VLA) models predict robot actions from visual observations and language instructions.

コンピュータビジョン3D・点群画像3Dマルチモーダル

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-13

SVR-R1: Bootstrapping Multi-modal Reasoning with Self-verification in Reinforcement Learning

We introduce Self-Verified Reasoner (SVR-R1), a multi-turn RL framework that turns a model's own verification

コンピュータビジョンセグメンテーション生成マルチモーダル強化学習

用途: 生成
難易度: Easy
コスト: High

githubGitHubあり2026-07-13

UniPic — Open-source SOTA multi-image editing model

UniPicは、オープンソースの最先端の画像編集モデルの実装です。

コンピュータビジョンマルチモーダル生成画像

用途: 多画像編集モデルの実装
難易度: Easy
コスト: High

arxivPaper only2026-07-12

The Spectral Structure of Latent Treatment Effects

Identifying heterogeneous treatment effects under unobserved confounding is central in observational causal in

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-07-12

Demixing Sparse Signals from Nonlinear Observations using Generalized Non-convex Regularization

We consider the recovery of a pair of sparse vectors from a limited number of nonlinear observations of their

説明可能コンピュータビジョンマルチモーダル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

arxivPaper only2026-07-12

Representation theorems for actual and alpha powers over general concurrent game frames without assuming independence of agents

Concurrent game frames are a standard semantic framework for logics of strategic reasoning. Two notions of coa

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-07-10

Deep Gaussian Processes on Directed Acyclic Graphs

この研究は、多くの現実世界のプロセスがディレクトグラフ(DAG)上で実装できることを証明しました。 DAG上の関数の部分観測、観測のノイズや不規則な測定値は、機能の再構成、不確実性の伝播、推定に大きな障壁となることがあり

説明可能コンピュータビジョンセグメンテーション

用途: DAG上での実験結果の再構成
難易度: Hard
コスト: Medium

arxivPaper only2026-07-10

Characterization of the basin of convexity for multi-snapshot spike deconvolution via variable projection

We study the problem of multi-snapshot spike deconvolution, where the goal is to recover the locations of spar

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-07-10

Threshold Dynamics and Correlated Prophet Inequalities

Prophet inequalities have become a central tool for analyzing the performance of online algorithms. However, m

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-07-09

Multi-Sender Bayesian Persuasion with Imperfect Information

エキスパート間で信頼性の高い意思伝達を行うために設計されたフレームワーク。

用途: エキスパートの意思伝達
難易度: Hard
コスト: Medium

huggingfaceHugging Faceあり2026-07-08

Agon: Competitive Cross-Model RL with Implicit Rival Grading of Reasoning

Reinforcement learning from verifiable rewards (e.g. GRPO) is the engine behind today's reasoning models, yet

コンピュータビジョンセグメンテーションテキスト強化学習

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

arxivPaper only2026-07-07

Slack and Budget Breaking in Threshold Team Production

しきい値システムでは、任意のタスクを実行するには、$\Nstar = \kappa + \Delta$の公証書が必要です。この場合、$\Delta$が余分な公証書で、タスクの遅延は公証書が$\Nstar - \kappa

用途: タスクの完了を確保する方法
難易度: Hard
コスト: Medium

arxivPaper only2026-07-06

A Large-Scale Sparse Multiobjective Optimization Algorithm Based on Optimal Performance Scores

この論文では、大規模スパース多目標最適化の問題に取り組むために、新しく提唱された適応可能な初期値生成アルゴリズムを提案し、アルゴリズムの効率とパフォーマンスを評価する。

品質予測/異常検知コンピュータビジョンセグメンテーション生成

用途: 大規模スパース多目標最適化
難易度: Hard
コスト: Medium

arxivPaper only2026-07-06

Strategic Buying Agents

オンライン購入の最適化を目的とするストラテジックビーイングアージェントフレームワークを発表する。

用途: オンライン購入の最適化
難易度: Hard
コスト: Medium

品質予測/異常検知コンピュータビジョン3D・点群生成画像3D

githubGitHubあり2026-07-06

Magic123 — [ICLR'24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Magic123は、画像を1枚入力し、画像と3Dデータ双方の情報を利用して高質の3Dオブジェクトを生成することができる。

用途: 高質の3Dオブジェクト生成
難易度: Easy
コスト: High

arxivPaper only2026-07-03

The Oracle's Gambit: A Game-Theoretic Framework for Responsible AI Release

Responsible vulnerability disclosure can secure the defender's head start by controlling when a vulnerability

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

githubGitHubあり2026-07-03

EEGUnity — An open source tool for large-scale EEG datasets processing

ビデオ diffusioin trasformerは、ビデオの長さに依存しない推論能力を持っているが、この長さのエキサポレーションは実際には困難なものである。RIFLExという手法を開発し、ビデオ長さのエキサポレーション

コンピュータビジョンマルチモーダル

用途: ビデオ diffusioin trasformerで長さのエキサポレーション
難易度: Easy
コスト: High

arxivPaper only2026-07-01

MMAO-Cls: Metabolic Multi-Agent Optimization for Joint Feature Selection and Classifier Tuning

マルチアジェント最適化を使用して、クラスター選択とモデル調整のためのMMAOクラスの実現を提案しました。

表形式向きコンピュータビジョンセグメンテーション分類表形式

用途: クラスター選択とモデル調整のためのメタボリックマルチアジェント最適化
難易度: Hard
コスト: Low

arxivPaper only2026-07-01

Online Fair Division Meets Reordering Buffers

この研究では、個人的な価値が付与されたアイテムを公平に分配する問題を研究します。アイテムは個人の価値を付与することがあり、アイテムの分配が公平または公正ではない場合があります。この研究では、分配に公平性を考慮する方法を提

用途: 分配問題の公平性
難易度: Hard
コスト: Medium

arxivPaper only2026-06-30

The Cooperation Ceiling: Extrinsic Population Dynamics and the Intrinsic Escape

共作を促進するための進化ゲーム理論の研究。この研究では、複数の個人が協力することで共作を促進するためのゲーム理論的枠組みを開発する。

用途: 共作を促進するための進化ゲーム理論の研究
難易度: Hard
コスト: Medium

githubGitHubあり2026-06-30

AirSim — Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Microsoftにより開発されたオープンソースのシミュレータ、AirSimはリアルテンポでの自動運転車の動作をシミュレートすることができます。

用途: 自動運転車のシミュレーション
難易度: Easy
コスト: Medium

arxivPaper only2026-06-29

From Detecting Agency to Doing Work: Self-Caused Credit Builds a Durable Behavioral Self in a Minimal Spiking Agent

How does an agent that can tell self from world come to be durably shaped by that distinction? Recent work sho

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-29

Revenue Guarantee of Anonymous Pricing for Mixed Bidders:Bridging Value and Utility Maximizers

Mechanism design increasingly faces heterogeneous environments containing both traditional utility maximizers

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-29

I.i.d. Prophet Inequalities with Discounted Rewards: As Hard as the Non-i.i.d. Case

We study prophet inequalities with discounted rewards, where i.i.d. base rewards are multiplicatively discount

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-28

Generalized Bidding Games: Where Bidding and Stochastic Games Meet

Two-player games on graphs are a classical framework for analyzing strategic decision making. In turn-based ga

コンピュータビジョンセグメンテーション生成

用途: 生成
難易度: Hard
コスト: Medium

githubGitHubあり2026-06-28

CoreNLP — CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

CoreNLPはJavaで開発されたNLPツールのセットであり、分割、文分割、名詞認識、パーシング、コorefence、感情分析などを行える。

用途: 分析
難易度: Easy
コスト: Low

arxivPaper only2026-06-27

The Two Genie Game: Adoption and Welfare in Audit-Grounded AI Governance

We ask under what conditions an agent with a harm-minimizing policy can displace an approval-seeking (RLHF) ag

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-26

Discrete Event Population Updates: finding game theoretic emergent behaviour in queueing systems with simulation

カウントアップし合うゲームでは、プレイヤーが行動を決定し、他のプレイヤーにも反映されるメカニズムが含まれます。ゲーム理論を用いて、プレイヤーがどのような戦略で行動するかを分析し、それを計算を含むモデルとして表現することで

MI向きコンピュータビジョンセグメンテーション

用途: カウントアップし合うゲームの分析
難易度: Hard
コスト: Medium

githubGitHubあり2026-06-26

visionary — Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

コンピュータビジョン3D・点群3D

用途: 実装・検証基盤
難易度: Easy
コスト: High

arxivPaper only2026-06-25

Learning Anonymous Pricing for Online Resource Allocation

この研究では、オンラインリソース分配のアルゴリズムを提案している。このアルゴリズムは、リソースの供給と要求のバランスを考慮しながら、効率的な分配を目指している。

用途: オンラインリソース分配
難易度: Hard
コスト: Medium

arxivPaper only2026-06-24

The Red Queen Gödel Machine: Co-Evolving Agents and Their Evaluators

この論文では、エージェントの評価を一連の開発の間で共進化させるための新しい方法を提案します。

用途: エージェントの評価を一連の開発の間で共進化させる
難易度: Hard
コスト: Medium

arxivPaper only2026-06-23

Spatial Partial Functionalization of Neural Networks based on Noise Fields

この研究では、スペース的部分関数化されたニューラルネットワークを提案します。

用途: スペース的部分関数化されたニューラルネットワークの開発
難易度: Hard
コスト: Medium

arxivPaper only2026-06-22

An Open-Source LFSR-Based Stochastic Leaky Integrate-and-Fire Neuron in SkyWater 130 nm: Design, Stochastic Characterisation, and Rate Coding

Stochastic spiking neurons trade exact arithmetic for controlled randomness, lowering area and tolerating inpu

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-22

Mass Conservation as an Inductive Bias for Self-Organized Criticality in NCA Reservoirs

Self-organized criticality (SOC), a dynamical regime associated with maximal information processing, offers a

品質予測/異常検知コンピュータビジョン動画認識分類

用途: 分類
難易度: Hard
コスト: High

arxivPaper only2026-06-21

A Theory-grounded Hybrid Neural Network Integrating Complementary Estimation Mechanisms for Stable Visual Object TrackingA

Hybrid neural networks (HNNs) that integrate artificial neural networks (ANNs) with brain-inspired neural netw

コンピュータビジョンセグメンテーション画像

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-20

Quantifying Theoretical AI Alignment Guarantees: Receiver-Utility Bounds in Bayesian Persuasion

Misalignment can change how information moves from an AI agent to a human user. We model this as an informatio

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-19

Gradient-Free Warm-Start Library Recovery: an Amortized-Regret Separation

Continual learning that is gradient-free, local, online, and append-only is attractive for edge and streaming

コンピュータビジョンセグメンテーション分類検出

用途: 分類
難易度: Hard
コスト: Low

arxivPaper only2026-06-19

Distance-based subsidy rate design to incentivize ride-hail access to advanced air mobility hubs

The success of advanced air mobility (AAM) operations is largely contingent on its effective integration with

コンピュータビジョンマルチモーダル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

arxivGitHubあり2026-06-18

Provably Sub-Linear Two-Timescale NeuroEvolution with Online Plasticity

NeuroEvolution of Augmenting Topologies (NEAT) is a widely used neuroevolution algorithm for learning neural n

コンピュータビジョンセグメンテーション強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-15

Neural dynamical systems on ferroelectric compute-in-memory for real-time forecasting

ネットワークダイナミクスシステムを使って時間系列予測を高速化し、ニューロモルフィックコンピューティングを活用した。

コンピュータビジョン動画認識予測

用途: 時間系列予測を高速化するためのフェロイレクトリックコンピュートインメモリシステム
難易度: Hard
コスト: High

arxivPaper only2026-06-15

Wavelength-Multiplexed 2D Beam Steering via a Passive Diffractive Network

ワーブレートを利用

センサ/時系列コンピュータビジョン3D・点群3D

用途: ワーブレートを利用した2Dビームステリング
難易度: Hard
コスト: High

arxivPaper only2026-06-12

Comparison Patrols on Drifting Orders: Certified Rank Maintenance, Evolving Planar Maxima, and Selection under Drifting Fitness

Rank-based selection in dynamic environments acts on order information that becomes stale while it is being us

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-06-12

Directing Open-Ended Evolution in Artificial Life via Multi-Scale Path Divergence

Open-ended evolution (OEE) in artificial life is typically driven by uninterpretable, black-box neural-network

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium