Deep Learning

transformers — 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Deep learning, Transformer, Classification, Text, Audio

Use case: Machine learning model definition
Difficulty: Easy
Cost: High

tesseract — Tesseract Open Source OCR Engine (main repository)

An open-source OCR engine.

Use case: Optical character recognition
Difficulty: Easy
Cost: Medium

DeepSpeed — DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

A deep learning optimization library that makes distributed training and inference easy and efficient.

Use case: Deep learning optimization library
Difficulty: Easy
Cost: High

opencv — Open Source Computer Vision Library

An open-source computer vision library for analyzing images and video.

Deep learning, Image

Use case: Image and video analysis
Difficulty: Easy
Cost: High

cs249r_book — Machine Learning Systems

A book on the theory and implementation of machine learning systems.

Deep learning, Text

Use case: Machine learning systems
Difficulty: Easy
Cost: Medium

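Paddle — PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle (飞桨) core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)

A machine learning framework that grew out of industrial practice in China, supporting parallel and distributed processing.

Use case: Machine learning framework
Difficulty: Easy
Cost: Medium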

datasets — 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

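A platform that provides ready-to-use datasets for AI models.

Deep learning, Compression/quantization, Audio

Use case: Dataset hub
Difficulty: Easy
Cost: Medium

Deep learning, Transformer, Segmentation, Image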

segmentation_models.pytorch — Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

A library of semantic segmentation models.

Use case: Semantic segmentation models
Difficulty: Easy
Cost: High

Deep learning, Transformer, Image, Text, Multimodal

sglang — SGLang is a high-performance serving framework for large language models and multimodal models.

SGLang is a high-performance serving framework for large language models and multimodal models.

Use case: Serving framework for large language models
Difficulty: Easy
Cost: High

Sana — SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

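SANA is a high-resolution image generation model that produces strong high-resolution images at low computational cost.

Use case: High-resolution image synthesis
Difficulty: Easy
Cost: High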

openvino — OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

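An open-source toolkit for optimizing and deploying AI inference.

Deep learning, Transformer, Classification, Generation, Audio

Use case: Optimizing and deploying AI inference
Difficulty: Easy
Cost: Low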

transformerlab-app — The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

An open-source research environment in which AI researchers can train, evaluate, and scale models from local hardware to GPU clusters.

Use case: Open-source research environment for AI researchers
Difficulty: Easy
Cost: Medium

FastVideo — A unified inference and post-training framework for accelerated video generation.

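FastVideo is a unified inference and post-training framework for accelerated video generation.

Deep learning, Compression/quantization, Generation, Video

Use case: Accelerating video generation
Difficulty: Easy
Cost: High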

LightX2V — Lightweight Image Video Action Generation Inference Framework

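A lightweight inference framework for image, video, and action generation.

Deep learning, Compression/quantization, Generation, Image, Video

Use case: Lightweight AI inference infrastructure
Difficulty: Easy
Cost: High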

FastGen — NVIDIA FastGen: Fast Generation from Diffusion Models

NVIDIA FastGen targets faster generation from diffusion models.

Use case: Accelerating diffusion models
Difficulty: Easy
Cost: High

vllm — A high-throughput and memory-efficient inference and serving engine for LLMs

A high-throughput, memory-efficient inference and serving engine for LLMs.

Use case: LLM inference and serving
Difficulty: Easy
Cost: High

haystack — Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

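An open-source AI orchestration framework for designing the pipelines and agent workflows needed to build LLM applications.

Deep learning, Transformer, Generation, Summarization, Text

Use case: Building LLM applications
Difficulty: Easy
Cost: High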

peft — 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

A framework for state-of-the-art parameter-efficient fine-tuning.

Use case: Parameter-efficient fine-tuning
Difficulty: Easy
Cost: Medium
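
As a rough illustration of why parameter-efficient fine-tuning is cheap, the sketch below counts trainable parameters for a low-rank (LoRA-style) update of a single weight matrix. It is plain Python and does not use the peft API; the layer size and rank are assumed values chosen only for illustration.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in a low-rank update B @ A, with A: rank x d_in and B: d_out x rank."""
    return rank * d_in + d_out * rank


def full_params(d_in: int, d_out: int) -> int:
    """Parameters in the dense weight matrix that full fine-tuning would update."""
    return d_in * d_out


# Hypothetical layer size and rank, chosen only for illustration.
d_in, d_out, rank = 4096, 4096, 8
full = full_params(d_in, d_out)
lora = lora_trainable_params(d_in, d_out, rank)
ratio = lora / full  # fraction of the dense matrix's parameters that are trained
```

Under these assumed sizes the low-rank update trains 1/256 of the parameters of the dense matrix.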

FunASR — Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenAI-compatible/MCP serving.

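An open-source speech recognition toolkit covering training, inference, streaming ASR, VAD, punctuation, and speaker diarization pipelines.

Deep learning, Transformer, Classification, Detection, Text

Use case: Speech recognition
Difficulty: Easy
Cost: High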

DocsGPT — Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

A private AI platform for agents, assistants, and enterprise search.

Use case: Private AI platform
Difficulty: Easy
Cost: Medium

LlamaFactory — Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

A library that simplifies fine-tuning of LLMs and VLMs.

Use case: Fine-tuning LLMs
Difficulty: Easy
Cost: High

tokenizers — 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

A library of fast tokenizers optimized for research and production.

Use case: Fast tokenization
Difficulty: Easy
Cost: Medium
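
To make the idea of subword tokenization concrete, here is a minimal sketch of one training step of byte-pair encoding (BPE), one of the algorithms that tokenizer libraries implement. It is plain Python, does not use the tokenizers API, and says nothing about speed; the toy corpus and the tie-breaking rule are assumptions made for illustration.

```python
from collections import Counter


def most_frequent_pair(words):
    """Return the adjacent symbol pair with the highest frequency-weighted count."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    # Break ties by the pair itself so the result is deterministic (an assumption).
    return max(pairs, key=lambda p: (pairs[p], p))


def merge_pair(words, pair):
    """Rewrite every word, replacing each occurrence of `pair` with one merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


# Hypothetical toy corpus: each word is a tuple of symbols mapped to its frequency.
corpus = {("l", "o", "w"): 5, ("l", "o", "w", "e", "r"): 2, ("n", "e", "w"): 3}
pair = most_frequent_pair(corpus)
corpus_after = merge_pair(corpus, pair)
```

Repeating these two steps until a target vocabulary size is reached yields the merge table that a BPE tokenizer applies at encoding time.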

Tabular, Deep learning, Transformer, Classification, Detection, Image

presidio — An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

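presidio is an open-source framework for detecting, redacting, masking, and anonymizing sensitive data across text, images, and structured data. It supports NLP, pattern matching, and customizable pipelines.

Use case: Protecting data privacy
Difficulty: Easy
Cost: Low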

Expanding Flow Maps

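Proposes Expanding Flow Maps, a new approach to flow-based generative models. Relative to conventional parameterizations restricted to a fixed dimension or fixed sequence length, Expanding Flow Maps ...

Use case: Development of flow-based generation techniques
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Visual inspection, Deep learning, Transformer, Detection, Generation, Image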

Synthetic data generation framework for quality control automation in gravure printing

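Proposes a synthetic data generation framework as a new approach to print quality control. The framework generates synthetic data for quality control in rotogravure printing, so that print ...

Use case: Development of print quality control techniques
Difficulty: Hard
Cost: High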

Graph Learning on Ensembles of Cyclic Peptides: An Investigation of Molecular Ensemble Modeling

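Proposes EnsembleEGNN (Equivariant Graph Neural Network), a new approach for molecular design. EnsembleEGNN uses equivariant graph neural networks to ...

Deep learning, Transformer, Self-supervised

Use case: Development of molecular design techniques
Difficulty: Hard
Cost: High

Sensor/time series, Quality prediction/anomaly detection, Deep learning, Compression/quantization, Text, Audio, Multimodal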

X$^3$-OPD: Distilling Reasoning into Large Audio-Language Models via On-Policy Alignment

Proposes X$^3$-OPD, a new approach for distilling reasoning into large audio-language models via on-policy ...

Use case: Development of reasoning techniques for large audio-language models
Difficulty: Hard
Cost: High

The Boundaries of Automation: A Theory of Persistent Human Participation

The rapid progress of AI has intensified the long-standing pursuit of automation: replacing human participatio

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Deep learning, Transformer, Image

KroQuant: Kronecker-Structured Block Transforms for Efficient Post-Training Quantization of Diffusion Transformers

Post-training quantization (PTQ) of diffusion transformers (DiTs) to W4A4 severely degrades output quality, be

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High
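
For readers unfamiliar with the W4A4 setting mentioned above, the sketch below shows plain symmetric uniform quantization of a few values to signed 4-bit codes. It is a generic baseline, not the Kronecker-structured method of the paper; the per-tensor scale and the example weights are assumptions made for illustration.

```python
def quantize_int4(values):
    """Map floats to signed 4-bit integer codes in [-8, 7] using one shared scale."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 7 if max_abs > 0 else 1.0  # 7 is the largest positive int4 code
    codes = [max(-8, min(7, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, chosen only for illustration.
weights = [0.7, -0.3, 0.12, 0.0]
codes, scale = quantize_int4(weights)
restored = dequantize(codes, scale)
```

With only 16 levels, a single large value stretches the scale and coarsens every other value, which is one common reason 4-bit quantization can degrade output quality.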

Sensor/time series, Deep learning, Transformer, Time series

A Diffusion-Model Subpopulation Digital Twin for Mobile Health Deployment: A Case Study on the HeartSteps Intervention

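Advances the development of personalized mobile health interventions. The study tests algorithms on realistic simulated users in order to develop personalized mobile health interventions ...

Use case: Development of personalized mobile health interventions
Difficulty: Hard
Cost: High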

Mean-to-Score Discrete Diffusion: Posterior-Mean Denoisers for Score Entropy

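Analyzes a Bayesian interpretation of discrete probabilistic models. To improve accuracy, the study proposes a discrete probabilistic model that constrains negative score ratios.

Use case: Bayesian interpretation analysis of discrete probabilistic models
Difficulty: Hard
Cost: High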

Hilbert Operator for Progressive Encoding (HOPE): A Mathematical Framework for Deconstructing Learned Representations in Deep Networks

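Proposes a framework for understanding the decomposed structure of learned representations. The study decomposes learned representations by compressing the network, which advances the understanding of those representations.

Use case: Framework for understanding the decomposed structure of learned representations
Difficulty: Hard
Cost: Medium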

Gradient Concentration, Not Weight Saliency, Explains Representation-Level Class Unlearning

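Argues that gradient concentration, rather than weight saliency, explains representation-level class unlearning. The study examines weight-selection methods for making a model forget without retraining it.

Use case: Understanding what drives machine unlearning
Difficulty: Hard
Cost: High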

How Many Bits Can an Adapter Write? Measuring the Capacity and Memorization of Parameter-Efficient Fine-Tuning

Use case: Measuring the capacity of parameter-efficient fine-tuning
Difficulty: Hard
Cost: Medium

Adaptive Depth Sparse Framework: Similarity-Driven Resource Allocation for Pre-Trained LLMs

Large language models (LLMs) achieve strong generation and reasoning performance, but the Transformer architec

Use case: Generation
Difficulty: Hard
Cost: High

The Dark Room in the Reward Channel: Dense Prediction Rewards Collapse GRPO-Trained LLM Agents -- and What Actually Works

Dense per-step supervision is an appealing remedy for sparse-reward, long-horizon LLM agents: reward the agent

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Video

DART: A Degradation-Aware Recurrent Transformer for Archival Film Restoration

Archival film restoration is a challenging problem because historical footage contains compound degradations s

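Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Sensor/time series, Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Forecasting, Text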

Transformer-based Diffusion models for Hydrological Time Series Probabilistic Imputation and Forecasting

The modeling of hydrometeorological time series with limited observations is a key challenge in the monitoring

Use case: Generation
Difficulty: Hard
Cost: High

Agree on the Model, Verify the Inference: GKR Protocols for HND-Based Transformer Inference

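Proposes GKR protocols for verifying outsourced, HND-based transformer inference, aiming to reduce the client's exposure to model substitution and incomplete execution.

Use case: Model verification
Difficulty: Hard
Cost: High

Explainable, Sensor/time series, Deep learning, Graph neural network, Text, Time series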

Demographically-Informed Heat-Mortality Risk Curves via Risk Graph Neural Networks

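Proposes a new method that uses Risk Graph Neural Networks (RGNN) to estimate heat-related mortality risk by combining demographic characteristics with regional information, and that may offer an alternative to DLNM.

Use case: Estimating heat-related mortality risk
Difficulty: Hard
Cost: Low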

Smooth Neural Point Processes via B-Splines

Proposes SmooTNTP, which combines B-splines with neural networks to improve the modeling of point processes.

Use case: Point process modeling
Difficulty: Hard
Cost: High
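
As background on the B-spline building block named above, the sketch below evaluates B-spline basis functions with the standard Cox-de Boor recursion. It is not the paper's model; the clamped cubic knot vector and the evaluation point are assumptions chosen for illustration.

```python
def bspline_basis(i, k, t, knots):
    """Cox-de Boor recursion: value at t of the i-th B-spline basis function of degree k."""
    if k == 0:
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0
    left = right = 0.0
    d1 = knots[i + k] - knots[i]
    if d1 > 0:  # skip terms whose knot span is empty (repeated knots)
        left = (t - knots[i]) / d1 * bspline_basis(i, k - 1, t, knots)
    d2 = knots[i + k + 1] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + k + 1] - t) / d2 * bspline_basis(i + 1, k - 1, t, knots)
    return left + right


# Hypothetical clamped cubic knot vector on [0, 3).
knots = [0, 0, 0, 0, 1, 2, 3, 3, 3, 3]
degree = 3
n_basis = len(knots) - degree - 1
values = [bspline_basis(i, degree, 1.5, knots) for i in range(n_basis)]
```

The basis values are non-negative and sum to one inside the knot range, so a combination with non-negative coefficients stays non-negative and smooth, which is a convenient property when modeling a quantity such as an event intensity.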

A Polynomial Architecture-Attribution Co-Design Framework for Exact Aumann-Shapley Attribution in GNNs

Proposes the APEX framework for interpreting graph neural networks, using Aumann-Shapley attribution to provide feature-level explanations.

Deep learning, Graph neural network

Use case: Interpreting graph neural networks
Difficulty: Hard
Cost: Medium

CASC: Causal Adversarial Subspace Clustering for Multivariate Spatiotemporal Data

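Proposes the CASC framework, which performs graph neural network-based subspace clustering on diverse data, including multivariate spatiotemporal data.

Use case: Clustering multivariate spatiotemporal data
Difficulty: Hard
Cost: Low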

Automatic knot selection in smooth additive models

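Proposes Automatic Knot Selection, a new method that automates the choice of knots for B-spline regression, simplifying both knot selection and data analysis.

Deep learning, Compression/quantization, Regression, Text

Use case: Automatic knot selection for B-spline regression
Difficulty: Hard
Cost: Medium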

Spectral Transformation for Layer-wise Global Rank Discovery in Federated LoRA for Vision Transformers

Fine-tuning Vision Transformers (ViTs) with low-rank adapters (LoRA) promises better communication efficiency

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Counterfactual Explainability Framework With CycleGAN And Counterfactual-Classifier Alignnment Score for Retinal Disease Classification

Automated detection of vision impairing retina-based ocular conditions from fundus images is important for ear

Explainable, Deep learning, CNN, Classification, Detection, Image

Use case: Classification
Difficulty: Hard
Cost: Low

Regularized Optimization on Grassmann Manifold: Theory, Algorithm and Applications

Spectral methods are among the most widely used techniques for community detection, clustering, and graph lear

Explainable, Deep learning, Compression/quantization, Detection

Use case: Detection
Difficulty: Hard
Cost: Medium

Weight-norm Criticality: A Mechanism for Loss Spikes Induced by the Normalization and Weight Decay

Most explanations of training instability focus on learning-rate criticality, typically characterized b

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

An Analytically Trained Variational Surrogate for Quantum Phase Estimation on NISQ Hardware

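Investigates whether the deep circuits required for quantum phase estimation (QPE) can be simplified using a variational quantum circuit (VQC), and suggests that such simplification is feasible.

Explainable, MI-oriented, Deep learning, Compression/quantization

Use case: Simplifying deep QPE circuits
Difficulty: Hard
Cost: High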

Three-Pronged Spectral Control for Federated Parameter Efficient Fine Tuning

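Proposes TRISHUL, a tool that supports parameter-efficient fine-tuning in federated learning (FL) (Three-Pronged Spectral Control for Federated Paramet...

Deep learning, Compression/quantization, Multimodal

Use case: Supporting parameter-efficient fine-tuning in FL
Difficulty: Hard
Cost: High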

HierarchicalDAEW: Domain-Aware Edge-Weighted Graph Convolution with Evidential Uncertainty for Multi-Section Spatial Gene Expression Prediction from H&E Histology

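Proposes HierarchicalDAEW, a hierarchical, domain-aware method for predicting spatial gene expression from H&E histology, and suggests that it enables high-accuracy gene expression prediction.

Use case: Predicting gene expression from H&E histology
Difficulty: Hard
Cost: Medium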

Information-Theoretically Secure Aggregation for Lightweight Federated Learning: Resilient to Dropouts and Adversaries

Proposes a method for secure aggregation in FL that keeps data secure when it is distributed across participants.

Use case: Secure aggregation in FL
Difficulty: Hard
Cost: High
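
The general idea of secure aggregation can be illustrated with pairwise additive masking: each pair of clients shares a random mask that one adds and the other subtracts, so the masks cancel in the sum. This is a textbook-style sketch, not the paper's protocol; it does not handle dropouts or adversaries, and the modulus, client values, and seeded generator are assumptions (a real deployment would need a cryptographically secure source of randomness).

```python
import random

MODULUS = 2**16  # hypothetical ring size for the integer-encoded updates


def mask_updates(updates, rng):
    """Add cancelling pairwise masks so that individual updates are hidden."""
    masked = list(updates)
    for i in range(len(updates)):
        for j in range(i + 1, len(updates)):
            mask = rng.randrange(MODULUS)  # secret shared by clients i and j
            masked[i] = (masked[i] + mask) % MODULUS
            masked[j] = (masked[j] - mask) % MODULUS
    return masked


# Hypothetical integer-encoded client updates.
updates = [12, 7, 30]
masked = mask_updates(updates, random.Random(0))  # seeded only for reproducibility
aggregate = sum(masked) % MODULUS
```

The server sees only the masked values, yet their sum modulo the ring size equals the sum of the original updates.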

TwistedMerge: Certified Higher-Order Diagnostics and Abstention for Model Merging

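Proposes TwistedMerge, a method that supports model merging and allows models to be merged efficiently.

Use case: Supporting model merging
Difficulty: Hard
Cost: Low

Sensor/time series, Quality prediction/anomaly detection, Deep learning, CNN, Classification, Image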

Machine Learning for Charge State Characterization of Isolated Double Quantum Dots

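Proposes a machine learning method for characterizing the charge state of double quantum dots, allowing the charge state to be analyzed efficiently.

Use case: Characterizing the charge state of double quantum dots
Difficulty: Hard
Cost: High

Explainable, Sensor/time series, Deep learning, Graph neural network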

Multilevel Graph Wavelet Compressed Sensing with Scale-Aware Neural Recovery

Proposes Graph Wavelet Compressed Sensing, a method for compressing graph signals efficiently.

Use case: Compressing graph signals
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Detection, Text, Time series

Beyond Heavy Log Curation: Perplexity-Based APT Detection via Unsupervised, Context-Augmented Language Models

Advanced Persistent Threats (APTs) remain difficult to detect because only a small fraction of events in large

Use case: Detection
Difficulty: Hard
Cost: High
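
Perplexity-based detection rests on a simple quantity: the exponential of the mean negative log-probability a language model assigns to a sequence. The sketch below computes it and flags sequences above a threshold. It is not the paper's pipeline; the event names, token probabilities, and threshold are hypothetical values standing in for real model outputs.

```python
import math


def perplexity(log_probs):
    """Exponential of the mean negative natural-log probability per token."""
    return math.exp(-sum(log_probs) / len(log_probs))


def flag_anomalies(sequences, threshold):
    """Return the names of sequences whose perplexity exceeds the threshold."""
    return [name for name, lps in sequences.items() if perplexity(lps) > threshold]


# Hypothetical per-token log-probabilities, as a language model might produce them.
events = {
    "routine_login": [math.log(0.5)] * 4,
    "rare_command_chain": [math.log(0.05)] * 4,
}
flagged = flag_anomalies(events, threshold=10.0)
```

Sequences the model finds surprising receive high perplexity, so no labeled attack data is needed to rank events by suspicion.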

The Geometry of Personality: Activation Steering with Jungian Cognitive Functions

Activation steering enables control and interpretation of LLMs, yet existing work primarily models personality

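Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, RNN / LSTM, Detection, Anomaly detection, Unsupervised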

Unsupervised Consensus-Based Anomaly Detection for Spatiotemporal Malaria Incidence in Ghana

A consensus anomaly detection framework was applied to monthly malaria surveillance data from Ghana (2014-2023

Use case: Detection
Difficulty: Hard
Cost: Medium

Optimal use of a black-box learner in semiparametric estimation

Consider the partial linear model $Y = \mu_0(X) + \beta_0 \cdot T + \varepsilon$ and $T = \pi_0(X) + u$ in the structu

Deep learning, Compression/quantization, Regression

Use case: Regression
Difficulty: Hard
Cost: Medium

OpenForgeRL: Train Harness-native Agents in Any Environment

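OpenForgeRL provides a framework for training harness-native agents. With it, agents can use complex harnesses to work with external systems and solve multiple tasks at once ...

Deep learning, Compression/quantization, Multimodal

Use case: Training harness-native agents
Difficulty: Hard
Cost: High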

Visual Contrastive Self-Distillation

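Visual Contrastive Self-Distillation proposes a way to speed up self-distillation. The method removes the information imbalance between student and teacher using input information alone.

Use case: Speeding up self-distillation
Difficulty: Hard
Cost: Medium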

Artificial Epanorthosis: Why large language models overuse a classical rhetorical figure, and how to mitigate it

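Artificial Epanorthosis examines the tendency of large language models to overuse a classical rhetorical figure. The results indicate that the shape of the training data influences this tendency.

Deep learning, Compression/quantization, Classification, Generation, Text

Use case: Epanorthosis in large language models
Difficulty: Hard
Cost: High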

Toward Continuous Assurance for the Democratization of AI Agent Creation in Industry

Examines the democratization of AI agent creation, which lets organizations create AI agents openly. The method addresses the reliability of agents ...

Use case: Democratizing AI agent creation
Difficulty: Hard
Cost: Low

Thinkink: 2D Spatial Ink-native Interaction with LLMs

People often use handwritten notes and sketches to externalize ideas for ideation. To integrate large language

Deep learning, Compression/quantization, Image, Text

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Agent-Guided Relational Concept Discovery: Toward Interpretable Surgical Margin Assessment

Deep learning models can effectively use Rapid Evaporative Ionization Mass Spectrometry (REIMS) data for surgi

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Image, Text, Video

Adaptive Identity Anchoring: Closed-Loop Keyframe Placement for Synthetic Paired Supervision in Video Face Swapping

Video face swapping has no natural paired supervision: no real footage exists of one person's face performing

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

When Are Reasoning-Based Guardrails Not Efficient? ResponseGuard: A Fast Vision-Language Guard for Real-Time Moderation

A vision-language AI assistant returns its answer as a stream of generated tokens. Therefore, a safety guard t

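Deep learning, Compression/quantization, Detection, Image, Text

Use case: Detection
Difficulty: Hard
Cost: High

Explainable, Deep learning, Transformer, Detection, Embedding, Text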

Multimodal Pretraining for Generalizable EEG Representation Learning

Electroencephalography (EEG) models used for epilepsy are often limited to specific datasets and tasks. This l

Use case: Detection
Difficulty: Hard
Cost: High

Towards Faithful Graph Explanations with Synergistic Edge Effects via Granular Balls

Instance-level explanations aim to reveal the rationale behind a model's decisions for a specific graph. Previ

Deep learning, Graph neural network, Classification

Use case: Classification
Difficulty: Hard
Cost: Low

Regulating autonomous and agentic AI

Regulating activities where regulatees use autonomous and agentic AI is challenging. Regulatory assumptions ab

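Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Deep learning, Compression/quantization, Generation, Reinforcement learning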

Expert Behavior Prior Reinforcement Learning

Behavior prior reinforcement learning (BPRL) has emerged as a promising paradigm to improve sample efficiency

Use case: Generation
Difficulty: Hard
Cost: High

A Comparative Evaluation of Embeddings and LLMs in a Greek Book Publisher Setting - The CUP Dataset

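Evaluates a Greek-language book retrieval system using large language models, and reports that using large language models improved retrieval accuracy.

Deep learning, Transformer, Summarization

Use case: Evaluating book retrieval systems
Difficulty: Hard
Cost: High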

slang.gr as a Large-Scale Crowdsourced Resource for Non-Standard Greek

Studies Greek slang using large language models, and reports that the meaning of slang terms could be inferred with their help.

Use case: Slang research
Difficulty: Hard
Cost: High

Declarative Problem Solving in UAM Strategic Deconfliction

The growing demand for Urban Air Mobility (UAM) introduces significant challenges in airspace management, part

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Identifying Good Rules for Efficient SAT Encodings of Single-Constant Multiplication Using Machine Learning

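Uses machine learning models to search efficiently for ways to multiply by a given numeric constant. Conventional dynamic programming methods are efficient but do not scale with the size of the constant. This study takes a neuro-symbolic approach to ...

Quality prediction/anomaly detection, Deep learning, Graph neural network

Use case: Optimizing multiplication by a numeric constant
Difficulty: Easy
Cost: Medium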

One More Turn, Less Regret: A Regret-Based Multi-Turn Benchmark for LLMs' Clarification Policies

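Proposes RegretBench, a regret-based benchmark for evaluating conversational assistants. The benchmark evaluates how well an assistant minimizes regret in multi-turn interactive decisions.

Use case: Regret-based evaluation of conversational assistants
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Text, Audio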

Faster IndexTTS-2: Accelerating and Streaming Autoregressive Zero-Shot Text-to-Speech Synthesis on GPUs

Autoregressive text-to-speech models achieve strong naturalness but suffer from slow inference due to sequenti

Use case: Generation
Difficulty: Hard
Cost: High

Reexamining zero-shot summarization: Empirical investigation of trustworthiness of LLM-summarizers

Zero-shot summarization using Large Language Models (LLMs) has significantly advanced the abstractive summariz

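MI-oriented, Deep learning, Compression/quantization, Classification, Generation, Summarization

Use case: Classification
Difficulty: Hard
Cost: High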

Naju: A Native Discrete State-Space Model with Independent Retention and Writing for Long-Sequence Memory

Long-sequence memory tracking places two opposing demands on a recurrent state: near-lossless retention of sto

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Explainable, Deep learning, Transformer, Embedding, Image, Video

HyWorldVLA: A Vision-Language-Action Model with Hybrid World Modeling for Autonomous Driving

Vision-Language-Action (VLA) models augmented with world modeling represent a promising paradigm for end-to-en

Use case: Embedding
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Image, Text, Video

Beyond Independent Optimization: Compression, MoE Routing, and Quantization Interactions in Multimodal Edge Intelligence

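Efficient multimodal inference is constrained not only by model performance and FLOP count but also by the cost, memory, and energy of moving, caching, transforming, and storing quantized representations. This paper examines recent visual token compression ...

Use case: Improving the efficiency of multimodal edge AI
Difficulty: Hard
Cost: High

Sensor/time series, Quality prediction/anomaly detection, Deep learning, Compression/quantization, Time series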

From Scalars to Time Series: Rethinking Implicit Neural Representations for Time-Varying Volumetric Data

Implicit neural representations (INRs) for time-varying volumetric data are typically trained using dense samp

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Clustered Edge Intelligence: Beyond Just Convergence of Edge Computing and AI

We are moving from an information age to the age of intelligence. A decade, or possibly less than that, data w

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

OPOD: On-Policy Omni Distillation

Omni-modal models can handle text, images, and audio in one system, but improving all of these abilities toget

Deep learning, Compression/quantization, Image, Text, Audio

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Source-Prior-Driven Selective Adaptation for Efficient Diffusion Model Finetuning

Fine-tuning large diffusion models for new domains or styles involves a trade-off: improving target-specific g

Use case: Generation
Difficulty: Hard
Cost: High

Auditing Provenance Sensitivity in LLM Agent Action Selection

LLM agents choose tools and arguments from context that mixes user requests, tool outputs, retrieved records,

Deep learning, Transformer, Detection, Text

Use case: Detection
Difficulty: Hard
Cost: High

Efficient and Interpretable Body-Based Emotion Recognition with Lightweight Temporal Convolutional Networks

Body-based emotion recognition is important for real-time affective systems, but graph-based skeleton models c

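Explainable, Sensor/time series, Deep learning, CNN, Classification, Time series

Use case: Classification
Difficulty: Hard
Cost: Low

Explainable, MI-oriented, Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Generation, Image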

Enhancing Explainable Cardiac Diagnosis with Guide-Grounded Multimodal LLMs

The electrocardiogram (ECG) is a cornerstone of cardiac assessment, yet clinical deployment of deep learning

Use case: Classification
Difficulty: Hard
Cost: High

Profiling Lightweight Large Language Models

Lightweight large language models (LLMs) are increasingly being deployed locally on personal computers and are

Use case: Generation
Difficulty: Hard
Cost: High

Search Hardness-Aware LLM-Based Problem Formulation for Expensive Simulation-Driven Design

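Simulation-driven design must reach a good design while running as few expensive high-fidelity simulations as possible. Existing work has tackled this by improving optimization algorithms, but the problem formulation itself has not been examined. This paper ...

Use case: Cost-efficient simulation-driven design
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Transformer, Classification, Text, Audio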

DONDO: Open w2v-BERT Speech-Recognition Base Models for African Languages

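This paper builds DONDO, a set of speech recognition (ASR) base models for African languages. The models are built on w2v-BERT 2.0, a self-supervised speech encoder. This encoder ...

Use case: Applying speech recognition to African languages
Difficulty: Hard
Cost: Low

CPU-friendly, Sensor/time series, Deep learning, Compression/quantization, Classification, Text, Audio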

VibeVoice-ASR-BitNet Technical Report

We present VibeVoice-ASR-BitNet, a compressed variant of VibeVoice-ASR optimized for real-time inference on ed

Use case: Classification
Difficulty: Hard
Cost: High

QuantiBias: Benchmarking Quantization-Induced Bias in LLMs

Almost every large language model that reaches a broad audience is quantized: trained in full precision, then

Use case: Generation
Difficulty: Hard
Cost: High

Sample-Efficient Learning from Agent Experience

Real-world agent learning is often constrained by costly environment interactions, such as running time-consum

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Text

Transformer-Assisted LLM-Based Source Code Summarisation: to Enable More Secure Software Development

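Aims to improve models that generate natural-language explanations of source code during the maintenance phase of software development.

Use case: Speeding up software development
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Image, Text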

Streaming Multi-Agent Autoregressive Diffusion Model with World State Registers

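For multi-agent simulation, the work assumes that a shared world state is maintained across agents and that this world state is reflected in each agent's observations.

Use case: Multi-agent simulation
Difficulty: Hard
Cost: High

MI-oriented, Deep learning, Compression/quantization, Segmentation, Anomaly detection, Image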

Unified Video Dense Prediction from Disjoint Data

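Builds a unified video inference system that performs spatial reasoning about objects in video jointly, going beyond existing task-specific annotations.

Use case: Unified dense prediction for video
Difficulty: Hard
Cost: High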

Inference-Time Scaling of Diffusion Models via Progressive Seed Pruning

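Shows that the initial noise seed strongly affects the quality of the images a diffusion model generates, and proposes a method for reducing the time cost of seed search.

Use case: Inference-time scaling of diffusion models
Difficulty: Hard
Cost: High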
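
One generic way to cut the cost of searching over seeds is to score all candidates cheaply, discard the weaker ones, and spend more compute only on the survivors. The sketch below shows that successive-pruning loop. It is an assumption about the general shape of such a search, not the paper's algorithm, and `toy_score` is a hypothetical stand-in for a quality estimate of a partially denoised sample.

```python
def progressive_prune(seeds, score_fn, keep_ratio=0.5, rounds=3):
    """Repeatedly score the surviving seeds and keep only the best fraction."""
    survivors = list(seeds)
    for step in range(1, rounds + 1):
        ranked = sorted(survivors, key=lambda s: score_fn(s, step), reverse=True)
        keep = max(1, int(len(ranked) * keep_ratio))
        survivors = ranked[:keep]
    return survivors


def toy_score(seed, step):
    """Hypothetical score; a real scorer would evaluate the sample at denoising stage `step`."""
    return -abs(seed - 42)


best = progressive_prune(range(0, 64, 6), toy_score)
```

Because the candidate pool shrinks each round, later and more expensive evaluations are run on only a few seeds.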

Scale Up Strategically: Learning Compositional Generalization via Bias-Aware Evaluation and Data Collection for Robotic Manipulation

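Proposes a framework that promotes compositional generalization, formalizes instruction factor bias, and presents a way to adapt fine-tuned policies so as to reduce that bias.

Use case: Compositional generalization
Difficulty: Hard
Cost: High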

Self-Supervised Learning of Structured Dynamics from Videos

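Improves motion representation learning by separating camera motion from object motion in video.

Deep learning, Transformer, Embedding, Image, Video

Use case: Predicting motion in video
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Video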

SANA-Video 2.0: Hybrid Linear Attention with Attention Residuals for Efficient Video Generation

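Proposes a new method for improving both the efficiency and the output quality of video generation models.

Use case: Video generation
Difficulty: Hard
Cost: High

Deep learning, Compression/quantization, Segmentation, Multimodal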

UnDA: Unpaired Domain Alignment for Cross-Modal Knowledge Transfer in Medical Imaging

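Proposes methods and tools that support the integration of multimodal data, promoting knowledge transfer between modalities in medical image recognition.

Use case: Integrating multimodal data
Difficulty: Hard
Cost: High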

Towards Robust Iris Recognition Through Occlusion Identification and Conditional Diffusion-Based Reconstruction

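Proposes a method for improving the accuracy of iris recognition and analyzes the challenges of the iris recognition task.

Deep learning, CNN, Classification, Image, Text

Use case: Iris recognition
Difficulty: Hard
Cost: High

Sensor/time series, Deep learning, Compression/quantization, Image, 3D, Self-supervised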

Boosting Robustness for All-Weather Self-Supervised Depth Estimation in Autonomous Driving

Self-supervised depth estimation is challenging for safe autonomous driving under various adverse weather cond

Use case: Obstacle perception for autonomous vehicles
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Image, Text, Video

Texture++: Elevating 3D Asset Texture Resolution with a Region-Aware Diffusion Model

Numerous 3D assets are discarded due to low texture resolution, while current super-resolution models ignore t

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Recurrent Sinusoidal INRs for Efficient High-Fidelity Representation

We study sinusoidal recurrence as an iterative mechanism for harmonic spectral enrichment in implicit neural r

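Deep learning, RNN / LSTM, Image, 3D

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, RNN / LSTM, Image, Text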

CLUIE: Clustering-Aware Recurrent Propagation with Local Structural Compensation for Underwater Image Enhancement

Underwater image enhancement remains challenging due to wavelength-dependent light absorption, scattering, and

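Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Visual inspection, Deep learning, Transformer, Segmentation, Text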

SPDCN: Strip-based Deformable Convolutional Network for Steel Surface Defect Segmentation

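Steel surface defect segmentation is important in industrial quality inspection, but conventional methods rely on standard convolutions with fixed receptive fields and rigid sampling grids, which cope poorly with distorted, anisotropic defects and irregular defect boundaries ...

Use case: Detecting steel surface defects
Difficulty: Hard
Cost: Medium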

GrainGS: Gradient-Decoupled Gaussian Splatting for Efficient Dynamic Novel View Synthesis

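Dynamic scene reconstruction with 3D Gaussian Splatting must balance dynamic motion modeling, structural stability, and compact representation. In practice, existing per-primitive methods handle local ...

Quality prediction/anomaly detection, Deep learning, Compression/quantization, Generation, 3D

Use case: Dynamic scene reconstruction with 3D Gaussian Splatting
Difficulty: Hard
Cost: High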

DAPM: UAV Monocular Depth Estimation from Any Height, Pitch, Roll and FOV

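UAVs operate under widely varying camera poses, with changes in height, pitch, roll, and FOV, so monocular depth estimation over wide-ranging aerial imagery with asymmetric depth distributions calls for more capable estimation methods. Most estimation methods ...

Deep learning, Compression/quantization, Image, 3D

Use case: Monocular depth estimation for UAVs
Difficulty: Hard
Cost: High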

Towards Privacy-Preserving Federated Prompt Tuning under Data Heterogeneity: A Subspace-Decomposed Expert Approach

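Federated prompt tuning enables collaborative adaptation of a shared vision-language model (VLM) by exchanging lightweight prompts rather than the model itself. In conventional prompt-based methods, each individual ...

MI-oriented, Deep learning, Compression/quantization, Text

Use case: Federated prompt tuning for vision-language models
Difficulty: Hard
Cost: Medium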

GLAM-SLAM: Real-time Gaussian Large-scale Mapping via Flow Densification and Spatial Decomposition

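Some Gaussian Splatting-based SL...

Quality prediction/anomaly detection, Deep learning, Compression/quantization, Detection, 3D

Use case: Real-time large-scale SLAM mapping
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Normalization/optimization methods, Classification, Image, Text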

Quality-Aware Multimodal Fusion Reveals Implicit Identity in Valence-Arousal Features

Conventional face recognition relies on static appearance cues and degrades in unconstrained settings with exp

Use case: Classification
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, Image

SlerpFlow: Spherical Trajectory Correction for Rectified Flow Inversion

Rectified-flow-based diffusion transformers, particularly FLUX, have demonstrated outstanding performance in h

Use case: Generation
Difficulty: Hard
Cost: High
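
The "slerp" in the title refers to spherical linear interpolation, which moves between two vectors along the arc between them rather than along the straight chord. The sketch below implements the standard formula in plain Python. It illustrates the interpolation only, not the paper's trajectory correction, and it does not handle exactly opposite input vectors.

```python
import math


def slerp(a, b, t):
    """Spherical linear interpolation between vectors a and b for t in [0, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cos_omega = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    omega = math.acos(cos_omega)  # angle between the two vectors
    if omega < 1e-8:
        # Nearly parallel vectors: fall back to ordinary linear interpolation.
        return [(1 - t) * x + t * y for x, y in zip(a, b)]
    sin_omega = math.sin(omega)
    weight_a = math.sin((1 - t) * omega) / sin_omega
    weight_b = math.sin(t * omega) / sin_omega
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


midpoint = slerp([1.0, 0.0], [0.0, 1.0], 0.5)
midpoint_norm = math.sqrt(sum(v * v for v in midpoint))
```

Unlike a straight-line average, which would shrink the midpoint of two unit vectors, the spherical midpoint keeps unit length.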

HGeo-TopoMap: Boosting Topological Mapping with Hierarchical Geometric Priors

Topological maps are key outputs of autonomous driving perception systems, delivering essential road informati

Use case: Detection
Difficulty: Easy
Cost: Low

Flash EQ-Linear: Accelerating Equivariant Linear Layers via Group-wise Discrete Fourier Transform

Equivariant networks embed geometric symmetries as structural priors through weight sharing, achieving remarka

MI-oriented, Deep learning, Transformer

Use case: Technical validation and paper-reading aid
Difficulty: Easy
Cost: Medium

Stokes-Informed Diffusion for Robust Linear Polarization Estimation

Polarization cues benefit applications such as material detection and de-reflection, yet acquiring them typica

Deep learning, Compression/quantization, Detection, Image

Use case: Detection
Difficulty: Hard
Cost: High
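
Linear polarization is usually summarized by two quantities derived from the Stokes parameters: the degree of linear polarization, sqrt(S1^2 + S2^2) / S0, and the angle of linear polarization, 0.5 * atan2(S2, S1). The sketch below computes both. It shows the standard definitions only, not the paper's diffusion model, and the input values are hypothetical.

```python
import math


def linear_polarization(s0, s1, s2):
    """Return (degree, angle in radians) of linear polarization from Stokes parameters."""
    dolp = math.hypot(s1, s2) / s0
    aolp = 0.5 * math.atan2(s2, s1)
    return dolp, aolp


# Hypothetical Stokes parameters for a single pixel.
dolp, aolp = linear_polarization(1.0, 0.3, 0.4)
```

For physically valid measurements the degree lies between 0 (unpolarized) and 1 (fully linearly polarized).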

Learning-based Seam Correspondence Reconstruction in Sewing Patterns

Digital sewing patterns typically consist of disjoint 2D panels without explicit stitch annotations, making do

Deep learning, Transformer, Text, 3D

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

DTIF: Robust Loop Closure Detection via Delaunay Triangle Topology in Complex Forests

Accurate forest inventory and large-scale mapping are essential for ecosystem monitoring and sustainable fores

Deep learning, Transformer, Detection, 3D

Use case: Detection
Difficulty: Hard
Cost: High

Deep learning, Transformer, Text, Multimodal

C-PTQ: Fisher-weighted Channel-wise Sensitivity for Post-training Quantization of MLLMs

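Compressing large language models can degrade their performance, so protecting quality during quantization matters. This study proposes C-PTQ, which uses Fisher-weighted channel-wise sensitivity to stabilize the quantization of MLLMs.

Use case: Compressing multimodal large language models
Difficulty: Hard
Cost: High

Deep learning, Transformer, Image, Text, Multimodal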

MVEI & EmObserver: Empowering MLLM-Oriented Visual Emotional Intelligence via Emotion Statement Judgement

Emotion recognition is essential to advancing modern AGI, but large-scale ...

Use case: Emotion recognition
Difficulty: Hard
Cost: High

Spectral-Spatial Synergistic Guided Network for Hyperspectral Salient Object Detection

Hyperspectral salient object detection aims to identify visually salient regions from hyperspectral images. Ex

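Deep learning, Compression/quantization, Detection, Image

Use case: Detection
Difficulty: Hard
Cost: Low

Quality prediction/anomaly detection, Deep learning, Transformer, Detection, Generation, Image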

GroupVideo: Multi-Identity Customized Text-to-Video Generation

Current identity customized video generation methodologies are predominantly limited to single-identity scenar

Use case: Detection
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Compression/quantization, Image, Video, 3D

WAT3R: Feedforward Underwater 3D Reconstruction

Reliable feedforward underwater 3D reconstruction remains challenging due to severe light attenuation and back

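Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Compression/quantization, Text, Video, Multimodal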

ProCap: Prominence-guided Object Rectification for Faithful and Comprehensive Video Captioning

Improving video captioning quality typically demands retraining large vision-language models, an expensive and

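Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Small-data, Sensor/time series, Deep learning, Compression/quantization, Audio, Semi-supervised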

Latent Variable-Mediated Cross-Learning for Few-Shot Acoustic Impedance Imaging

Acoustic impedance imaging is a fundamental yet severely ill-posed problem in subsurface analysis: the seismic

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Deep learning, Attention mechanism, Detection, Segmentation

FSB-Net: Frequency-Spatial Boundary Network for Brain Stroke Lesion Segmentation in Non-Contrast CT

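This paper develops the Frequency-Spatial Boundary Network (FSB-Net), which focuses on lesion boundaries in order to segment stroke lesions accurately in non-contrast CT (NCCT) scans.

Use case: Segmenting stroke lesions
Difficulty: Hard
Cost: Low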

RECO: Region-Aware Compensation for Extrinsic Perturbations in Roadside 3D Detection

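Proposes region-aware compensation for extrinsic perturbations in order to improve roadside 3D object detection.

Deep learning, Transformer, Detection, 3D

Use case: Roadside 3D object detection
Difficulty: Hard
Cost: High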

Ms. Forcing: Efficient Streaming Video Generation with Multi-Scale Patchification and Attention

This paper proposes Ms. Forcing, an efficient method for streaming video generation that combines multi-scale patchification with attention.

Deep learning, Transformer, Generation, Video

Use case: Streaming video generation
Difficulty: Hard
Cost: High

MagicMakeup: A Region-Controllable Diffusion Transformer for High-Fidelity Makeup-Transfer

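Proposes MagicMakeup, a region-controllable diffusion transformer that improves makeup transfer by accounting for the strongly regional nature of makeup.

Use case: Makeup transfer
Difficulty: Hard
Cost: High

Quality prediction/anomaly detection, Deep learning, Transformer, Generation, 3D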

FA-LAM: Focus-Aware Large Avatar Model for One-Shot 4D Animatable Gaussian Head

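This paper proposes the Focus-Aware Large Avatar Model (FA-LAM), a model for generating one-shot 4D animatable Gaussian heads.

Use case: One-shot generation of animatable Gaussian heads
Difficulty: Hard
Cost: High

MI-oriented, Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Image, Text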

Sidewalk Moments: Are Richer Representations Always More Human-Aligned? Evidence from City-Walk Videos

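Analyzes city-walk videos using four representation modalities: spatiotemporal information, time-averaged images, audio encodings, and text-based representations.

Use case: Analyzing city-walk videos
Difficulty: Hard
Cost: High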

DINO-VPT: Hierarchical Visual Prompt Tuning for Joint Physical-Digital Face Anti-Spoofing

This paper proposes DINO-VPT, which uses Hierarchical Visual Prompt Tuning (HVPT) to detect both physical and digital spoofing attacks.

深層学習軽量化・量子化画像テキストマルチモーダル

用途: Face anti-spoofing
難易度: Hard
コスト: High

WhereEdit: Mask-aware Local Latent Editing for One-Step Image Editing

この研究では、WhereEditという手法を提案します。WhereEditは、Mask-aware Local Latent Editingを使用して、一ステップの画像編集を実行します。

用途: 画像編集
難易度: Hard
コスト: Medium

Explainable graph attention network for stress recognition (StressGAT) via differential action units

Stress is a dynamic process characterized by significant individual variability in facial expression. Traditio

説明可能深層学習Transformer分類

用途: 分類
難易度: Hard
コスト: Low

品質予測/異常検知深層学習軽量化・量子化生成画像3D

SubSplat: High-Resolution Pixel-aligned 3DGS via Sub-pixel Gaussian Reparameterization

Pixel-aligned Gaussian splatting enables efficient and generalizable novel-view synthesis. However, high-resol

用途: 生成
難易度: Hard
コスト: High

CPUで試しやすい深層学習軽量化・量子化検出3Dマルチモーダル

Factorized Spatio-Temporal Convolutions for Human Pose Estimation from Planar Lidar

This paper proposes a pipeline of networks for human pose estimation and robot motion control, aimed at safe human-robot interaction.

用途: Safe human-robot interaction
難易度: Hard
コスト: High

RL-MACRO: A Cybernetic Closed-Loop Intelligence Framework for Multimodal Adaptive Robotic Craniotomy

To automate craniotomy, this work proposes a cybernetic closed-loop framework composed of multiple modules. The framework works through the interaction between the surgical tool and tissue ...

センサ/時系列深層学習CNN音声マルチモーダル

用途: Craniotomy automation
難易度: Hard
コスト: High

センサ/時系列深層学習Transformer検出画像音声

Human-Inspired Framework for Robotic Craniotomy: Integrating Multimodal Fusion and Adaptive Trajectory Adjustment

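This work proposes a craniotomy framework that mimics human intelligence. It combines up-front planning with subsequent execution and adjusts automatically during surgery to achieve a procedure as safe and efficient as a human's.

用途: Craniotomy automation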
難易度: Hard
コスト: High

A Real-Time Generalized Nash Equilibrium Framework for Interaction-Aware Autonomous Driving in Mixed Traffic

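For autonomous driving in mixed traffic, this work proposes a framework based on the generalized Nash equilibrium problem to achieve safe interaction between human-driven and autonomous vehicles. The framework balances safety against control complexity ...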

用途: 混合交通の自律走行
難易度: Hard
コスト: Medium

AwesomeOPD — Awesome List for On-Policy Distillation

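AwesomeOPD is a curated list of resources on on-policy distillation, in which a student model is trained on its own sampled outputs using feedback from a teacher model.

用途: On-policy distillation resource list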
難易度: Easy
コスト: Medium

EasyR1 — EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

An efficient, scalable, multi-modality RL training framework based on veRL.

用途: 多モダリティ RLのトレーニングフレームワーク
難易度: Easy
コスト: High

品質予測/異常検知深層学習軽量化・量子化生成テキスト動画

Causal-Forcing — [ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation" & Causal Forcing++

この論文では、Causal-Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive

用途: 高品質のビデオ生成を実現する。
難易度: Easy
コスト: High

best-of-ml-python — 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

Pythonで使えるマシンラーニングライブラリを紹介している。

用途: Python MLライブラリ
難易度: Easy
コスト: Medium

txtai — 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

LLMを利用するために、セマンティック検索やLLMのオーケストレーションなどを行えるフレームワーク。

用途: セマンティック検索
難易度: Easy
コスト: High

Memory-Computation Tradeoffs in Semi Amortized Parametric Optimization

Learning-enabled decision systems often use offline data or computation to reduce online compute cost. Despite

深層学習Transformer回帰

用途: 回帰
難易度: Hard
コスト: Medium

GaugeQuant: Online Learning of Quantization-Optimal Bases from LLM Symmetries

Transformers are known to have internal continuous symmetries that leave outputs invariant, while modifying qu

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Self-Supervised Bio-Inspired Robotic Trajectory Planning with Obstacle Avoidance

Trajectory planning is a fundamental problem in robotics, requiring the generation of collision-free and effic

深層学習軽量化・量子化生成教師あり自己教師

用途: 生成
難易度: Hard
コスト: High

Cardinality-Decomposed Loss: Matching Training Objectives to Relation Structure in Heterogeneous Recommendation Graphs

Graph Neural Networks trained on heterogenous bipartite graphs form a common basis in recommendation systems.

深層学習Transformerセグメンテーション

用途: セグメンテーション
難易度: Hard
コスト: High

Leaky Language Models: Stealing Architecture and Inference Optimizations via Per-Token Timing

This work presents LeakyLMs, a set of attacks that leak proprietary model, architecture, and deployment inform

用途: 生成
難易度: Hard
コスト: Medium

説明可能センサ/時系列深層学習軽量化・量子化時系列

CEDAR: Causal Edge Discovery for Autoregressive Processes

We propose CEDAR (Causal Edge Discovery for Autoregressive Processes), a constraint-based method for lagged ca

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

Writhe-Based Polymer Link Classification Using Machine Learning

Unique and rapid classification of knots and links is an open mathematical problem that is relevant to a range

MI向き深層学習軽量化・量子化分類

用途: 分類
難易度: Hard
コスト: Low

Scaling Interpretable Transformers with Parity Bottleneck Layers

Language models are thought to exhibit the phenomenon of superposition, representing many more features than d

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Masked Topology Modeling for Self-Supervised Learning on Parametric CAD

Computer aided design (CAD) is ubiquitous: virtually any modern object was designed using editable CAD tools.

深層学習軽量化・量子化教師あり自己教師

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Interval and fuzzy physics-augmented neural networks (iPANN and fPANN) for uncertainty quantification and propagation in constitutive modeling

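This study proposes interval and fuzzy variants (iPANN and fPANN) of the physics-augmented neural network (PANN), which embeds physical knowledge in the network, to quantify and propagate uncertainty in constitutive modeling.

用途: Uncertainty quantification and propagation in constitutive modeling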
難易度: Hard
コスト: Medium

MI向きセンサ/時系列深層学習Transformer分類画像時系列

Multi-modal transformer for signal classification in nanopore blockade experiments

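This study proposes a multi-modal transformer for analyzing the complex signals produced in nanopore blockade experiments, improving signal classification accuracy.

用途: Signal classification in nanopore blockade experiments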
難易度: Hard
コスト: Low

Label-Free Finite-Volume-Residual Training of Attention Graph Neural Networks for Coupled Thermo-Fluid Fields

This study develops an attention graph neural network, trained without labels on a finite-volume residual, to predict coupled thermo-fluid fields.

深層学習グラフニューラルネット生成3D

用途: Prediction of coupled thermo-fluid fields
難易度: Hard
コスト: High

When Does Recurrence Become an Algorithm? Convergence Selection in Weight-Tied Looped Transformers

When does a weight-tied looped transformer -- one block applied T times -- implement an actual algorithm? We a

深層学習Transformer異常検知

用途: 異常検知
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出生成異常検知

Classical Hardware Acceleration of Quantum Autoencoders for Real-Time Anomaly Detection in Collider Experiments

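This study examines classical hardware acceleration of quantum autoencoders for real-time anomaly detection in collider experiments.

用途: Real-time anomaly detection in collider experiments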
難易度: Hard
コスト: Low

The Blessing of Dimensionality: How Near-Orthogonality in High-Dimensional Spaces Explains Temporal Portability

This study examines how near-orthogonality in high-dimensional spaces explains temporal portability ...

用途: Explaining temporal portability of models
難易度: Hard
コスト: High

Interpretable Fuzzy Rule-Based Regression Extension for Ex-Fuzzy Library

Machine learning models achieve high predictive accuracy in regression tasks, but their deployment in safety-c

説明可能深層学習軽量化・量子化回帰

用途: 回帰
難易度: Hard
コスト: Medium

Detecting Neural Network Failures through Spectral Analysis of Internal Activations

Neural network misclassifications exhibit characteristic spectral instability in internal activations that is

深層学習軽量化・量子化分類検出

用途: 分類
難易度: Hard
コスト: Low

説明可能品質予測/異常検知深層学習Transformer

PhaseAware: Interpretable Human-in-the-Loop Rehabilitation Scoring with Boundary Monitoring

Rehabilitation scoring systems are most useful when their outputs can be reviewed and interpreted within clini

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Dynamical and Optimization Trade-offs of Levi--Civita Coordinates for Learned Close-Encounter Dynamics

Classical regularization removes the binary-collision singularity from the Kepler problem, but its value as a

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

説明可能品質予測/異常検知深層学習Transformer分類埋め込み

User-Centric Modeling of Transactional Sequences with Explainable State Space Models

Proposes a new user-centric approach to modeling transactional sequences that combines contrastive representation learning with state space models ...

用途: User-centric modeling of transactional sequences
難易度: Hard
コスト: Low

ELSAA: Efficient Low-Rank and Sparse Attention Approximation for Training Transformers

Proposes ELSAA (Efficient Low-Rank and Sparse Attention Approximation), a new approach to making Transformer training more efficient.

用途: Transformers の効率化
難易度: Hard
コスト: High

Statistical Inference for Rank Allocation in Low-Rank Adaptation

Studies how to allocate ranks in Low-Rank Adaptation (LoRA) using statistical inference, with the goal of parameter efficiency.

深層学習Transformer生成QAテキスト

用途: Rank allocation for LoRA
難易度: Hard
コスト: High

The Quadrilateral Loss: Additivity as a Measurable Behavior of Dense Neural Networks

ニューラルネットワークの解釈可能性を向上させるために Quadrilateral Loss を提案する。

用途: ニューラルネットワークの解釈可能性を向上させる
難易度: Hard
コスト: High

MI向き深層学習Transformer生成テキスト強化学習

OLEDLM: A Unified Language Model for OLED Molecular Design

OLED 材料の開発を目指す新しいアプローチ、causal language models を用いて optoelectronic プロパティを予測するフレームワークを提案する。

用途: OLED 材料の開発
難易度: Hard
コスト: High

Instance Hardness-Based Relevance for Imbalanced Regression

回帰問題の不均衡を扱う研究、Instance Hardness-Based Relevance の概念を提案する。

深層学習Transformer回帰

用途: 回帰問題の不均衡
難易度: Easy
コスト: Medium

Hard Guarantees at a Measured Price: Entropy-Stable Learned Finite Volumes for Compressible Flow

Proposes entropy-stable learned finite volumes, a new approach to simulating compressible flow.

深層学習Transformer異常検知

用途: Compressible flow simulation
難易度: Hard
コスト: High

Local Stability and Gaussian Smoothing of Quantized Neural Networks

Studies the local stability of quantized neural networks and the use of Gaussian smoothing.

用途: Stability analysis of quantized neural networks
難易度: Hard
コスト: High

Exact ReLU realization of affine one-dimensional refinement iterates via residual memory and offset frames

We study vector-valued affine refinement operators of the form [ (Wγ)(t)=\sum_{j\in\mathbb{Z}} A_jγ(Mt-j)+B(t)

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知深層学習軽量化・量子化分類生成動画

HeadCast: Casting Attention Heads for Efficient Autoregressive Video Generation

Proposes HeadCast, a method that casts attention heads to make autoregressive video generation more efficient.

用途: Autoregressive video generation
難易度: Hard
コスト: High

Cumsum-Composable Phase Transport for Low-Cost Streaming Keyword Spotting

Proposes Cumsum-Composable Phase Transport for low-cost streaming keyword spotting.

センサ/時系列深層学習CNN音声

用途: Streaming keyword spotting
難易度: Hard
コスト: High

Bayesian uncertainty estimation improves clinical decision making in medical AI agents

Machine learning models for medical image analysis typically lack a reliable measure of confidence, limiting t

深層学習正規化・最適化手法分類検出画像

用途: 分類
難易度: Hard
コスト: High

PN-QNN: Harnessing Physical Noise as a Native Regularizer in Photonic Hybrid Quantum Neural Networks

This study proposes injecting physical noise directly into training as a native regularizer for photonic hybrid quantum neural networks (PHQCNN).

用途: Regularization of photonic hybrid quantum neural networks
難易度: Hard
コスト: High

Generalized Kalman filter based temporal difference reinforcement learning

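This study proposes a new framework that treats the value and action-value (Q) functions of reinforcement learning as conditional expectations and casts their estimation as probabilistic inference.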

深層学習Transformer強化学習

用途: Kalman-filter-based temporal difference learning
難易度: Hard
コスト: Medium

CPUで試しやすいMI向き深層学習軽量化・量子化分類検出

Taming the Security-Energy Paradox: A Green AI Approach to Optimized Android Malware Detection

This study proposes a method for optimizing the deep learning models used for Android malware detection.

用途: Androidマルウェアの検出
難易度: Hard
コスト: Low

表形式向きCPUで試しやすいセンサ/時系列品質予測/異常検知深層学習CNN回帰予測時系列

Time Series Network Utilization KPI Forecasting Using Advanced AI/ML Models

This study proposes forecasting network utilization KPIs by combining newer models with conventional machine learning models.

用途: ネットワーク利用率KPIの予測
難易度: Hard
コスト: Low

The Giant Hippocampus: From Structural Monoculture to a System of Systems

この研究では、人工知能の研究者と神経科学者の間の分野を結びつけるために、脳のシステム構造を研究し、その研究から導かれた新しいアプローチを提案しました。

深層学習Transformer分類画像テキスト

用途: 脳のシステム構造とその応用
難易度: Hard
コスト: High

説明可能MI向きセンサ/時系列深層学習Transformer3D

AI-Driven Surrogate Models for Predicting Electrode-Scale Discharge Behavior in Lithium-Ion Batteries

Physics-based simulations are essential for understanding the electrode-scale discharge behavior of lithium-io

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

OPIUM: Mitigating Steering Externalities and Over-Refusal via Dual Objective Latent Optimization

Activation steering of large language models can produce externalities, namely degraded safety and over-refusal. To mitigate them, this work proposes cleaning the steering vectors.

用途: LLM activation steering
難易度: Hard
コスト: High

Zero-Observation User Reactivation with Gap-Driven Dimensional Gating

Sequential recommendation models capture continuously observed behavior. This work proposes a way to improve recall for reactivated users who return after a long gap.

深層学習Transformer動画

用途: Recommendation for reactivated users
難易度: Hard
コスト: High

TriAgent: Divergence-Aware Multi-Agent Committees for Cost-Efficient Financial Sentiment Analysis

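Proposes a method for financial sentiment analysis with generative language models. It uses a committee of multiple agents and, to handle text at different granularities, combines a word-level rule-based approach, a phrase-level ...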

深層学習Transformer検出テキスト

用途: 金融分野の感情分析
難易度: Hard
コスト: High

A Structure-Adaptive Random Feature Method for High-Dimensional Elliptic PDEs

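Proposes a method for solving elliptic partial differential equations. Random features can reduce a high-dimensional PDE to a linear problem in the coefficients, but a full-dimensional trial space does not exploit low-dimensional structure. The proposed method ...

用途: Solving high-dimensional elliptic PDEs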
難易度: Hard
コスト: Medium

An Isotropy-Preserving Spectral Cap for Muon: Theory and Three Case Studies

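Matrix-based optimizers such as Muon are used to pretrain language models, but how to preserve the internal geometry of these models is not well understood. To stabilize that internal geometry, the work starts from the hypothesis that an SGD-inherent ...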

深層学習正規化・最適化手法テキスト

用途: モデルの内部幾何学の安定化
難易度: Hard
コスト: High

Learning the Arabic Dialect Continuum as a Continuous Space: A Regression Approach to Speaker Origin Prediction

We present a regression-based approach to Arabic dialect geolocation that models dialectal variation as a cont

深層学習Transformer検出回帰

用途: 検出
難易度: Hard
コスト: High

Efficient Clustering with Provable Guardrails for LLM Inference at Scale

Scaling LLM-based applications to millions of users is bottlenecked by the inference cost and latency of moder

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Data-Poisoning Audits for Causal Effect Estimation

Observational causal analyses increasingly pool records across sites, vendors, and collection systems, creatin

MI向き深層学習Transformer

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知深層学習軽量化・量子化生成テキスト

Multi-Mask Diffusion Language Models for Few-Step Generation

This study proposes multi-mask diffusion language models, which use multiple masks, for few-step text generation ...

用途: Few-step text generation
難易度: Hard
コスト: High

Twoblock clustering trees with coskewness-based dimension reduction: recovering piecewise multivariate linear regimes

The twoblock clustering tree (\tbtree) is introduced as a highly interpretable regression tree for multivariat

説明可能深層学習軽量化・量子化回帰

用途: 回帰
難易度: Hard
コスト: Medium

Can an AI System Be Creative? A Critical Perspective from Art and Engineering

This paper examines the question of whether artificial intelligence (AI) systems can be creative, approached f

深層学習Transformer分類生成画像

用途: 分類
難易度: Hard
コスト: Low

Refusal-Gated Decoding: Preserving Refusal Behavior Under High-Temperature Sampling

High-temperature sampling is one of the primary mechanisms for increasing diversity in LLMs. Recent advances i

用途: 生成
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化画像テキストマルチモーダル

Robostral Navigate

Deploying navigation systems at scale requires a recipe that minimizes sensor assumptions, generalizes across

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Transition-Related Potentials as Markers of Narrative Comprehension in Continuous EEG

Harnessing the potential of electroencephalography (EEG) for brain research is fundamentally limited by intrin

深層学習Transformer検出テキスト

用途: 検出
難易度: Hard
コスト: Low

品質予測/異常検知深層学習軽量化・量子化セグメンテーション画像

U-CFR: Uncertainty-Guided Cascade Forward Refinement for Interactive Segmentation

Interactive image segmentation is critical for efficient image annotation; however, existing methods often req

用途: セグメンテーション
難易度: Hard
コスト: Low

A Framework for Reputation Aware Uninorm-driven Consensus Algorithms for Blockchain Networks

The operation of blockchain is governed by consensus algorithms (CA). Several consensus mechanisms require sig

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

品質予測/異常検知画像検査深層学習軽量化・量子化生成画像テキスト

Demonstrating GenDB: Instance-Optimized and Customized Query Processing Code Generation via LLM Agents

Traditional query processing engines require continuous development and extensions to support new techniques a

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化生成テキスト動画

RealVDeblur: One-Step Diffusion for Generalizable Real-World Video Deblurring

Real-world video deblurring remains challenging due to diverse motion patterns, complex degradations, and the

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformer分類生成画像

Persian Pixel: A large-scale synthetic OCR dataset for Persian language

Optical Character Recognition (OCR) for Persian remains substantially less mature than for Latin-script langua

用途: 分類
難易度: Hard
コスト: High

Closing the Lab-to-Store Gap: A Data-Efficient Post-Training and Experience-Driven Learning VLA Framework for Retail Humanoids

Closing the gap between benchmark performance and reliable real-world operation remains a central challenge fo

深層学習軽量化・量子化異常検知画像テキスト

用途: 異常検知
難易度: Hard
コスト: High

Toward Reliable RGB-D Semantic Segmentation: Handling Missing Modalities via Condition Dropout

RGB-D semantic segmentation has achieved remarkable progress, yet most models assume that RGB and depth are al

深層学習正規化・最適化手法セグメンテーション

用途: セグメンテーション
難易度: Hard
コスト: High

Sound Probabilistic Safety Bounds for Large Language Models

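Proposes a new framework for computing sound probabilistic safety bounds on unsafe generations from large language models (LLMs). As a novel application of Clopper-Pearson confidence intervals, it obtains PAC (probably approximately correct) bounds ...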

深層学習軽量化・量子化生成テキスト音声

用途: Bounding the risk of unsafe LLM generations
難易度: Hard
コスト: High
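
As a rough illustration of the Clopper-Pearson interval mentioned in the entry above, the sketch below computes a one-sided upper confidence bound on a failure rate from `k` observed failures in `n` independent trials. The function names, the one-sided form, and the bisection search are assumptions made for this example; this is not the paper's framework.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def clopper_pearson_upper(k, n, alpha=0.05):
    """One-sided (1 - alpha) upper bound on the true failure rate.

    The bound is the rate p at which seeing k or fewer failures
    has probability exactly alpha; it is found here by bisection.
    """
    if k >= n:
        return 1.0
    lo, hi = 0.0, 1.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if binom_cdf(k, n, mid) > alpha:
            lo = mid  # mid is still plausible, so the bound lies above it
        else:
            hi = mid
    return hi


# Hypothetical usage: 0 unsafe outputs in 100 sampled generations.
print(clopper_pearson_upper(0, 100))
```

With zero observed failures the bound has the closed form `1 - alpha ** (1 / n)`, which gives a quick sanity check on the bisection.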

少数データ向き深層学習Transformer生成テキスト

The Maskability Index: Predicting Task-Objective Alignment in Pretrained Language Models

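Introduces the Maskability Index (MI), a metric of whether a given knowledge relation is well suited to a masked-style objective. Using differences in DepthRank, for a given objective it ...

用途: Improving performance on knowledge-intensive tasks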
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化生成テキスト音声

Pushing the Frontier of Full-Song Generation: Hierarchical Autoregressive Planning Meets Flow-Matching Rendering

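Presents a song generation framework that supports three tasks. Using lyrics, text descriptions, and musical attributes as inputs, the tasks include lyric generation, band music generation, and cover song generation.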

用途: 音楽の生成
難易度: Hard
コスト: High

MI向き品質予測/異常検知深層学習Transformer生成画像動画

StreamHOI: Interaction-aware Temporal Memory Adaptation for Streaming HOI Video Generation

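Short offline video generation is the norm for human-object interaction (HOI), which makes long-duration generation impractical in real use. StreamHOI generates streaming HOI video using ...

用途: Streaming human-object interaction (HOI) video generation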
難易度: Hard
コスト: High

SLAI T-Rex: Full-Parameter Post-training of the DeepSeek-V4 Family on Ascend SuperPOD

Full-parameter post-training of trillion-parameter-scale MoE models introduces substantial system-level challe

品質予測/異常検知深層学習軽量化・量子化テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

PRO-LONG: Programmatic Memory Enables Long-Horizon Reasoning

Long-horizon tasks require sustained perception, reasoning, and exploration, and are a persistent challenge fo

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Global Difference Constraint Propagation for Constraint Programming

差分制約問題を扱うグローバルプロパゲーターを提案し、Finite Domain Propagationアルゴリズムの効率化に寄与。

用途: 制約プログラミングの効率化
難易度: Hard
コスト: Medium

EvoDRC: A Self-Evolving Agentic Framework for Automated DRC Violation Repair

Design Rule Check closureを促進するための自動修正フレームワーク、EvoDRCを開発し、複雑な幾何学的相互作用を考慮した修正を実行する。

用途: デザインルール違反修正の自動化
難易度: Hard
コスト: High

SpikingMOT: A Spike-Driven Multi-Object Tracker

Multi-object tracking (MOT) plays a fundamental role in visual perception, where accurate trajectory predictio

深層学習正規化・最適化手法画像

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Learning to Detect UI Principle Violations via Reinforcement Learning

Small language models and coding agents increasingly generate web front-end code, yet their outputs are typica

用途: 生成
難易度: Hard
コスト: High

深層学習Transformer画像テキストマルチモーダル

Test-Time Training for Modality Order Consistency in Vision-Language Models

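Finds that the performance of vision-language models changes substantially depending on the order in which the image and the question are presented.

用途: Modality order consistency in vision-language models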
難易度: Hard
コスト: High

PyroDash: Cost-Efficient Token-Level Small-Large Language Model Collaborative Inference

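Develops a technique for token-level collaboration between a large language model, which answers accurately, and a cost-efficient small language model.

用途: Cost-efficient collaborative inference between small and large language models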
難易度: Hard
コスト: High

Solar Open 2 Technical Report

We present Solar Open 2, a 250B-A15B Mixture-of-Experts language model built for long-horizon agentic tasks, s

品質予測/異常検知深層学習軽量化・量子化テキスト

用途: Language model for long-horizon agentic tasks
難易度: Hard
コスト: High

TalentCLEF at CLEF2026: Skill and Job Title Intelligence for Human Capital Management

This paper presents the second edition of the TalentCLEF Challenge, which will run as an evaluation lab as par

深層学習Transformer分類検出テキスト

用途: 分類
難易度: Hard
コスト: Low

When Does Knowledge Distillation Hurt? Reliability-Aware Distillation for Low-Resource Language Summarization

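Proposes two methods for making knowledge distillation more reliable. Standard distillation can hurt results on real data, so the work introduces constrained distillation to avoid that harm.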

深層学習軽量化・量子化要約

用途: Reliable knowledge distillation for low-resource summarization
難易度: Hard
コスト: High
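
For readers unfamiliar with the baseline this entry refers to, the sketch below shows the standard temperature-scaled knowledge distillation loss (KL divergence from teacher to student on softened outputs). It is the textbook formulation only, not the paper's reliability-aware or constrained variant; the function names and the example logits are made up for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over logits divided by a temperature (higher means softer)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature**2 * kl


# Hypothetical logits for a 3-class output.
print(distillation_loss([2.0, 1.0, 0.1], [1.5, 1.2, 0.3]))
```

The loss is zero when the student matches the teacher exactly, so it pulls the student toward the teacher whether or not the teacher is right, which is the failure mode the entry is concerned with.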

A Multi-Dimensional Evaluation of Explainability in Media Bias Detection

Detecting media bias automatically is difficult because biased framing is often subtle, yet in domains such as

深層学習Transformer分類検出

用途: 分類
難易度: Hard
コスト: High

Efficient Chain-of-Modality Reasoning via Progressive Compression for Spoken Language Models

Spoken language models (SLMs) enable natural human-computer interaction, but their reasoning ability still lag

深層学習軽量化・量子化QAテキスト音声

用途: QA
難易度: Hard
コスト: High

Sentence Splitter: Uncovering Latent Factual Structure for Self-Supervised Learning

This study proposes Sentence Splitter, a system that splits sentences to uncover their latent factual structure for self-supervised learning.

深層学習軽量化・量子化生成セグメンテーションQA

用途: Self-supervised learning from factual structure in text
難易度: Hard
コスト: High

説明可能CPUで試しやすい深層学習軽量化・量子化分類テキスト音声

Lightweight Person-Place Relation Extraction from Historical Newspapers with Dependency Graphs and Proximity Features

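Extracting relations between people and places is important for interpreting historical newspaper articles. Earlier methods depended on language models, whereas this lightweight approach uses dependency graphs and proximity features to extract ...

用途: Person-place relation extraction from historical newspapers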
難易度: Hard
コスト: High

Look Less, Think Faster: Joint Token-Compute Adaptation for Multimodal LLMs

Multimodal large language models are strong on vision-language tasks but suffer from high inference cost. Look Less, Think Faster jointly adapts tokens and compute ...

深層学習軽量化・量子化画像テキストマルチモーダル

用途: Reducing inference cost of multimodal LLMs on vision-language tasks
難易度: Hard
コスト: High

Evolving Cache Schedules for Fast Diffusion Policy Inference

Diffusion policy inference is expensive. Evolving Cache Schedules optimizes the trade-off between cost and quality, using caching to reduce inference cost ...

用途: Evolved cache schedules for diffusion policy inference
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化QA画像テキスト

Multimodal Large Language Models for Remote Sensing Image Understanding: Domain-Specific or General-Purpose?

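Multimodal large language models for image understanding are powerful, but their capabilities and limits are still not clearly understood. This paper analyzes how capable and how limited they are on remote sensing image understanding ...

用途: Assessing multimodal LLMs for remote sensing image understanding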
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化検出セグメンテーション埋め込み

Not All Patches are Equal: Sampling Matters for Visible-Infrared Pre-Training

Visible-infrared (VIS-IR) alignment is a key pre-training task for robust multi-sensor perception. Most existi

用途: 検出
難易度: Hard
コスト: High

PerceptDrive: Perception Prior World-Action Modeling with Adaptive Expert Routing for End-to-End Autonomous Driving

Frozen perception foundation models encode rich geometric, semantic, and dynamic knowledge. Yet narrow conditi

深層学習軽量化・量子化生成動画自己教師

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformer生成画像テキスト

SHFormer: Dynamic Spectral Filtering Convolutional Neural Network and High-pass Kernel Generation Transformer for Adaptive MRI Reconstruction

Attention Mechanism (AM) selectively focuses on essential information for imaging tasks and captures relations

用途: 生成
難易度: Hard
コスト: High

RIM: A Retrieval-In-Matching Framework for Cross-Domain Global Visual Localization of UAVs

Global visual localization of unmanned aerial vehicles (UAVs) using remote-sensing reference maps has attracte

センサ/時系列深層学習軽量化・量子化検出画像3D

用途: 検出
難易度: Hard
コスト: High

説明可能品質予測/異常検知深層学習軽量化・量子化回帰画像

Factor-Informed Uncertainty Distillation for Gaze Estimation

Deep gaze estimation works well in controlled capture but degrades in unconstrained settings, where systems mu

用途: 回帰
難易度: Hard
コスト: Medium

Importance-Aware OBS Pruning for Diffusion Models

Proposes importance-aware OBS pruning for diffusion models.

深層学習軽量化・量子化生成画像

用途: Pruning diffusion models
難易度: Hard
コスト: High

Toward Seasonal Guidelines for Robust Deep-Learning Sentinel-2 Building Detection in Different Area Types

Works toward seasonal guidelines for robust deep-learning building detection in Sentinel-2 imagery across different area types.

深層学習CNN分類検出セグメンテーション

用途: Building detection in Sentinel-2 imagery
難易度: Hard
コスト: High

STEREOFLOW: Progressive Stereo Matching with StereoDiT and Transition Flow Matching

Stereo matching is a key task in 3D reconstruction. This study treats stereo matching as a probabilistic generative task and proposes a progressive framework built on StereoDiT and transition flow matching ...

深層学習Transformer生成回帰画像

用途: Stereo matching
難易度: Hard
コスト: High

センサ/時系列品質予測/異常検知深層学習RNN / LSTM予測画像マルチモーダル

Forecasting the Number of Harvest-ready Fruits of Sweet Peppers Using Multimodal Time-Series Data

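This study proposes a deep learning framework that integrates multimodal time-series data to forecast the number of harvest-ready sweet pepper fruits.

用途: Forecasting the number of harvest-ready fruits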
難易度: Hard
コスト: High

Unified Prediction and Planning via Conflict-Aware Disjoint Parameter Training

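This study proposes a unified framework for predicting the behavior of moving humans and planning safe robot motion, trained with conflict-aware disjoint parameters.

用途: Unified human motion prediction and safe motion planning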
難易度: Hard
コスト: High

CPUで試しやすい深層学習軽量化・量子化セグメンテーション3D

StrokeSeg2: Stroke Lesion Segmentation in Clinical Research Workflows

Deep learning frameworks like nnU-Net achieve state-of-theart brain lesion segmentation performance but remain

用途: セグメンテーション
難易度: Hard
コスト: High

品質予測/異常検知深層学習Attention機構分類生成画像

MTVDiff: Multimodal Conditional Latent Diffusion for Enhanced Thermal-to-Visible Face Translation

Thermal-to-visible face translation presents fundamental challenges including geometric discontinuities, seman

用途: 分類
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出セグメンテーション画像

Current Injection Spiking Neural Network for Infrared and Visible Image Fusion

Infrared and visible image fusion (IVIF) integrates the complementary information of two modalities into a sin

用途: 検出
難易度: Hard
コスト: High

Robust Activation Map Rectification for Weakly Supervised Volumetric Segmentation: Temporal Coherence as a Free Lunch

Weakly supervised segmentation relies heavily on class activation maps (CAMs) to initially localize target reg

深層学習軽量化・量子化セグメンテーション

用途: セグメンテーション
難易度: Hard
コスト: High

深層学習Attention機構セグメンテーション画像テキスト

Lean-SAM2: Target-Anchored Memory and Encoder Acceleration for SAM2

The Segment Anything Model 2 (SAM2) has advanced temporal promptable segmentation, yet its deployment remains

用途: セグメンテーション
難易度: Easy
コスト: Medium

Physics-Aware Complex-Valued State Space Model with Scattering-Prior Feature Modulation for PolSAR Image Classification

This study proposes a new physics-aware model for classifying polarimetric synthetic aperture radar (PolSAR) images. The model ties the radar imagery to the underlying physical scattering process.

深層学習軽量化・量子化分類埋め込み画像

用途: PolSAR image classification
難易度: Hard
コスト: Low

WASABI: Whole-graph Assignment-based Stabilizer for lAne topology By Inter-frame tracking

WASABI is a whole-graph, assignment-based stabilizer for lane topology that uses inter-frame tracking.

用途: Stabilizing lane topology across frames
難易度: Hard
コスト: Medium

Look Before You Edit: Attention-Guided Camera Placement and Multi-View Alignment for 3D Gaussian Splatting Editing

Proposes attention-guided camera placement and multi-view alignment for editing 3D Gaussian Splatting scenes.

深層学習Transformerテキスト3D

用途: 3D Gaussian Splatting editing
難易度: Hard
コスト: High

EgoRecovery: Acquiring Failure Recovery Ability Through Human Recovery Demonstration

EgoRecovery acquires failure recovery ability from human recovery demonstrations.

用途: Learning failure recovery from human demonstrations
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化検出セグメンテーション動画

Efficient Tracking and Understanding Object Transformations

Tracking objects through state transformations is essential for understanding real-world dynamics. However, ex

用途: Tracking objects through state transformations
難易度: Hard
コスト: High

説明可能深層学習Transformer検出マルチモーダル

An Exploratory Analysis of Pain Localization via Explainable Computational Modeling

Automatic pain localization, which involves identifying the anatomical origin of pain from peripheral physiolo

用途: 検出
難易度: Hard
コスト: High

CPUで試しやすい深層学習Transformer分類動画3D

A Unified Tokenization Framework for Pain Recognition using Heterogeneous 3D Modalities

Pain is a complex and pervasive phenomenon affecting a large percentage of the population, and accurate assess

用途: 分類
難易度: Hard
コスト: High

Point-Selection Fine-Tuning Framework for Robust Point Cloud Classification

Noisy and corrupted points can substantially degrade point cloud recognition performance, especially under cha

深層学習軽量化・量子化分類生成3D

用途: 分類
難易度: Hard
コスト: High

PhenSPINE: A Standardized Benchmark for Spine Pathology Diagnosis

The accurate diagnosis of spinal pathologies depends heavily on radiological interpretation, yet automated sys

品質予測/異常検知深層学習CNN画像テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

深層学習Transformerセグメンテーション画像

A Unified Variational Framework for Deep Weakly Supervised Image Segmentation

We propose a unified variational framework for image segmentation under sparse pixel-level supervision. Our me

用途: セグメンテーション
難易度: Hard
コスト: High

Socially Consistent Multi-Robot Navigation Using Decoupled Planning and Trajectory Coordination

The successful integration of mobile robots in human-centric environments requires navigation that is not only

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

FELT: Generating Tactile Signals from Vision for Visuo-Tactile Manipulation

The sense of touch is central to manipulation, especially when vision is occluded or ambiguous. Although combi

センサ/時系列深層学習軽量化・量子化画像

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Towards Capability-Aware Traversability Navigation for Unstructured Environments

Estimating traversability in unstructured environments requires conditioning on robot embodiment, as the same

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Safe and Scalable Multi-Drone Payload Transport via CBF-based Reinforcement Learning with Zero-Shot Sim-to-Real Transfer

Multi-drone payload transportation has emerged as a promising research paradigm with potential applications in

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

SeededGrasp: Language-Guided Grasping in Complex Scenes with Multiple Embodiments

SeededGrasp uses a vision-language model (VLM) for language-guided grasping of objects in complex scenes. Rather than predicting grasps directly, the VLM indicates grasp locations in 3D space ...

深層学習軽量化・量子化生成テキスト3D

用途: 複雑なシーンで物体の把持を実現
難易度: Hard
コスト: High

SOPD-SocialNav: Selective On-Policy Distillation for Vision-Language Social Navigation

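SOPD-SocialNav applies selective on-policy distillation to vision-language social navigation, so that a robot can understand its environment and human behavior and navigate accordingly.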

深層学習軽量化・量子化テキストマルチモーダル

用途: On-policy distillation for vision-language social navigation
難易度: Hard
コスト: High

Defer to Plan: Adaptive Multi-Agent Fusion for End-to-End V2X Driving

Defer to Plan is an end-to-end V2X driving system in which vehicles exchange information and adaptively fuse it, so that each vehicle can choose a safe route.

用途: Multi-agent fusion for end-to-end V2X driving
難易度: Hard
コスト: Medium

LENS: LLM-guided Environment Simplification for Planning and Control in Clutter

Despite recent advances in general-purpose robotic manipulation, real-world multi-object clutter remains chall

深層学習軽量化・量子化マルチモーダル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

github · GitHub available · 2026-07-22

pytorch_geometric — Graph Neural Network Library for PyTorch

A graph neural network library for PyTorch.

用途: 実装・検証基盤
難易度: Easy
コスト: Medium

github · GitHub available · 2026-07-22

picollm — On-device LLM Inference Powered by X-Bit Quantization

On-device LLM inference using X-bit quantization.

用途: On-device LLM inference
難易度: Easy
コスト: High

github · GitHub available · 2026-07-22

ncnn — ncnn is a high-performance neural network inference framework optimized for the mobile platform

Neural Network Inference用の高性能フレームワークです。モバイルプラットフォームに最適化されています。

用途: Neural Network Inference
難易度: Easy
コスト: Medium

Deep Shape Regression for Planar Curves with Multimodal Covariates

Proposes a deep learning model for regressing the shape of open planar curves from multimodal covariates.

深層学習CNN回帰画像マルチモーダル

用途: Shape regression with multimodal covariates
難易度: Hard
コスト: High

RELTA-SGLD: Relative-Growth Localized Taming for Nonconvex Stochastic-Gradient Langevin Learning

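Proposes RELTA-SGLD, a relative-growth localized taming scheme for stochastic-gradient Langevin learning in nonconvex settings.

用途: Tamed stochastic-gradient Langevin learning for nonconvex problems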
難易度: Hard
コスト: High

Total Variation Distance Estimation in Autoregressive Models

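A study of total variation (TV) distance estimation in autoregressive models, useful for assessing how well LLMs can be told apart. It considers three access models with different estimation methods and, in experiments, ...

用途: TV distance estimation for comparing LLMs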
難易度: Hard
コスト: High
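
For reference, the quantity this entry estimates can be written down directly when both distributions are small enough to enumerate. The sketch below computes the total variation distance between two discrete distributions given as dictionaries; the function name and the toy next-token distributions are assumptions for illustration. For full autoregressive sequence distributions the support is far too large to enumerate, which is why estimation methods are needed.

```python
def tv_distance(p, q):
    """Total variation distance: half the L1 distance between p and q.

    p and q map outcomes to probabilities; missing outcomes count as 0.
    """
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in support)


# Hypothetical next-token distributions from two models.
model_a = {"yes": 0.5, "no": 0.5}
model_b = {"yes": 0.9, "no": 0.1}
print(tv_distance(model_a, model_b))
```

The value ranges from 0 for identical distributions to 1 for distributions with disjoint support.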

Strong Gravitational Lensing Posterior Sampling in Pixel-Space Using Diffusion Models and Recurrent Inference Machines

Modeling galaxy-galaxy strong gravitational lenses to infer the brightness of the source galaxy and the mass d

深層学習Transformer生成画像

用途: 生成
難易度: Hard
コスト: High

Provable diffusion-based posterior sampling for linear inverse problems via DDIM

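Proposes a diffusion-based posterior sampling algorithm for solving linear inverse problems, which is expected to improve the accuracy of the recovered solutions.

用途: Solving linear inverse problems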
難易度: Hard
コスト: High

On the sensitivity of machine-learned probabilistic weather forecast models to scale-aware scoring rules

Probabilistic forecast models can be machine-learned from data using loss functions based on scoring rules suc

深層学習Transformer予測

用途: 予測
難易度: Hard
コスト: High

Deep learning-based prediction of time-resolved adhesive forces in viscoelastic Hertzian contacts

Fast prediction of the response of adhesive soft viscoelastic contacts represents a current challenge in soft

説明可能深層学習CNN

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Elicitation without Backpropagation: Steering Model Behavior by Optimizing the Latent Posterior

In the \emph{latent posterior model} of transformer behavior, the next-token distribution arises from a poster

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Uncertainty quantification in mechanics: A unified Bayesian perspective

Uncertainty quantification (UQ) is essential to experimental mechanics, but has become particularly relevant i

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Spiking Neural Networks for fMRI-Based Visual Semantic Decoding

Proposes a method that uses spiking neural networks to decode visual information from fMRI data, and validates it.

深層学習Transformer回帰画像

用途: Decoding visual information from fMRI
難易度: Hard
コスト: Medium

How the fly holds a single goal: normalization, not selection, in Drosophila FC2

A walking fly steers toward a goal direction, held as a bump of activity across the FC2 neurons of the fan-sha

用途: 分類
難易度: Hard
コスト: Low

Knowledge-Centric Self-Improvement

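Studies knowledge-centric self-improvement and proposes a method that makes self-improvement more effective by focusing on knowledge.

用途: Knowledge-centric self-improvement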
難易度: Hard
コスト: High

On the Computational Complexity of Structural Generalization

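To understand the computational complexity of compositional (structural) generalization, the study examines how that complexity relates to data size.

用途: Understanding the computational complexity of structural generalization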
難易度: Hard
コスト: Medium

Selective State-Space Adaptation and Retrieval for Language Model Reasoning

Low-rank adaptation introduces a static learned update applied identically to every input. The update provides

深層学習RNN / LSTMテキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

表形式向き説明可能深層学習軽量化・量子化テキスト表形式

CircuitKIT : Circuit Discovery, Evaluation, and Application Toolkit for Mechanistic Interpretability

Proposes a toolkit for interpreting machine learning models, helping users understand how a model works internally.

用途: Interpreting machine learning models
難易度: Easy
コスト: Low

Inference-Time Steering for Cross-Lingual Factual Consistency in LLMs

Although Large Language Models (LLMs) demonstrate remarkable multilingual fluency, their internal knowledge re

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化生成テキスト

AdaFlash: Adaptive Speculative Decoding via On-Policy Distilled Diffusion Drafters

Proposes adaptive speculative decoding in which diffusion-based draft models are trained by on-policy distillation.

用途: Adaptive speculative decoding
難易度: Hard
コスト: High

Translation as Augmentation: Effect of Translated Data on Assessment of Difficulty

Reliable Text Difficulty Assessment is a prerequisite for valid text simplification workflows and personalized

深層学習Transformer回帰翻訳テキスト

用途: 回帰
難易度: Hard
コスト: High

DAIS: Dependency-Aware Intermediate QA Supervision for Complex Reasoning

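Chain-of-Thought (CoT) supervision exposes the reasoning that leads to the final answer, which strengthens the quality of the intermediate reasoning. In many cases, however, reaching the earlier stages

用途: Dependency-aware intermediate QA supervision for complex reasoning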
難易度: Hard
コスト: High

センサ/時系列深層学習Transformerテキスト音声教師あり

Content is What Remains: Invariant Speech Tokenization from Parallel Utterances

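For a language model to use the same word when it is spoken by different speakers or under different conditions, the word content has to be extracted from the speech. In current language models, however, speaker and environment characteristics are often mixed into the word representation. Here

用途: Extracting word content that is invariant to speaker and recording conditions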
難易度: Hard
コスト: Medium

説明可能品質予測/異常検知深層学習軽量化・量子化テキスト

Verifiable Self-Evolution for Open-Ended Dialogue Skills via Future-Feedback Prediction

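Develops self-evolving dialogue skills to strengthen a frozen language model. In this setting, user responses are not affected by the model's evolution, so the symmetry of the dialogue has to be maintained.

用途: Self-evolution of dialogue skills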
難易度: Hard
コスト: Medium

H$^2$SD: Hybrid Hindsight Self-Distillation

Reinforcement learning with verifiable rewards (RLVR) provides reliable outcome supervision for language model

深層学習軽量化・量子化テキスト強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Constrained CTC Decoding for Efficient Diacritic Restoration

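Diacritic restoration for Arabic is an important problem, but one of the difficulties is the shortage of data. To address this, the work uses Connectionist Temporal Classification (CTC) with constrained

深層学習軽量化・量子化分類テキスト音声

用途: Constrained diacritic restoration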
難易度: Hard
コスト: Low

品質予測/異常検知深層学習Attention機構検出音声

Transcription Policy as a Latent Variable: Activating Controllable Verbatim ASR with Word-Level Timing

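The transcription policy (verbatim vs. intended) affects how current speech recognition models are evaluated, yet it often has no influence on how the models are trained. In this work, however, the policy does influence training

用途: Controllable verbatim ASR with word-level timing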
難易度: Hard
コスト: High

From a Multilingual Streaming ASR Backbone to Kenyan-Language Systems: Data-Centric Adaptation of Nemotron 3.5 for Kikuyu, Dholuo, and Kalenjin

Automatic speech recognition (ASR) for African languages is constrained by orthographic inconsistency, annotat

深層学習RNN / LSTM分類生成テキスト

用途: 分類
難易度: Hard
コスト: Low

HindsightBench: A Black-Box Behavioral Audit Protocol for Parametric Hindsight in Time-Indexed LLM Decision Tasks

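When carrying out decision tasks, large language models tend to leak parametric knowledge that includes facts which have already been realized. Although it is difficult to verify how an LLM actually carried out a decision task, this is indeed the case

用途: Auditing LLMs on financial decision tasks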
難易度: Hard
コスト: High

HPD-Parsing: Hierarchical Parallel Document Parsing

Efficient teamwork typically combines global coordination with parallel execution, a principle not yet fully r

深層学習軽量化・量子化生成テキストマルチモーダル

用途: 生成
難易度: Hard
コスト: High

RF-Agent: A Practical Framework for Building Language Agents for RFIC Design

Large language models (LLMs) have driven rapid progress in electronic design automation (EDA), yet their appli

用途: 生成
難易度: Hard
コスト: High

CPUで試しやすい品質予測/異常検知深層学習軽量化・量子化生成テキスト

RAGAL: A Frugal, Fully Local Retrieval-Augmented Assistant for Technical Support at a Government Agency

Public institutions hold large volumes of sensitive documents and support tickets that cannot leave the premis

用途: 生成
難易度: Hard
コスト: High

Dual Attention Residuals

Recent work extends Transformer residual pathways along two complementary axes: historical retrieval selects i

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

説明可能深層学習Transformer生成強化学習

Stale but Stable: Staleness-Adaptive Trust Regions for Stabilizing Asynchronous Reinforcement Learning

Proposes staleness-adaptive trust regions for stabilizing asynchronous reinforcement learning.

用途: Stabilizing asynchronous RL
難易度: Hard
コスト: High

Rationale-Guided Knowledge Distillation for Cross-Lingual Stance Detection

Stance detection aims to identify whether a text expresses a favorable or opposing attitude toward a given tar

深層学習軽量化・量子化検出テキスト

用途: 検出
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformerテキスト

Mark, Don't Erase: Token Inoculation for Dual-Use Knowledge in LLMs

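Adds control tokens to a model that holds hazardous knowledge, with the goal of having the model handle that knowledge according to the control tokens.

用途: Safety management of dual-use knowledge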
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformer生成検索テキスト

PLAID-PRF: Pseudo-Relevance Feedback with Centroid-like Tokens in PLAID

Multi-vector dense retrieval models, such as ColBERT, achieve strong retrieval effectiveness by modelling fine

用途: 生成
難易度: Hard
コスト: Low

品質予測/異常検知深層学習RNN / LSTM翻訳テキスト

LatentMT: Machine Translation with Latent Reasoning

Latent-reasoning looped language models (LoopLMs) offer a different scaling path for machine translation (MT):

用途: 翻訳
難易度: Hard
コスト: High

深層学習Transformer分類生成セグメンテーション

Pathologist Attention-Aligned Report Generation for Prostate Histopathology

The allocation of visual attention by pathologists during cancer diagnosis is a highly selective process that

用途: 分類
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化検出セグメンテーションテキスト

EGRNet: A Lightweight Semantic Segmentation Network with Edge-Gated Refinement and Adversarial Sensing

As autonomous systems and smart cities continue to evolve, the demand for efficient and robust scene understan

用途: 検出
難易度: Hard
コスト: Medium

VQ-Transplant: Efficient VQ-Module Integration for Pre-trained Visual Tokenizers

Vector Quantization (VQ) underpins modern discrete visual tokenization. However, training quantization modules

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Geospatial Diffusion-based Evolution Synthesis (GeoDES) for Storm-Centered Weather Augmentation

While machine learning-based weather models hold significant promise, they struggle to predict the detailed st

深層学習軽量化・量子化生成画像動画

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformerテキスト動画マルチモーダル

BLUE: Semantics-Preserving Video Compression for Efficient Vision-Language Surveillance Analytics

Continuous surveillance video creates a growing storage, transmission, and inference burden for enterprise vid

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Detect Early, Escalate Rarely: Anytime Detection of AI-Generated Video from the Compressed Bitstream

Detectors for AI-generated video are evaluated offline. A clip is decoded to pixels and scored once, increasin

CPUで試しやすい深層学習CNN検出画像テキスト

用途: 検出
難易度: Hard
コスト: High

MI向き深層学習Transformer生成画像テキスト

Appearance Pointers -- Multimodal Region Control of Diffusion Transformers

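In image generation, controlling materials, objects, and regions is difficult. Diffusion Transformers can process text and images together, but they lacked a mechanism for deciding how strongly each should influence the result. The

用途: Multimodal control of image generation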
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformer生成画像

ROMS-IMLE: A Minimalist Approach to Competitive Single-Step Generative Modelling

Proposes a new approach to building generative models that makes construction more efficient while retaining strong expressive power.

用途: Building generative models
難易度: Hard
コスト: High

InstructMixup: Instruction-Guided Salient Patch Editing for Robust Data Augmentation

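Proposes InstructMixup, a method that extends Mixup, which blends image and video data, so that the mixing follows written instructions. This augments the data while preserving its content and labels.

深層学習Transformer分類検出生成

用途: Extending Mixup for data augmentation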
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformerセグメンテーション画像テキスト

IGGT4D: Streaming 4D Instance-Grounded Geometry Transformer

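Real-world spatial intelligence requires understanding video that streams continuously through a scene. To address this, the work proposes a model that can understand 4D space.

用途: Understanding streaming video of a scene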
難易度: Hard
コスト: High

Stochastic Multi-Objective Kinodynamic Planning Against Adversaries

Addresses kinodynamic planning in complex environments.

用途: Kinodynamic planning
難易度: Hard
コスト: Low

Cognitive Dual-Process Planning for Autonomous Driving with Structured Scene Knowledge and Verifiable Reasoning-Action Consistency

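Planning for autonomous driving involves scene understanding, timely reasoning, and action selection, but combining these elements is difficult. To address this, the work separates out scene understanding so that planning becomes safe and effective

深層学習軽量化・量子化画像テキストマルチモーダル

用途: Decoupled planning system for autonomous driving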
難易度: Hard
コスト: High

Bayesian Retraction Optimization for Tissue Attachment Mapping in Surgical Dissection

With growing surgeon shortages, automating surgical sub-tasks such as tissue dissection offers a promising ste

用途: 分類
難易度: Hard
コスト: Low

Design and stability analysis of an underactuated hand with passively rotating fingers

This paper presents an innovative design and stability analysis of an underactuated robotic finger with spatia

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

Beyond Transformers: Linear Attention Policy for Open-Vocabulary Object Goal Navigation

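In open-vocabulary object goal navigation, the agent receives only partial observations. Updating its internal state is important for performance, and this has to be handled by the policy network. Recent approaches use Transfor

深層学習Transformerテキスト3D

用途: Open-vocabulary object goal navigation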
難易度: Hard
コスト: High

Confidence-Gated Vision-Only Heading Alignment for UAV-UGV Cooperative Systems

A UAV-UGV cooperative system shares common heading information so that the vehicles can act appropriately and cooperate safely and efficiently.

用途: UAV-UGV cooperative systems
難易度: Hard
コスト: Medium

表形式向き深層学習軽量化・量子化テキスト3D強化学習

Intelligent Multi-UAV Navigation in ITNTNs: A Hierarchical LLM Approach

The deployment of high-speed Uncrewed Aerial Vehicles (UAVs) in 3D aerial highways necessitates robust coordin

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化テキスト動画

ABot-World-0: Infinite Interactive World Rollout on a Single Desktop GPU

We present ABot-World-0, an action-conditioned video world model for real-time, long-horizon closed-loop inter

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

Generative World Renderer at the Speed of Play

Generative world renderer AlayaRenderer receives structured world states exported from physics engines and syn

用途: 生成
難易度: Easy
コスト: Medium

説明可能深層学習Transformer生成画像テキスト

Text Template Tokens Are Implicit Semantic Registers in Diffusion Transformers

Text-to-image diffusion transformers (DiTs) jointly process text and image tokens, yet their internal computat

用途: 生成
難易度: Easy
コスト: High

品質予測/異常検知深層学習Transformer生成画像テキスト

Mage-Flow: An Efficient Native-Resolution Foundation Model for Image Generation and Editing

Large-scale visual generators are increasingly capable but costly to train, fine-tune, and deploy. We introduc

用途: 生成
難易度: Easy
コスト: High

ISO: An RLVR-Native Optimization Stack

Reinforcement learning with verifiable rewards (RLVR) is rapidly advancing the reasoning capabilities of langu

深層学習正規化・最適化手法テキスト強化学習

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

Where Should Optimizer State Live? Tiered State Allocation for Memory-Efficient Mixture-of-Experts Training

Optimizer state is the largest single line item in the memory budget of mixture-of-experts (MoE) training: on

深層学習正規化・最適化手法テキスト

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

githubGitHubあり2026-07-21

AI-For-Beginners — 12 Weeks, 24 Lessons, AI for All!

Provides an introductory course on AI.

用途: Introduction to AI
難易度: Easy
コスト: Medium

githubGitHubあり2026-07-21

awesome-datascience — :memo: An awesome Data Science repository to learn and apply for real world problems.

A repository that helps with learning data science and applying it to real-world problems.

深層学習画像

用途: Learning data science
難易度: Easy
コスト: Medium

githubGitHubあり2026-07-21

ml-engineering — Machine Learning Engineering Open Book

Machine Learning Engineering Open Book provides resources that support developing and operating machine learning systems.

用途: Machine learning engineering resources
難易度: Easy
コスト: Medium

PAC--Bayes Bounds on Quotient Parameter Spaces: Geometry-induced Implicit-Bias Priors

Overparameterized models often have continuous parameter symmetries, so different parameters define the same p

深層学習正規化・最適化手法回帰

用途: 回帰
難易度: Hard
コスト: High

Vector Search As Nearest Neighbor Matching: RAG-based Policy Learning in Causal Inference

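Proposes policy learning based on causal inference: when choosing a policy, the effectiveness of an action is evaluated from the most similar evidence retrieved by nearest neighbor matching.

用途: Policy learning in causal inference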
難易度: Hard
コスト: Low

COVAriance-Induced Fairness Gap Penalty for Subgroup-Fair Clustering

Fair clustering aims to make cluster assignments independent of sensitive attributes, but this goal becomes ch

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

表形式向き深層学習Transformerテキスト表形式3D

Topological Signatures of Context-Level Reliability in TabPFN

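TabPFN, a tabular prediction model, performs inference from a conditioning support set and an input query without task-specific training. To understand its internal behavior at inference time, the work uses zigzag persistent homology, so that Tab

用途: Assessing the reliability of TabPFN predictions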
難易度: Hard
コスト: High

An Adjoint-Sensitivity Framework for Lost-in-the-Middle Phenomena in Causal Residual Transformers

We develop an adjoint-sensitivity framework for positional influence in causal residual Transformers and separ

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Equality, Equity, and Causality in Fairness Research: A Commentary on Cheng (2026)

This is an invited commentary on the Psychometrika focus article "Fairness Issues and Evaluation in Psychometr

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

An efficient adaptive dimension selection algorithm for multidimensional probit graded response models

Multidimensional graded response models (MGRMs) are widely used for analyzing ordinal questionnaire data in ps

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Program Synthesis for Simulation-Based Inference: Joint Model Selection and Parameter Estimation

Neural simulation-based inference enables parameter estimation for complex models, but typically requires the

用途: 生成
難易度: Hard
コスト: High

Conditioned Direct Feedback Alignment via Activity and Error Geometry

Direct feedback alignment (DFA) trains hidden layers with fixed random projections of the output error, avoidi

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Scalable and Efficient Joint Spiking Embedding Predictive Architecture for Large-Scale Dynamic Graphs

Dynamic graph learning aims to capture evolving structural and semantic patterns in real-world systems, such a

深層学習軽量化・量子化分類検出生成

用途: 分類
難易度: Hard
コスト: High

arxivGitHubあり2026-07-20

For What Reason? Interpreting Models' Encoding of Causation and Antithesis

Discourse relations provide document structure, critical to language understanding and enabling language model

用途: 技術検証・論文読解補助
難易度: Easy
コスト: Medium

Reasoning Fine-Tuning Induces Persistent Latent Policy States

Reasoning-specialized language models show large performance gains over base models, yet the internal changes

深層学習軽量化・量子化埋め込みテキスト

用途: 埋め込み
難易度: Hard
コスト: Low

CANDOR: Chance-Calibrated Discordance in Frozen Foundation Encoders

Frozen encoders are chosen by how well a lightweight head reads a finding from their features, not whether the

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Convolution for Large Language Models

Large language models (LLMs) largely rely on Transformers, where self-attention provides global token interact

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

品質予測/異常検知深層学習Transformer分類テキスト

SWE-Pruner Pro: The Coder LLM Already Knows What to Prune

Pruning long context for coding agents has been a vital technology for efficient context management. While exi

用途: 分類
難易度: Hard
コスト: High

説明可能品質予測/異常検知深層学習軽量化・量子化テキスト

PPL-Factory: Task-Aware and Budget-Aware Data Selection from Language Modeling to Reasoning

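To improve training data quality, proposes a method that selects data based on assigned labels and excludes the data that is not selected from training.

用途: Data selection based on assigned labels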
難易度: Hard
コスト: High

Operational Hallucination and Safety Drift in AI Agents

Large language models (LLMs) serving as planners in tool-using autonomous agents introduce dynamic reliability

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

MI向き品質予測/異常検知深層学習軽量化・量子化生成テキスト3D

Do Language Models Dream of Binding Molecules? Benchmarking LLMs under Spatial Constraints

Structure-based drug design (SBDD) leverages the 3D structure of protein targets, often complemented by other

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習軽量化・量子化テキスト強化学習

LLM-as-a-Coach: Experiential Learning for Non-Verifiable Tasks

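Targets the optimization of non-verifiable tasks, including those judged by rubric-based evaluation. Conventional RL only uses the evaluation signal, and the model itself never reflects or improves on its own. Here an LLM is treated as a coach, and the model reflect

用途: Optimizing non-verifiable tasks with rubric-based evaluation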
難易度: Hard
コスト: High

FinSAgent: Corpus-Aligned Multi-Agent RAG Framework for Evidence-Grounded SEC Filing Question Answering

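Financial question answering requires retrieving evidence scattered across long, standardized, and highly redundant Securities and Exchange Commission (SEC) filings. In many existing retrieval-augmented and multi-agent systems, the choices depend on model priors and the target filing

深層学習軽量化・量子化生成QAテキスト

用途: Question answering over SEC filings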
難易度: Hard
コスト: Low

VDAR-Router: Adaptive LLMs Routing via Verbalized Query Difficulty Analysis Retrieval

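As large language models are increasingly deployed in production systems, choosing a cost-effective model becomes important. LLM routing has been proposed to assign queries to models. However, existing routing methods select a model based on the input query, and models that do not fit

用途: LLM routing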
難易度: Hard
コスト: High

Bridging the Sim-to-Real Gap under Real-Time Constraints in Autonomous Racing

Autonomous racing exposes the sim-to-real gap under extreme operating conditions characterized by high speed,

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

センサ/時系列深層学習Transformer分類異常検知画像

Recti-Q: Feature-Space Rectification for Out-of-Distribution-Robust Quantized Perception in Edge Robotics

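Proposes a new machine learning approach that stabilizes image recognition accuracy in edge robotics by improving performance after quantization and mitigating the effect of distribution shift.

用途: Stable image recognition in edge robotics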
難易度: Hard
コスト: High

DASH Robot: Minimalistic Design and Optimal Aerial-Terrestrial Locomotion via Contact-Implicit Control

We present a novel and minimalistic design of an aerial-terrestrial robot DASH: Ducted Aerial Spring Hopper. T

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

深層学習Transformer埋め込み画像テキスト

Patch Policy: Efficient Embodied Control via Dense Visual Representations

Proposes patch-based policy learning that uses dense visual representations to make robot control more efficient.

用途: Control of resource-constrained robots
難易度: Hard
コスト: High

センサ/時系列深層学習軽量化・量子化画像テキストマルチモーダル

FM-VLA: Force-based Memory for Vision-Language-Action Models in Contact-Rich Manipulation

Proposes FM-VLA, a force-based memory method that addresses limitations of existing VLA models.

用途: Handling the state of manipulated objects
難易度: Hard
コスト: High

Towards Torque-Driven Reinforcement Learning for Quadruped Locomotion

Reinforcement learning (RL) for legged robots is advancing locomotion, demonstrating its ability to adapt to n

センサ/時系列深層学習軽量化・量子化強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Isaac Sim-to-Real: Reinforcement Learning based Locomotion for Quadrupeds

Proposes Isaac Sim-to-Real, a reinforcement learning based locomotion method that addresses limitations of existing locomotion methods

用途: Autonomous walking for quadruped robots
難易度: Hard
コスト: High

Value-Aware Prediction for Robust Multi-Agent Coordination Under Communication Loss

Robust multi-agent coordination relies heavily on inter-agent communication, which is frequently disrupted by

深層学習正規化・最適化手法テキスト強化学習

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

Task-Space Constrained Stochastic Trajectory Optimization for Time-Optimal Forestry Crane Motion Planning

Autonomous cranes have to move timber safely and efficiently. To reduce cost and ensure safety in timber handling, the crane's motion planning is optimized. In this work, VP-STO (Via-Point-bas

用途: Motion planning for forestry cranes
難易度: Hard
コスト: Low

深層学習Transformer分類検出セグメンテーション

Seg2Grasp: A Robust Modular Suction Grasping in Bin Picking

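Seg2Grasp aims to improve the performance of bin-picking robots and consists of three modules: segmentation, grasping, and class filtering. The segmentation module uses a Transformer-based obje

用途: Improving a bin-picking robot's ability to pick up objects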
難易度: Hard
コスト: Low

Predictive Training with Latent Imagination for Visual Quadruped Navigation

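Proposes a predictive method for quadruped robot navigation. The robot selects actions from its current observation and short-term memory, but this approach is limited because it cannot anticipate how obstacles will move. This limitation

用途: Robot navigation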
難易度: Hard
コスト: High

GeoWorldAD: Geometry World Action Model for Autonomous Driving

Autonomous driving requires both safe and efficient planning decisions in dynamic 3D environments. Although re

深層学習Transformer画像動画3D

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Test-Time Scaling for World Action Models via Zero-Shot Geometric Evaluation

Test-time scaling improves foundation-model inference by spending additional computation, but robot control re

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Breaking Network Densification Limits with Distributed Cooperative Massive Access (DCMA)

In this work, we investigate the performance of the distributed cooperative massive access (DCMA) framework in

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Nonexistence of Simultaneously EF1 and Pareto Optimal Allocations for Submodular Valuations

The existence of allocations of indivisible goods that are simultaneously fair (envy-free up to one item (EF1)

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

AlayaWorld: Interactive Long-Horizon World Modeling -- Full Technical Report

Unlike conventional video game development, which relies on labor-intensive pipelines for asset production, an

用途: 生成
難易度: Easy
コスト: High

ConsiSpace: Learning Geometric Consistency Matters for Video Spatial Reasoning

Video spatial reasoning is essential for navigation-oriented perception and long-video question answering, whe

深層学習軽量化・量子化QAテキスト動画

用途: QA
難易度: Easy
コスト: High

HOMIE: Human-object Centric Video Personalization via Multimodal Intelligent Enchancement

Human-object centric video personalization (HOCVP) is a core task within subject-driven video generation. Howe

用途: 生成
難易度: Easy
コスト: High

FlashRT: Agent Harness for Guiding Agents to Deploy Real-Time Multimodal Applications

Real-time multimodal applications, including voice agents and interactive video generation, compose heterogene

深層学習軽量化・量子化生成テキスト音声

用途: 生成
難易度: Easy
コスト: High

Self-State Attacks on Self-Hosted AI Agents: How Far Can OS Defenses Go?

Self-hosted AI agents read and write their own memory and configuration files to function. An agent may get co

用途: 検出
難易度: Easy
コスト: Medium

ReViV: Reconstructing the Viewer and the View in 4D from Monocular Egocentric Video

Egocentric devices, such as wearable front-facing cameras, provide a unique perspective for capturing the cont

深層学習Transformer生成動画3D

用途: 生成
難易度: Easy
コスト: High

githubGitHubあり2026-07-20

pytorch-lightning — Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

A library for fast AI model training, usable on anywhere from 1 to 10,000+ GPUs.

用途: AI model training
難易度: Easy
コスト: High

githubGitHubあり2026-07-20

pruna — Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

A model optimization framework for developers that makes models faster and more efficient.

深層学習Transformer分類音声

用途: Model optimization
難易度: Easy
コスト: Low

Kernelized Linear Attention: Breaking the Capacity Wall with Symmetric Cones

Linear attention promises constant-time recurrent inference but degrades sharply on associative recall. We for

深層学習RNN / LSTM異常検知

用途: 異常検知
難易度: Hard
コスト: High

Efficient Sequential Evaluation of Large Language Models

We study the problem of sequentially evaluating a new large language model (LLM) on a fixed question set using

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Kernel Regression with Tensor Trains and Hadamard Overparameterization

Kernel regression with tensor trains and Hadamard overparameterization (KReTTaH) is introduced as a training-d

説明可能深層学習Transformer回帰

用途: 回帰
難易度: Hard
コスト: High

The Resolution of Causal Heterogeneity

Causal subgroup analyses often report a small number of groups summarizing treatment effect heterogeneity, as

MI向き深層学習軽量化・量子化

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

表形式向き深層学習Transformer表形式強化学習

Non-Asymptotic Best Policy Identification Guarantees in Online Reinforcement Learning

In this work we study the Best Policy Identification (BPI) problem in online, tabular Reinforcement Learning.

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

Expressivity of Shallow Neural Networks Over Finite Fields

We study the expressivity of shallow polynomial neural networks (PNNs) with monomial activation functions over

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Rethinking the Suitability of Reinforcement Learning Algorithms Under Practical Transfer Constraints

Transfer-oriented reinforcement learning requires evaluating algorithms along dimensions that go beyond standa

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

センサ/時系列深層学習RNN / LSTM画像3D

DROID-ANCHOR: Odometry-Anchored Recurrent Metric Depth Estimation

Precise metric depth estimation is fundamental for autonomous robot navigation, yet monocular systems inherent

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Articulated Humanoid Head for a Robot Receptionist Capable of Natural Human Interaction

Humanoid robots have become increasingly popular in applications such as social interaction, education, and se

深層学習軽量化・量子化分類テキスト

用途: 分類
難易度: Hard
コスト: Low

huggingfaceHugging Faceあり2026-07-19

HarmoHOI: Harmonizing Appearance and 3D Motion for Multi-view Hand-Object Interaction Synthesis

Hand-Object Interaction (HOI) synthesis is a cornerstone for animation production and embodied AI. Despite the

品質予測/異常検知深層学習Transformer生成画像動画

用途: 生成
難易度: Easy
コスト: High

huggingfaceGitHubありHugging Faceあり2026-07-19

Distilled Reinforcement Learning for LLM Post-training

Large language model (LLM) post-training is essential for improving reasoning, adaptation, and alignment. Exis

説明可能品質予測/異常検知深層学習軽量化・量子化テキスト強化学習

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-19

The Geometry of Semantic Space: A Continuous Geometric Framework for the Transformer Architecture

We present a continuous geometric framework that models the discrete algebraic operations of the Transformer a

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

Deep Adaptive Bayesian Screening

We introduce Deep Adaptive Bayesian Screening (DABS), a method for performing adaptive factorial screening in

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

Dropout and Random Gradient Masking Are Asymptotically Equivalent in Large ResNets

Dropout and Random Gradient Masking (RaM) are two training techniques used to improve performance in deep lear

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

The Value of Depth in Message Passing on Sparse Graphs: A Kesten-Stigum Dichotomy

How deep does a graph neural network need to be on a sparse graph? We study its purest statistical form: node

深層学習グラフニューラルネット分類テキスト

用途: 分類
難易度: Hard
コスト: Low

表形式向き深層学習正規化・最適化手法テキスト表形式

Backpropagation-Free Trunk Training via the Split Forward Gradients

Backpropagation makes training deep networks memory intensive because it must store intermediate activations.

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Decision Variable Analysis-Guided Differentiated Fuzzy Search for Large-Scale Multi-Objective Optimization

Large-scale multi-objective optimization problems (LSMOPs) are challenging due to their high-dimensional decis

条件最適化深層学習Transformer生成

用途: 生成
難易度: Hard
コスト: Medium

説明可能深層学習軽量化・量子化生成テキストマルチモーダル

G2-Nav: Grounded and Guarded Vision-Language Costmaps for Robot Social Navigation

Social navigation requires the robot to reason and respond in complex real-world environments. While recent wo

用途: 生成
難易度: Hard
コスト: High

PREFAIL: Identifying Precursors to Failures in Robotic Lift-and-Place Tasks to Improve Task Execution Performance

Non-prehensile manipulation enables flexible material handling with part carriers, but friction-based support

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

A BIM-enabled, Agent-based Discrete-event Simulation Platform for Robotic Studies: A Method based on Graph Theory

Indoor robots are increasingly employed for facility management tasks such as cleaning and inspection. These a

品質予測/異常検知深層学習軽量化・量子化検出

用途: 検出
難易度: Hard
コスト: Medium

Approximate Relative Entropy Constraints for Nonlinear Covariance Steering Under Distribution Ambiguity

Covariance steering provides an efficient framework for designing linear stochastic feedback policies, but its

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

AI-Augmented Model Predictive Control for Safe and Adaptive Rendezvous and Proximity Operations

Autonomous rendezvous and proximity operations (RPO) in adversarial orbital environments require guidance arch

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

SAGE: A Socially-Aware Generative Engine for Heterogeneous Multi-Agent Navigation

Safe and socially compliant navigation in open human-robot environments requires robots to reason about hetero

用途: 生成
難易度: Hard
コスト: High

huggingfaceGitHubありHugging Faceあり2026-07-18

Dataset Distillation by Influence Matching

We revisit dataset distillation from an outcome-centric perspective. Rather than aligning process surrogates (

深層学習軽量化・量子化分類画像テキスト

用途: 分類
難易度: Easy
コスト: High

huggingfaceHugging Faceあり2026-07-18

Group Entropy-Controlled Policy Optimization

Entropy control has become an effective tool in reinforcement learning (RL) of large language models (LLMs), h

深層学習軽量化・量子化生成テキスト強化学習

用途: 生成
難易度: Easy
コスト: High

Scaling Limits of Constant-Stepsize SGD at Flat Minima

For stochastic gradient descent (SGD) with a constant stepsize $α$, the invariant law of the iterates, centere

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

An Efficient Likelihood Ratio Test for Online Changepoint Detection in the Presence of Autocorrelation

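Changes in time-series data need to be detected as the data stream arrives. To address this, the work proposes a likelihood ratio test for online changepoint detection in the presence of autocorrelation.

用途: Online changepoint detection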
難易度: Hard
コスト: Low

Deep and Probabilistic Models for Gene Regulatory Network Inference

Covers deep learning and probabilistic models for inferring gene regulatory networks.

用途: Gene regulatory network inference
難易度: Hard
コスト: High

Which Hyperparameters Matter? A Game-Theoretic Framework for Interpretable Hyperparameter Sensitivity Analysis

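Uses a game-theoretic framework to analyze interactions between hyperparameters and objective functions. The framework performs global sensitivity analysis with Shapley effects and uses the Pareto front

用途: Hyperparameter sensitivity analysis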
難易度: Hard
コスト: Medium

Aggregation of Statistical Evidence under Exchangeability

Proposes a framework for aggregating statistical evidence that stays consistent under exchangeability, aiming to improve the accuracy and power of the aggregated evidence.

用途: Aggregating statistical evidence
難易度: Hard
コスト: Medium

品質予測/異常検知深層学習Transformerテキスト

Retraining Seeks Stable Signals

Predictive models deployed at scale influence future data, a phenomenon called performativity. And there is al

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

Constrained Hebbian Learning Supports Efficient Representational Allocation under Structural Constraints

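Studies how constrained Hebbian learning shapes the connections between neurons while respecting structural constraints on the network.

深層学習Transformer分類画像音声

用途: Hebbian learning under structural constraints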
難易度: Hard
コスト: Low

Certifiable Safe Model-Based Reinforcement Learning with Control-Affine Dynamics Approximation

Safe model-based reinforcement learning (RL) often bridges control-theoretic analysis and RL for robots to saf

深層学習軽量化・量子化生成3D強化学習

用途: 生成
難易度: Hard
コスト: High

品質予測/異常検知深層学習RNN / LSTMテキスト音声

Back to the museum: Investigation of the acceptance of Android Andrea with and without emotion simulation in a museum

For a second time, the android robot Andrea was set up at a public museum in Germany for six consecutive days

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

PIXIE: A Zero-Shot texture-invariant 6D pose estimation framework for unseen objects with assembly defects

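The PIXIE framework performs 6D object pose estimation, enabling robot-hand control and object manipulation.

Deep learning, Transformer, Image, Text, 3D

Use: 6D object pose estimation
Difficulty: Hard
Cost: High

Deep learning, Transformer, Segmentation, Video, 3D

arxiv · GitHub available · 2026-07-17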

DPNeXt: A Lightweight Multi-Scale Feature Fusion Framework for Efficient ViT-Based Multi-Task Dense Prediction

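In robotic visual understanding, multi-task learning supports joint semantic segmentation and depth estimation. Vision foundation models (VFMs) are widely adopted as powerful feature encoders, but existing decoding strategies are a critical bottleneck

Use: 3D spatial understanding via multi-task learning for robotics
Difficulty: Hard
Cost: High

Sensor/Time series, Deep learning, Transformer, Segmentation, Image, Multimodal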

PRISM: Multimodal Terrain Mapping for Rover Navigation in Unstructured Environments

Robotic navigation in unstructured environments requires robust situational awareness to safely traverse hazar

Use: Segmentation
Difficulty: Hard
Cost: High

Quality prediction/Anomaly detection, Deep learning, Transformer, Generation, Video

FVAttn: Adaptive Sparse Attention with Runtime Load Balancing for Video Generation

Video Diffusion Transformers process long spatio-temporal sequences, making self-attention the main bottleneck

Use: Generation
Difficulty: Easy
Cost: High

CPU-friendly, Deep learning, Compression/Quantization, Multimodal, Reinforcement learning

JoyNexus: Service-Oriented Multi-Tenant Post-Training for VLA Models

The post-training of Vision-Language-Action (VLA) models is essential due to the diversity of simulators, robo

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

Loop the Loopies!

We present Loopie, the most powerful looped Transformer to date. The Loopie series consists of two Mixture-of-

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

RecGPT-V3 Technical Report

Large language models (LLMs) are transforming recommender systems from matching co-occurrence patterns in hist

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

Recursive Harness Self-Improvement

Under model--harness co-evolution, harnesses are not merely inference-time scaffolds but data-generating compo

Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Text

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

When Does Muon Help Agentic Reinforcement Learning?

Muon is competitive with AdamW in large-scale pre-training, but its value for reinforcement-learning (RL) post

Deep learning, Normalization/Optimization methods, Reinforcement learning

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

DSWorld: A Data Science World Model for Efficient Autonomous Agents

Despite strong capabilities in data understanding and decision-making, autonomous data science agents still he

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

Diffusion models recover accurate mixture weights despite score function insensitivity

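Studies how score-based generative models resolve modes, showing that accurate mixture weights can be recovered from generated samples even though the score function is insensitive to those weights.

Deep learning, Transformer, Generation, Multimodal

Use: Mode resolution in score-based generative models
Difficulty: Hard
Cost: High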

Prediction-Only Distillation in Linear and Logistic Regression

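Proposes and tests a prediction-only approach to distillation in linear and logistic regression.

Deep learning, Compression/Quantization, Classification, Regression, Anomaly detection

Use: Prediction-only distillation
Difficulty: Hard
Cost: High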

Optimal Self-Distillation for Rectified Flow via Linear Probing

Modern generative models are increasingly trained using model-generated signals, creating both opportunities f

Deep learning, Compression/Quantization, Generation, Image

Use: Model improvement
Difficulty: Hard
Cost: Medium

GAttNHP: Group Attention Neural Hawkes Process for Extrapolation Reasoning in Temporal Knowledge Graphs

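This study proposes a new framework for extrapolation (predicting future events) over temporal knowledge graphs. The framework combines group attention with a neural Hawkes process.

Deep learning, Transformer, Regression, Forecasting, Text

Use: Prediction on temporal knowledge graphs
Difficulty: Hard
Cost: Low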

What's in a Smoothness Constant? Tighter Rates for Local SGD with Bounded Second-order Heterogeneity

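This study proposes a new analysis aimed at more efficient distributed optimization. It analyzes the efficiency of Local SGD under realistic data heterogeneity and derives tighter convergence rates.

Use: Efficient distributed optimization
Difficulty: Hard
Cost: Low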

Sharp Stability Threshold and Certification for Designing Stable Residual Architectures

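Proposes a new principle for addressing stability problems in deep residual networks. The principle yields a stability threshold based on an exponential of an input quantity. This threshold involves the velocity field of each residual block and an exponential of the input

MI-oriented, Deep learning, Transformer

Use: Resolving stability problems
Difficulty: Hard
Cost: High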

Adaptive Runge-Kutta Step Control Buys Training Loss, Not Generalization: An Honest Compute-Matched Study of RK-Adam Optimizers

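This study compares optimizers that use adaptive Runge-Kutta step control against standard Adam under matched compute. As the title states, the step control improves training loss but not generalization

Use: Optimizer performance comparison
Difficulty: Hard
Cost: High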

Fast and Scalable Caputo Fractional Gradient Descent via Perturbation-Preserving Memory Compression

Fractional gradient descent (FGD) incorporates long-range memory through Caputo-type operators and has been sh

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

NeuronSoup: Evolving Asynchronous, Shared-Neuron Temporal Graphs without Backpropagation

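This study develops NeuronSoup, a method for learning temporal graphs built from shared neurons. In NeuronSoup, the signal on each path passes through a variable number of neurons and is propagated via shared neurons

Deep learning, Transformer, Classification, Generation

Use: Learning temporal graphs with shared neurons in neural networks
Difficulty: Hard
Cost: Low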

Confidence-based Ranking with Adaptive Sampling for Noisy Black-Box Optimisation

Real-world optimization problems often involve black-box functions and uncertainties in their evaluation, wide

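Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Low

MI-oriented, Quality prediction/Anomaly detection, Deep learning, Transformer, Classification, Detection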

Toward Energy-Efficient and Low-Power Arrhythmia Detection for Wearable Devices

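This study aims to develop a deep learning algorithm for analyzing electrocardiogram (ECG) recordings on wearable devices. Because the algorithm is energy-efficient and can be made compact, it supports the detection of heart disease

Use: Heart disease (arrhythmia) detection
Difficulty: Hard
Cost: Low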

Cross-Layer Error Compensation and Finite-Sample Feature-Statistics Matching for Extreme Low-Bit Quantization of Large Language Models

Layer-wise post-training quantization of large language models minimizes each layer's reconstruction error in

Deep learning, CNN, Text

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

All Games Have Equilibria

Research on Nash equilibrium existence for infinite games has grown into a patchwork of technical precondition

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Compensation Design

Proposes a new field called compensation design, aimed at reliable allocation of computing resources.

Use: Reliable allocation of computing resources
Difficulty: Hard
Cost: Medium

Multi-Turn On-Policy Distillation with Prefix Replay

We study on-policy distillation (OPD) for agentic tasks, where an LLM agent interacts with an environment over

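Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Detection, Image, Text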

Trajectory-aware Cross-view Geo-localization with Sequential Observations

Cross-view geo-localization matches ground-level observations against geo-tagged satellite imagery. Recent met

Use: Detection
Difficulty: Easy
Cost: High

Xiaomi-Robotics-1: Scaling Vision-Language-Action Models with over 100K Hours of Real-World Trajectories

We present Xiaomi-Robotics-1, a foundational vision-language-action (VLA) model capable of (1) following diver

Deep learning, Compression/Quantization, Generation, Text, Multimodal

Use: Generation
Difficulty: Easy
Cost: High

xHC: Expanded Hyper-Connections

Hyper-Connections (HC) expand the residual stream of Transformers into N parallel streams, providing a form of

Use: Generation
Difficulty: Easy
Cost: High

huggingface · GitHub available · Hugging Face available · 2026-07-16

On-Policy Delta Distillation

On-policy distillation is an alternative post-training method in reinforcement learning that alleviates the co

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

Beyond Entropy: Correctness-Aware Advantage Shaping via Contrastive Policy Optimization

Reinforcement learning with verifiable rewards (RLVR) commonly uses entropy for advantage shaping. However, en

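Deep learning, Compression/Quantization, Generation, Reinforcement learning

Use: Generation
Difficulty: Easy
Cost: Medium

Deep learning, Transformer, Multimodal, Self-supervised

github · GitHub available · 2026-07-16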

stable-pretraining — Reliable, minimal and scalable library for pretraining foundation and world models

A library for pretraining foundation and world models. It is minimal and scales seamlessly.

Use: Foundation model pretraining
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-16

TurboDiffusion — TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Accelerates video diffusion models by 100–200×.

Deep learning, Compression/Quantization, Generation, Video

Use: Video diffusion acceleration
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-16

pytorch-image-models — The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

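The largest collection of image encoders and backbones for PyTorch. It includes training, evaluation, inference, and export scripts along with pretrained weights.

Deep learning, Transformer, Classification, Image

Use: Image encoders and backbones for PyTorch
Difficulty: Easy
Cost: High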

Supervised Fine-Tuning vs. In-Context Learning: An Equilibrium Analysis of LLM Personalization under Congestion

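Personalizing large language models (LLMs) adapts a model to individual users, but when compute is limited one must choose between costly supervised fine-tuning and lightweight in-context

Deep learning, Compression/Quantization, Regression, Text

Use: LLM personalization strategy
Difficulty: Hard
Cost: High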

NeuralChaos: Optimal Adapted Approximation of Square Integrable Predictable Processes

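Approximating predictable processes is an important problem in fields such as mathematical finance, machine learning, control theory, and physics. The paper proposes a new NeuralChaos-based method for approximating such processes.

Use: Approximation of predictable processes
Difficulty: Hard
Cost: Medium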

Spectral Concentration and Recovery in Sparse High-Dimensional Random Geometric Graphs

We study sparse threshold random geometric graphs generated by high-dimensional spherical or Gaussian latent v

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Multimodal Empirical Bayes Variational Autoencoders for Joint Longitudinal and Time-to-Event Modeling

Longitudinal tumor measurements, dropout information, and genetic covariates provide complementary information

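Deep learning, Normalization/Optimization methods, Multimodal

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

Explainable, Quality prediction/Anomaly detection, Deep learning, Transformer, Generation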

A Temporal Machine Learning-Based Time-to-Event Model for Predicting ALS Progression and Healthcare Utilization

Amyotrophic lateral sclerosis (ALS) is a progressive and heterogeneous neurodegenerative disease in which pred

Use: Generation
Difficulty: Hard
Cost: Medium

Lipschitz Continuity in Deep Learning: A Systematic Review of Theoretical Foundations, Estimation Methods, Regularization Approaches, and Certifiable Robustness

Lipschitz continuity is a fundamental property of neural networks that characterizes their sensitivity to inpu

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

PiVoT: A Variational Solution for Real-time Large-scale Multi-object Detection and Tracking under Heavy Clutter

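Develops PiVoT, which detects and tracks multiple objects in difficult, heavily cluttered conditions, and presents it as a practical solution.

Deep learning, Compression/Quantization, Detection, Image, 3D

Use: Multi-object detection and tracking
Difficulty: Hard
Cost: High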

DAGR: State-Conditioned Goal Representations via Difference-Aware Goal Cross-Attention

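This study builds goal representations that depend on the current state. The authors replace a static goal representation with a state-conditioned one, so the goal is adjusted to the current situation.

Deep learning, Attention mechanism, Reinforcement learning

Use: State-conditioned goal representations
Difficulty: Hard
Cost: Low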

Gauge-Invariant, Parameter-Insensitive Regularization for Potential Recovery from Flow on Directed Graphs

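Aims to recover potentials from observed flows on directed graphs. The paper proposes a new regularization method for this recovery and confirms that it is effective.

Use: Recovering potentials from observed flows
Difficulty: Hard
Cost: Medium

Tabular-friendly, CPU-friendly, Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Classification, Regression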

Parallel gradient boosting for flexible estimation of conditional distributions

Boosting is one of the most successful learning techniques for standard classification and regression tasks. I

Use: Classification
Difficulty: Hard
Cost: High

Visual Place Recognition Using Rate-Encoded Spiking Neural Networks with Discrete STDP Learning

Spiking Neural Networks (SNNs) trained through unsupervised Spike-Timing-Dependent Plasticity (STDP) have been

Deep learning, Compression/Quantization, Classification, Image, Unsupervised

Use: Classification
Difficulty: Hard
Cost: Low

Generalised Reachability Games

We study two-player zero-sum turn-based games played on graphs with multiple reachability objectives called ge

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Tighter Bounds for the Random-Offerer Mechanism in Bilateral Trade

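Studies the random-offerer mechanism in bilateral trade, with the aim of determining the best price for the selected trader to offer when executing a trade.

Use: Random-offerer mechanism in bilateral trade
Difficulty: Hard
Cost: Medium

huggingface · Hugging Face available · 2026-07-15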

DiffGI: Differentiable Geometry Images for High-Fidelity Thin-Shell 3D Generation

Existing 3D generative models predominantly rely on implicit volumetric representations, which enforce waterti

Deep learning, Transformer, Generation, Image, 3D

Use: Generation
Difficulty: Easy
Cost: High

huggingface · Hugging Face available · 2026-07-15

Diagnosing and Calibrating Tool-Call Boundary Drift in Multi-Teacher On-Policy Distillation

Agentic language models must learn when to call tools, when to consume tool responses, and when to answer dire

Use: Generation
Difficulty: Easy
Cost: High

huggingface · Hugging Face available · 2026-07-15

VideoRAE: Taming Video Foundation Models for Generative Modeling via Representation Autoencoders

Video generative models commonly rely on latent spaces learned by 3D Variational Autoencoders (3D-VAEs). Howev

Use: Generation
Difficulty: Easy
Cost: High

Sharp Optimal Algorithm for Derivative-Free Stochastic Convex Optimization in One Dimension

Stochastic convex optimization is a classical problem with well-understood guarantees under first-order feedba

Use: Derivative-free stochastic convex optimization in one dimension
Difficulty: Hard
Cost: Medium

ANGLE: Angular Neural Generative Learning via Engression

Circular data, representing angles or directions, are frequently encountered in computer vision, biology, geol

Deep learning, Compression/Quantization, Generation, Regression, Image

Use: Generation
Difficulty: Hard
Cost: High

Contrast-Free ICA and Causal Inference via Wasserstein Distances to the Gaussian

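Uses the squared 2-Wasserstein distance to the Gaussian as the basis for contrast-free independent component analysis (ICA) and causal inference. It exploits a strict inequality between the Wasserstein non-Gaussianity of independent standardized sources and that of their linear combinations.

Use: Contrast-free ICA and causal inference
Difficulty: Hard
Cost: Medium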

MixCIT: A Kernel Based Local-Polynomial Debiased Test for Conditional Independence on Mixed-Type Data

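A method for testing conditional independence on mixed-type data. No unified, efficient, and statistically valid solution previously existed for mixed-type data; the paper proposes a kernel-based, local-polynomial debiased test.

Use: Conditional independence testing on mixed-type data
Difficulty: Hard
Cost: Medium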

Wasserstein gradient flows for Coulomb discrepancies

We study the long-time behavior of the Wasserstein gradient flow of the squared Maximum Mean Discrepancy (MMD)

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

What Does Goodness Measure? A Likelihood-Ratio Account of Forward-Forward Learning

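Interprets the goodness score in Forward-Forward learning through likelihood-ratio (log-ratio) estimation, giving a more precise account of what goodness measures.

Quality prediction/Anomaly detection, Deep learning, Normalization/Optimization methods, Generation

Use: Understanding goodness in Forward-Forward learning
Difficulty: Hard
Cost: Medium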

Fisher Rank Inflation: A Spectral Signature of Memorization under Label Noise

Uses Fisher rank inflation as a spectral signature of memorization under label noise.

Use: Detecting memorization under label noise
Difficulty: Hard
Cost: High

Statistical Properties and Power Analysis of Divergence Measures for Credit Risk Model Monitoring

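A method for monitoring credit risk models by detecting shifts in the distribution of financial data. Divergence measures such as the Jensen-Shannon divergence and the Kullback-Leibler divergence can detect different kinds of change

Use: Credit risk model monitoring
Difficulty: Hard
Cost: Medium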
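
For readers unfamiliar with the two divergence measures named in this entry, the following is a minimal sketch of how they can be computed on binned score distributions. It is not taken from the paper: the bin counts, the `eps` smoothing constant, and all function names are hypothetical and chosen only for illustration.

```python
import math

def to_distribution(counts):
    """Normalize raw bin counts into probabilities that sum to 1."""
    total = sum(counts)
    return [c / total for c in counts]

def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence KL(p || q) in nats over shared bins.

    eps is an assumed smoothing constant that keeps the result finite
    when a bin of q is empty.
    """
    return sum(pi * math.log((pi + eps) / (qi + eps))
               for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric and bounded by ln(2)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

# Hypothetical score-band counts at model development and in production.
reference = to_distribution([120, 300, 380, 150, 50])
current = to_distribution([60, 220, 400, 230, 90])

kl_forward = kl_divergence(reference, current)
kl_reverse = kl_divergence(current, reference)
js = js_divergence(reference, current)
print(f"KL(ref||cur)={kl_forward:.4f}  KL(cur||ref)={kl_reverse:.4f}  JS={js:.4f}")
```

KL is asymmetric, so the two directions generally differ, while JS gives a single symmetric value; this is one reason the measures can respond differently to the same shift.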

Cluster-Weighted EDMD

Extended Dynamic Mode Decomposition (EDMD) approximates Koopman operators from data, but a single global opera

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

A 32-channel event-based bio-signal analog front-end with adaptive delta and pulse frequency encoding

Low-power event-based Analog Front-Ends (AFEs) are essential for building efficient, end-to-end neuromorphic s

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Structured Fluctuations and the Information Dynamics of Self-Maintenance in Growing Neural Cellular Automata

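Growing Neural Cellular Automata (GNCA) show a robust capacity for self-maintenance and self-repair, but their internal dynamics are still poorly understood. What role internal fluctuations (small temporal variations in hidden-channel states) play in this capacity

Use: Improving the self-maintenance capability of neural cellular automata
Difficulty: Hard
Cost: Medium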

A new dual-population constrained multi-objective evolutionary optimization algorithm with repair constraint handling for structural optimization

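Structural optimization problems have many decision variables and non-convex feasible regions, so reaching the Pareto front requires many function evaluations. High-performing, efficient optimization algorithms are therefore needed.

Condition optimization, Deep learning, Compression/Quantization

Use: Efficient solution of structural optimization problems
Difficulty: Hard
Cost: Medium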

Stability Buys Time: A Re-Keying Game for Encrypted Multi-Agent Control

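In encrypted control systems, the cloud operates on homomorphically encrypted states and can coordinate the agents' behavior while preserving privacy. To ensure security while accounting for the risk of side-channel attacks, the controller is assumed to be trusted

Use: Encrypted control
Difficulty: Hard
Cost: Low

huggingface · Hugging Face available · 2026-07-14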

Color Pass-Through via Camera-Display Coupling

When a real-world scene is captured by a smartphone camera and viewed on its screen, the displayed image often

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: Low

huggingface · Hugging Face available · 2026-07-14

From Human-Centric to Agentic Code Review: The Impact of Different Generations of Generative AI Technology on Review Quality

Code review helps maintain software quality before code integration, but it also imposes a substantial workloa

Quality prediction/Anomaly detection, Deep learning, Transformer, Generation, Text

Use: Generation
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-14

OpenRLHF — An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

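OpenRLHF is a reinforcement learning framework built on Ray. It supports a range of reinforcement learning algorithms, including PPO, DAPO, and REINFORCE++.

Use: Reinforcement learning framework
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-14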

LakonLab — Official implementation of AsymFlow, pi-Flow, GMFlow

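LakonLab is an open-source project that provides the official implementations of AsymFlow, pi-Flow, and GMFlow.

Deep learning, Compression/Quantization, Generation, Image, Text

Use: Implementation of AsymFlow, pi-Flow, and GMFlow
Difficulty: Easy
Cost: Medium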

Learning the Graphical Nature of Symmetries

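Finite groups are highly distinctive algebraic structures, and Cayley graphs reveal their rich network structure, which helps in measuring, comparing, and learning group structure. The study collects groups of order less than $767$ and $131,406$ Cayley graphs, and for each group

Deep learning, Graph neural network

Use: Study of group theory and graph structure
Difficulty: Hard
Cost: Low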

Diversified Multinomial Logit Contextual Bandits

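Studies diversified multinomial logit (MNL) contextual bandits, a contextual bandit setting built on the MNL choice model.

Condition optimization, Deep learning, Compression/Quantization, Text

Use: MNL contextual bandits
Difficulty: Hard
Cost: Medium

Tabular-friendly, Deep learning, Transformer, Text, Tabular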

DAG-FM: A Foundation Model for Causal Discovery under Heterogeneous Causal Mechanisms

Proposes DAG-FM, a foundation model for causal discovery when the causal mechanisms are heterogeneous.

Use: Causal discovery
Difficulty: Hard
Cost: Low

CDFM: Towards a General-Purpose Causal Discovery Foundation Model

This study proposes the Causal Discovery Foundation Model (CDFM), which aims to recover the underlying causal structure from observational data.

Use: Causal discovery
Difficulty: Hard
Cost: High

Robust Subgroup Analysis for Heterogeneous Censored Data

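This study proposes a new subgroup analysis method for censored (incompletely observed) data. By combining the estimation of incompletely observed values with subgroup analysis, it yields reliable results on such data.

Use: Subgroup analysis
Difficulty: Hard
Cost: Medium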

Learning to control switching nonlinear systems with Koopman operator regression

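This study proposes a new approach that simplifies the application of control theory. Using the Koopman operator, it models a nonlinear system as a linear one, which makes control-theoretic tools easier to apply.

Deep learning, Transformer, Regression

Use: Control theory
Difficulty: Hard
Cost: Medium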
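
To illustrate the idea in this entry of treating a nonlinear system as a linear one via the Koopman operator, here is a minimal sketch on a textbook-style toy system. It does not reproduce the paper's method (no switching, no regression): the system, the parameters `LAM` and `MU`, and the hand-picked lifting `(x1, x2, x1**2)` are all assumptions chosen because this lifting happens to be exactly closed under the dynamics.

```python
LAM, MU = 0.9, 0.5  # hypothetical parameters of the toy system

def nonlinear_step(x1, x2):
    """One step of the nonlinear system (note the x1**2 term)."""
    return LAM * x1, MU * x2 + (LAM**2 - MU) * x1**2

def lift(x1, x2):
    """Lift the state to the observables (x1, x2, x1**2)."""
    return [x1, x2, x1**2]

# In lifted coordinates the same dynamics are linear: z_next = K @ z.
K = [
    [LAM, 0.0, 0.0],
    [0.0, MU, LAM**2 - MU],
    [0.0, 0.0, LAM**2],
]

def linear_step(z):
    """Apply the 3x3 matrix K to the lifted state z."""
    return [sum(K[i][j] * z[j] for j in range(3)) for i in range(3)]

def max_lifting_error(x1, x2, steps=20):
    """Largest gap between the nonlinear rollout and the lifted linear one."""
    z = lift(x1, x2)
    worst = 0.0
    for _ in range(steps):
        x1, x2 = nonlinear_step(x1, x2)
        z = linear_step(z)
        worst = max(worst, abs(z[0] - x1), abs(z[1] - x2))
    return worst

error = max_lifting_error(1.5, -0.7)
print(f"max deviation over rollout: {error:.2e}")
```

Here the matrix `K` is written down by hand; in data-driven settings it would instead be estimated from trajectories, and the lifting is usually only approximately closed.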

Sensor/Time series, Deep learning, RNN / LSTM, Regression, Forecasting, Time series

Long-Memory Reservoir Computing for Data-Scarce Dengue Forecasting

Applies long-memory reservoir computing to dengue forecasting when data are scarce.

Use: Dengue forecasting
Difficulty: Hard
Cost: High

NeuroMem-FHP: A Likelihood-Free Deep Learning Framework for Parameter Estimation of Fractional Hawkes Process

In this paper, we propose deep learning based NeuroMem-FHP framework for estimating the parameters of the frac

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Rank-Conditioned Sample Reuse for the Plackett--Luce Best-of-$K$ Objective

We study the coupled objective J_K^WOR = E_{S ~ PL-WOR_K}[max_{i in S} R_i]: the expected maximum reward of a

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Actor-Critic Learning for Extended Mean Field Control with Deterministic Policies

This paper develops a model-free reinforcement learning framework for continuous--time extended mean field con

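Deep learning, Transformer, Reinforcement learning

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Tabular-friendly, Quality prediction/Anomaly detection, Deep learning, Transformer, Detection, Tabular, Reinforcement learning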

Transformer-Guided Swarm Intelligence for Frugal Neural Architecture Search

This study develops a method to cut the cost of conventional neural architecture search (NAS). The method uses a Transformer to guide the search.

Use: Reducing the cost of neural architecture search (NAS)
Difficulty: Hard
Cost: Low

Event-based Neural Decoding for Neuroprosthetic Motor Control

A substantial number of patients experience diminished mobility due to disabilities, diseases, or accidents. A

Quality prediction/Anomaly detection, Deep learning, RNN / LSTM

Use: Motor control
Difficulty: Hard
Cost: High

Efficient and Robust Spiking Neural Networks for sEMG-Based Muscle Fatigue Detection

Detecting muscle fatigue via surface electromyography (sEMG) is essential for applications in sports, rehabili

Use: Detection
Difficulty: Hard
Cost: High

Paradoxes of Game Theoretic Equilibria and Price of Anarchy

This study develops methods for understanding game-theoretic equilibria.

Use: Methods for understanding game-theoretic equilibria
Difficulty: Hard
Cost: Low

Efficient Online Proportional Sampling with Applications to Smoothed Online Learning

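This study proposes a new algorithm that speeds up online proportional sampling. By exploiting the partition structure of the objects, it builds an efficient data structure for fast online sampling.

Use: Online proportional sampling
Difficulty: Hard
Cost: Medium

huggingface · Hugging Face available · 2026-07-13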

Qwen-Music Technical Report

In this report, we introduce Qwen-Music, a powerful music generation model capable of producing highly musical

Sensor/Time series, Quality prediction/Anomaly detection, Deep learning, Transformer, Generation, Text, Audio

Use: Generation
Difficulty: Easy
Cost: High

Fast Whole-Brain, Geometry-Aware Functional Alignment for Cross-Subject Decoding

Decoding brain activity is useful for characterizing brain processes and understanding the functional architec

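Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

Tabular-friendly, Explainable, MI-oriented, Quality prediction/Anomaly detection, Deep learning, Transformer, Tabular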

Incremental Transformer for Surrogate-Based Inverse Design of Geopolymer Mixtures

Small-data inverse design is challenging in engineering informatics when observations are heterogeneous, mixed

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Low

arxiv · GitHub available · 2026-07-12

Beyond Looking Up, Try Looking Around: Harmonizing Global Structure and Local Consistency in Optimal Transport for Short Text Clustering

Pseudo-labeling based on Optimal Transport (OT) has become an effective mechanism for enhancing short text clu

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: Medium

Representation Learning for Semiparametric Causal Mediation Analysis under No Essential Heterogeneity

We propose a two-stage estimator for structural mediation parameters that combines deep representation learnin

Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Embedding

Use: Embedding
Difficulty: Hard
Cost: Low

LayerNorm as Implicit Gain Control in Looped Transformers

In pre-LayerNorm looped transformers, LayerNorm inside the recurrent block acts as an implicit gain controller

CPU-friendly, Deep learning, Transformer

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

huggingface · Hugging Face available · 2026-07-12

Predictive Divergence Masks for LLM RL

Reinforcement learning for large language models (LLMs) typically relies on trust-region masks to stabilize of

Deep learning, Compression/Quantization, Text, Reinforcement learning

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

TSCoNet: A Two-Stage Copula CNN-LSTM for Uncertainty-Aware Spatio-Temporal Forecasting

Reliable forecasting of several interrelated environmental variables - such as regional precipitation and temp

Deep learning, CNN, Forecasting

Use: Forecasting
Difficulty: Hard
Cost: Low

The Differential Neural Tangent Kernel and Its Positivity

The Neural Tangent Kernel (NTK) is one powerful tool for analyzing the training dynamics of neural networks in

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

Energy-guided Recursive Model

Recursive reasoning models address structured problems by repeatedly updating latent states of small neural ne

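Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Low

Explainable, Sensor/Time series, Deep learning, Transformer, Anomaly detection, Embedding, Time series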

Emergent Generalization by Representation Learning in Artificial Neural Networks

Dimensionality reduction has proven powerful for identifying neural manifolds, which are low-dimensional struc

Use: Anomaly detection
Difficulty: Hard
Cost: Low

huggingface · Hugging Face available · 2026-07-11

GigaAM Multilingual: Foundation Model for Underrepresented Languages

Despite recent scaling successes, multilingual ASR performance remains highly uneven, with long-tail languages

Deep learning, Transformer, Audio

Use: Technical validation / paper-reading aid
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-11

LLMs-from-scratch — Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Use: Implementing a ChatGPT-like LLM from scratch in PyTorch
Difficulty: Easy
Cost: High

github · GitHub available · 2026-07-11

fastai — The fastai deep learning library

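fastai is a deep learning library.

Use: Machine learning library
Difficulty: Easy
Cost: Medium

Explainable, Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Regression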

Dynamic Frechet Regression with Feature Selection for Distributional Data

Many scientific and engineering applications generate responses that are not scalars or vectors, but statistic

Use: Regression
Difficulty: Hard
Cost: Medium

Neural Collapse Is Forbidden: Information Floors in Language Models

Within-class variance in language-model representations is commonly read as incomplete neural collapse. We arg

Deep learning, Normalization/Optimization methods, Text

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

Adaptive Search in Collatz Exponent-Code Space via 2-adic and 3-adic Constraints

We study a symbolic search space for the Collatz conjecture based on finite exponent codes of the accelerated

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Explainable, CPU-friendly, Deep learning, Transformer, Reinforcement learning

A Symbolic Neural CPU for Quantization-Simulated Writeback and Interpretable Program Execution

Neural networks can learn algorithmic input-output mappings, but trusting a learned executor requires more tha

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Remembering Distinct Items, Not Tokens: A Learnable Dirichlet-Process Cache Between State-Space Models and Attention

Fixed-state sequence models compress an unbounded past into a bounded state, which caps their associative reca

Deep learning, RNN / LSTM, Text

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Foveation-Guided Dynamic Token Selection for Robust and Efficient Vision Transformers

The human visual system (HVS) employs foveated sampling and eye movements to achieve efficient perception, con

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Low

Interference and Retention in Continual Learning

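Analyzes which information is retained during continual learning. By examining the patterns in which a model retains what it has learned, the work develops a new method for improving generalization.

Use: Analysis of information retained during learning
Difficulty: Hard
Cost: Low

Deep learning, Transformer, Classification, Detection, Segmentation

github · GitHub available · 2026-07-10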

pytorch-grad-cam — Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

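This library provides advanced AI explainability and visualization for computer vision. It supports CNNs, Vision Transformers, classification, object detection, segmentation, image similarity, and other computer vision

Use: AI explainability and visualization
Difficulty: Easy
Cost: Low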

AlphaZero in Sparsely Rewarded Games: Limits and Auxiliary Supervision

AlphaZero has demonstrated that a neural-guided Monte Carlo Tree Search can achieve superhuman performance, bu

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Algorithmic Expert Aggregation

A framework that aggregates the predictions of multiple experts to produce a more accurate prediction.

Use: Aggregating expert predictions
Difficulty: Hard
Cost: Medium

Quota Marketplace: Dynamic Pricing for Efficient Allocation of ML Training Resources

The escalating demand for Machine Learning (ML) training resources in recent years has resulted in a substanti

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

From Rules to Nash Equilibria: A Lean 4 Case Study in Game-Theoretic Analysis of a Competitive Trading Card Game

A framework for carrying out metagame analysis of the Pokémon Trading Card Game mechanically.

Use: Metagame analysis
Difficulty: Hard
Cost: Low

Provably Optimal Learning Algorithms for Assistance Games

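This paper studies an online version of assistance games, in which a human and an assistant cooperate to solve a task. In this setting, the human can observe the state of the world, while the assistant

Use: Human-assistant cooperation on tasks
Difficulty: Hard
Cost: Medium

Quality prediction/Anomaly detection, Deep learning, Transformer, Image, Text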

Social-spatial dependencies for learning visual navigation

Proposes social-spatial dependencies, a new framework for predicting social behavior that improves individual agents' ability to learn from social signals.

Use: Predicting social behavior
Difficulty: Hard
Cost: Low

Single-Entity Spiking Neuron Models: Survey

A survey of the characteristics and classification of single-entity spiking neuron models.

Use: Simulation of neural systems
Difficulty: Hard
Cost: Medium

Dynamic neural manifolds for flexible closed-loop control on neuromorphic hardware

Realizes dynamic neural manifolds by coupling a brain model with a neuromorphic chip (apparently SpiNNaker2).

Use: Flexible closed-loop control
Difficulty: Hard
Cost: Medium

Size independence of consistency index for pairwise comparison matrices in analytic hierarchy process

Pairwise comparisons are fundamental in the analytic hierarchy process. Various consistency indices have been

Use: Pairwise comparisons in AHP
Difficulty: Hard
Cost: Medium

What Semivalues Cannot See: The Information Content of Anonymous Marginal Values

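Semivalues attempt to balance unequal allocations in a game. The study asks what underlying structure they capture, and finds that under semivalues all players hold equivalent information.

Use: Limitations of semivalues
Difficulty: Hard
Cost: Medium

huggingface · Hugging Face available · 2026-07-08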

DeepSearch-World: Self-Distillation for Deep Search Agents in a Verifiable Environment

Training tool-use agents to improve from their own experience remains challenging, as supervised fine-tuning r

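Deep learning, Compression/Quantization, Generation, Reinforcement learning

Use: Generation
Difficulty: Easy
Cost: High

Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Image, Text, 3D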

Do You Remember? Toward Memory-Centric Multimodal AI

Human memory is reconstructive, not a faithful recording. Current multimodal LLMs (MLLMs) lack this capability

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: High

An Introduction and Tutorial for the Beagle Framework

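The Beagle framework is a GPU-based genetic programming framework that uses NVIDIA GPUs to run genetic programming search efficiently. This tutorial introduces the Beagle framework

Deep learning, Compression/Quantization, Regression

Use: Symbolic regression problems
Difficulty: Hard
Cost: Low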

A Hardware-Aware Open-Source Framework for Design Space Exploration of Mixed-Signal Spiking Neural Networks

Proposes a framework for designing and simulating spiking neural networks.

Use: Hardware-aware spiking neural networks
Difficulty: Hard
Cost: High

Scalable Perturbation Learning for Online Self-Supervised Echo State Networks

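Beyond solving tasks, autonomous systems under real-world constraints

Deep learning, Compression/Quantization, Supervised, Self-supervised

Use: Online self-supervised learning
Difficulty: Hard
Cost: Medium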

A Gold-Standard Study of What Makes a Lightweight Game-Playing Agent Strong

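Studies what makes a lightweight game-playing agent strong, with a particular interest in card games.

Deep learning, CNN, Text, Reinforcement learning

Use: Identifying what makes a game-playing agent win
Difficulty: Hard
Cost: High

arxiv · GitHub available · 2026-07-07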

FootsiesGym: A Fighting Game Benchmark for Two-Player Zero-Sum Imperfect-Information Games

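Addresses imperfect-information play in the neutral game of fighting games and develops FootsiesGym, an open-source environment for imperfect-information games.

Use: Fighting game environment
Difficulty: Hard
Cost: High

Sensor/Time series, Deep learning, Compression/Quantization, Detection, Generation, Reinforcement learning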

6G Sensing Security: Distributed Game-Theoretic RL for Urban Beamforming and Attacker Detection

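Studies 6G security in next-generation wireless networks using distributed game theory. Distributed game theory is needed for 6G communication systems to perform both environment sensing and data transmission

Use: Distributed game theory for 6G
Difficulty: Hard
Cost: Medium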

Contextual Procurement Auctions with Bandit Learning

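When buyers share information, negotiation can become harder. To address this, the paper presents a new approach for reducing the risk that contributors face in sharing information, "contextual auction with bandit learni

Explainable, Deep learning, Compression/Quantization, Text

Use: Contextual procurement auctions
Difficulty: Hard
Cost: Low

huggingface · Hugging Face available · 2026-07-07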

UI2App: Benchmarking Visual Interaction Inference in Executable Web Application Generation

Large language models (LLMs) have demonstrated growing competence in web page generation. However, existing te

Use: Generation
Difficulty: Easy
Cost: High

Explainable, Sensor/Time series, Deep learning, Transformer, Detection, Image

An event-driven framework for fly-inspired visual motion detection

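Proposes a new fly-inspired approach that combines event-based sensing with biological inspiration to detect obstacles.

Use: Obstacle detection in dynamic environments using event-based sensing and fly-inspired motion detection
Difficulty: Hard
Cost: High

Small-data-friendly, CPU-friendly, Condition optimization, Deep learning, Compression/Quantization, Generation, Text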

LLM-Driven Evolutionary Generation of Multi-Objective Bayesian Optimization Algorithms

Designing effective multi-objective Bayesian optimization (MOBO) algorithms requires balancing many interdepen

Use: Generation
Difficulty: Hard
Cost: High

Quality prediction/Anomaly detection, Deep learning, Compression/Quantization, Generation, Text

QDEvo: A Multi-Objective Quality-Diversity Framework for Automated Heuristic Design

The integration of Large Language Models (LLMs) with evolutionary computation has emerged as a powerful paradi

Use: Generation
Difficulty: Hard
Cost: High

Heaviside Continuity of Rolling Coefficients for Eliminating Epistemic Entropy in Large Language Models

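This study proposes accounting for Heaviside discontinuities in order to verify the inference process. This makes it possible to detect latent mistakes during inference and then generate correct output.

Use: Verifying inference in large language models
Difficulty: Hard
Cost: High

arxiv · Paper only · 2026-07-05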

Neuromorphic Silicon Neuron Controller for Adaptive Deep Brain Stimulation in Parkinson's Disease

Parkinson's disease (PD) affects millions worldwide and causes severe motor symptoms. Adaptive deep brain stim

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Low

arxiv · Paper only · 2026-07-05

Burst Spiking Neural Networks

A central goal of current Spiking Neural Network (SNN) research is to improve their accuracy toward becoming l

Deep learning, CNN, Image

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

arxiv · Paper only · 2026-07-05

Beyond Self-Resolution: Settlement Factorization for Robust Natural Language Mechanism

Language models increasingly mediate paid advice: agents submit open-ended forecasts, recommendations, plans,

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

huggingface · GitHub available · Hugging Face available · 2026-07-05

Benchmarking Sensor Robustness in Plasma Diagnostic Models: A Systematic Evaluation on TokaMark

Plasma diagnostic models for tokamak fusion devices are almost universally evaluated on clean, complete sensor

Tabular-friendly, CPU-friendly, Sensor/Time series, Deep learning, Transformer, Detection

Use: Detection
Difficulty: Easy
Cost: Medium

Life as Plasmas: Autonomy and Interactivism in-materio

When is a material system a candidate for life at all? We argue that this question is prior to behavior, funct

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

SeqGPT: A Constrained Transformer Agent for the Inverse Design of Multi-Panel Composite Structures

Optimizing composite stacking sequences to match continuous targets (e.g., Lamination or Buckling Parameters)

Use: Technical validation / paper-reading aid
Difficulty: Hard
Cost: Medium

Microcosmos: Reimagining Artificial Life for the GPU Era

Most artificial life simulators either operate on abstract substrates disconnected from physical reality, or s

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

An Asymptotic Analysis of the Shapley Value for Dataset Valuation

We propose an asymptotic analysis of the Shapley value in a dataset valuation setting in which utilities are m

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

A Spiking Sequence Generator for Polar Trajectories on Neuromorphic Hardware

Neuromorphic controllers for size, weight, and power-constrained systems require neural architectures that are

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Stable Self-Modulating Quantum Fast-Weight Programmers with Bounded Memory Gates

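To stabilize the quantum fast-weight programmers (QFWP) used in quantum AI, this paper presents a new framework that introduces a constrained previous-state modulator into the QFWP, "Stable Self-Modulating Qua

Deep learning, Transformer, Prediction

Use case: Improving the stability of fast-weight programmers in quantum AI
Difficulty: Hard
Cost: Low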

Dendritic In-Context Learning in a Single-Layer Spiking Neural Network

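In-context learning (ICL) is implicit gradient descent embedded inside the forward pass of modern AI architectures. ICL in a biologically plausible Spiking Neural N

Use case: In-context learning
Difficulty: Hard
Cost: High

Suited to tabular data, Easy to try on CPU, Deep learning, Transformer, Classification, Detection, Regression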

Predicting Early Stages Of Alzheimer's Disease And Identifying Key Biomarkers Using Deep Artificial Neural Network And Ensemble Of Machine Learning Methodologies

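This study applies AI techniques to early-stage diagnosis of Alzheimer's disease and to identifying biomarkers. Electroencephalogram (EEG) data is used to train the AI models and improve accuracy. It also analyzes the information the AI models obtain

Use case: Early-stage diagnosis of Alzheimer's disease and biomarker identification
Difficulty: Hard
Cost: High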

Electronic Bursting Neuron: design, equations and hardware implementation

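SNNs are neural networks that mimic the behavior of the brain. Building an SNN requires designing electronic neurons; many approaches have been considered, but the designs are complex and difficult, and implementation is not easy. This study

Use case: Spiking neural networks (SNNs)
Difficulty: Hard
Cost: Medium

Quality prediction/anomaly detection, Deep learning, Transformer, Generation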

Evolutionary Wave Function Collapse

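Wave Function Collapse (WFC) is a popular method for procedural content generation that learns local adjacency constraints from an example input and generates larger outputs. Combining WFC with evolutionary search makes it possible to evaluate the generated levels

Use case: Evolutionary Wave Function Collapse for procedural content generation
Difficulty: Hard
Cost: Medium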

Mechanism and Stability Analysis of Metabolic Closed-Loop Metaheuristics

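This paper studies the framework-level interpretation of metabolic metaheuristic systems. It centers on the question of whether the resource-loop interpretation of such systems exists at the framework level as well, rather than only as a symbolic device for narrative

Use case: Stability analysis of metabolic closed-loop metaheuristics
Difficulty: Hard
Cost: Medium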

On the Cost of Non-Adaptivity in Matroid Prophet Inequalities

Matroid prophet inequalities admit an optimal 2-competitive algorithm, which relies on adaptively updating thr

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Epistemic Horizon Minority Games: When Abundance Reduces Strategic Value

Strategic value can fall when an option becomes visible. A route, signal, bet, or opportunity may be attractiv

Deep learning, Transformer, Classification, Image

Use case: Classification
Difficulty: Hard
Cost: Low

Congestion-Based Slot Pricing in a Railway Auction Game

Presents an approach to the slot pricing problem in a railway auction game.

Use case: Slot pricing in a railway auction game
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-07-01

BFF: Simple explanations for complex phenomena

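The "Computational Life" paper showed that self-replicators can be readily found in complex systems of pairwise interactions. Here, by contrast, we propose a new approach to detecting self-replicators using a simple genetic mutation walk, and this method

Deep learning, Detection

Use case: Detecting self-replicators
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-07-01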

Towards transferable lightweight neuromorphic computing through a model-free temporal-switch framework

Lightweight neuromorphic computing offers a promising route to efficient AI, with particular benefits for reso

Deep learning, Model compression/quantization, Classification

Use case: Classification
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-07-01

Towards Learning Representations of Policies in Two-Player Zero-Sum Imperfect-Information Games

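This work addresses policy representation learning in zero-sum games and proposes methods for generating and evaluating policy representations.

Deep learning, Transformer, Supervised, Self-supervised

Use case: Policy representation learning in zero-sum games
Difficulty: Easy
Cost: Low

Suited to MI, Sensor/time series, Deep learning, Transformer, Prediction, Time series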

EVOTS: Evolutionary Transformer Search for Time Series Forecasting

Evolutionary neural architecture design for multivariate time-series forecasting remains underexplored, with m

Use case: Prediction
Difficulty: Hard
Cost: High

Robustness of neural networks to random noise perturbations of their inputs

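This study evaluates the robustness of neural networks by adding random noise to their inputs, and investigates the relationship between robustness and accuracy.

Use case: Ensuring robustness in image recognition
Difficulty: Hard
Cost: Medium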

Data Sharing and Competition in Learning-by-Deploying Industries: Insights from Robotics and Beyond

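Analyzes data sharing and competition from an economic perspective. The study considers whether competition decreases or increases when firms share data, and analyzes the relationship between data sharing and competition.

Use case: Economic analysis of data sharing and competition
Difficulty: Hard
Cost: Low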

Incentivizing Data Trading via Profit Reallocation

Promoting data trading in data markets. This study develops economic incentives to encourage data trading.

Use case: Promoting data trading in data markets
Difficulty: Hard
Cost: High

Minimal MMAO: A Resource-Closed-Loop Framework for Adaptive Metaheuristic Search

This paper presents the Metabolic Multi-Agent Optimizer (MMAO) as an adaptive metaheuristic built around endog

Use case: Automatic tuning of metaheuristics
Difficulty: Hard
Cost: Medium

Evolutionary Hyperparameter Optimization to Find Lightweight CNN Models for Autonomous Steering

This research investigates the optimization of Convolutional and Dense Neural Networks (CNNs and DNNs) for aut

Deep learning, CNN, Image

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

A Tunable Incentive Mechanism for Binary Aggregation Without Verification

Binary aggregation without verifiable ground truth arises when agents' reports must be aggregated without acce

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Submission Responsibility Matters: Role-Aware Submission Quotas under Coauthorship

Author-level submission quotas are increasingly used to control growing peer-review load. Recent coauthorship-

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Theory of Continual Learning Against Data Poisoning Attacks

Continual learning (CL), where a model is trained on a sequence of data tasks, is increasingly being adopted a

Deep learning, Transformer, Classification, Image, Text

Use case: Classification
Difficulty: Hard
Cost: High

github, GitHub available, 2026-06-29

HunyuanVideo — HunyuanVideo: A Systematic Framework For Large Video Generation Model

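Develops HunyuanVideo, a video generation model. HunyuanVideo is capable of generating complex sequences.

Deep learning, Transformer, Generation, Video

Use case: Applications to video generation models
Difficulty: Easy
Cost: High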

Geometric Stability of Neural Population Codes: Regional Variation, Behavioral Relevance, and Circuit Dependence

Current models of representational reliability in neural populations focus on temporal stability: whether popu

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Supervised Hebbian learning in Deep Counterstream Associative Networks

Modern machine learning applications employ deep neural networks training with the error backpropagation algor

Use case: Classification
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-28

When LLMs Develop Languages: Symbolic Communication for Efficient Multi-Agent Reasoning

Chain-of-Thought (CoT) improves large language models (LLMs) on difficult reasoning tasks, but it often incurs

Suited to MI, Deep learning, Model compression/quantization, Text

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

How Much Due Diligence Before You Bid? Learning in Intractable Takeover Auctions

When two companies bid to buy the same target, no one knows exactly what the target is worth. Each bidder pays

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Optimism as a Vulnerability: Deceptive Stackelberg Control of UCB Bandit Followers

Upper Confidence Bound (UCB) algorithms guarantee sublinear regret for agents learning unknown stochastic envi

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Unified Complex-valued Neural Network: A Magnitude-Phase Computational Model for Event-Driven Neuromorphic Learning

Artificial neural networks (ANN) provide accurate continuous-valued representation, whereas spiking neural net

Explainable, Deep learning, CNN, Generation

Use case: Generation
Difficulty: Hard
Cost: High

Road to scalability for efficient graph search on massively parallel neuromorphic hardware

Efficient computation of shortest paths in weighted graphs is a fundamental problem with many applications. Ne

Easy to try on CPU, Deep learning, Model compression/quantization

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

The Game Changer Problem: Controlling Equilibria with Discrete Rewards

We introduce the game changer problem, where an external designer modifies a game's reward matrix to make a ta

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Pure Nash Equilibria under the Affine Mechanism: A Potential Game of Exaggeration

The mean mechanism is known to be non-incentive-compatible, namely, rational players are incentivized to misre

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Exit-and-Join Dynamics and Equilibrium in Continuum Cooperative Games

This paper develops a continuum theory of exit-and-join coalition dynamics in nonatomic cooperative games. We

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Neuromorphic Energy-Aware Learning for Adaptive Deep Brain Stimulation

Neuromorphic and edge computing research has focused on reducing the inference cost of neural network controll

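Deep learning, Model compression/quantization, Audio, Reinforcement learning

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Explainable, Quality prediction/anomaly detection, Deep learning, Transformer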

Comparing Scalar Objective Functions for Multi-Criteria Engineering Optimization

Scalar objective functions are required when a multi-criteria optimization problem must yield a single preferr

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

MMAO: A Metabolic Multi-Agent Optimizer with Endogenous Resource Allocation for Continuous and Discrete Optimization

Traditional meta-heuristics often rely on fixed population sizes, manually chosen search scales, and externall

Sensor/time series, Deep learning, Model compression/quantization, Text

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Heterogeneous synaptic motifs bridge microscale structure and macroscale nonlinear dynamics

Recent breakthroughs in synaptic-resolution network connectomics have revealed that brain circuits feature fin

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Co-Optimization of Analog Kolmogorov-Arnold Networks for Low-Power Function Approximation in Flexible Electronics

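Wearable devices and IoT sensors require signal processing and sensor calibration, which involve computation with nonlinear activation functions and perceptrons.

Sensor/time series, Deep learning, Model compression/quantization

Use case: Developing low-power function approximation methods for electronics
Difficulty: Hard
Cost: High

Explainable, Quality prediction/anomaly detection, Deep learning, Normalization/optimization methods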

Criticality-Constrained Iterative Pruning for Energy-Efficient Spiking Neural Networks via Combined Importance Scoring

Deploying spiking neural networks (SNNs) on neuromorphic hardware demands aggressive synaptic pruning while pr

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

CANNs: A Toolkit for Research on Continuous Attractor Neural Networks

Develops a simulator dedicated to CANN (Continuous Attractor Neural Network) research in order to support that research.

Use case: Supporting CANN research
Difficulty: Hard
Cost: Low

DE-2LS: Differential Evolution with Lightweight Late Local Search for Constrained Numerical Optimization

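Introduces DE-2LS, an algorithm suited to constrained single-objective numerical optimization.

Use case: Developing an algorithm for constrained single-objective numerical optimization
Difficulty: Hard
Cost: Medium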

DE-2LS: Differential Evolution with Late-Stage local-search for Unconstrained Single-Objective Numerical Optimization

Introduces DE-2LS as an algorithm for single-objective numerical optimization.

Use case: Developing an algorithm for single-objective numerical optimization
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-25

Multi-Objective Molecular Generation with Frequency-Controlled Evolutionary Dynamics

Molecule generation methods that leverage generative models have been successfully applied to drug discovery.

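Explainable, Suited to MI, Quality prediction/anomaly detection, Deep learning, Model compression/quantization, Generation

Use case: Generation
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-25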

CARVE: Content-Aware Recurrent with Value Efficiency for Chunk-Parallel Linear Attention

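Proposes efficient memory management for recurrent models, in which erasing memory must coexist with writing new data to memory.

Quality prediction/anomaly detection, Deep learning, Transformer, Text

Use case: Efficient memory management for recurrent models
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-25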

Parametric Open Source Games

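Open-source games involve agents whose actions depend on the players' decision procedures. The paper proposes a parametric model of open-source games and establishes a theoretical framework for spontaneous gradients.

Use case: Analysis of open-source games
Difficulty: Hard
Cost: Medium

github, GitHub available, 2026-06-25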

ai-engineering-from-scratch — Learn it. Build it. Ship it for others.

A repository for learning AI engineering from scratch by building and shipping, as its tagline describes.

Use case: Learning resource for AI engineering
Difficulty: Easy
Cost: Medium

github, GitHub available, 2026-06-25

ml-mdm — Train high-quality text-to-image diffusion models in a data & compute efficient manner

Train high-quality text-to-image diffusion models in a data & compute efficient manner

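Use case: Generation
Difficulty: Easy
Cost: High

Explainable, Sensor/time series, Quality prediction/anomaly detection, Deep learning, Model compression/quantization, Audio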

What Does a Pathological Speech Assessment Model Know about Acoustic Features? A Case Study on Oral and Oropharyngeal Cancer Patients

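This study examines what a pathological speech assessment model knows about acoustic features, using oral and oropharyngeal cancer patients as a case study.

Use case: Pathological speech assessment
Difficulty: Hard
Cost: Low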

Adaptive Enhanced Quantum-inspired Simulated Bifurcation Algorithm for Population State Perception

Existing quantum-inspired simulated bifurcation algorithms rely on dynamic scheduling methods but lack the abi

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Strong duality for the GROW criterion

This paper presents general strong duality results when testing hypotheses by betting against them. A bet is a

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Sphere of Influence Centrality via Shapley Values: Empirical Approximation and Network Coverage Analysis

Node centrality is a fundamental problem in network analysis, yet classical metrics fail to capture the collec

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Identifying structural design principles shaping the computational abilities of recurrent neural networks

Understanding how the architecture of neural networks shapes the computations they carry is a central challeng

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

It's Much Easier for Neural Networks to learn Game of Life Dynamics with the Right Activation Function: Polynomial Kolmogorov-Arnold Networks

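Investigates how neural networks learn Game of Life dynamics and finds that using a particular activation function makes learning more effective.

Explainable, Deep learning

Use case: Learning Game of Life dynamics
Difficulty: Hard
Cost: Medium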

Local Pheromone Network: Sparse Local Learning with Multi-Scale Synaptic Trails, Consolidation, and Replay

Backpropagation-trained dense neural networks are powerful function approximators, but they couple learning ac

Deep learning, Transformer, Regression, Text

Use case: Regression
Difficulty: Hard
Cost: High

Self-Modulating Quantum Fast-Weight Programmers for Efficient Adaptive Sequential Learning

Recent advances in quantum machine learning have motivated efficient models for sequential data processing. In

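Sensor/time series, Deep learning, Model compression/quantization, Time series

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low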

Recursive QLSTM with Dynamic Variational Quantum Circuit Adaptation

Recent advances in quantum computing and machine learning have motivated the development of quantum models for

Sensor/time series, Deep learning, RNN / LSTM, Time series

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Evolutionary Optimization Reveals Structural Constraints on Reservoir Architecture for Spatiotemporal Chaos

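This study proposes an evolutionary optimization approach for reservoir computing aimed at predicting spatiotemporal chaos. This makes it possible to evolve the structure of the reservoir itself

Use case: Reservoir computing
Difficulty: Hard
Cost: Medium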

Distributionally Robust Joint Information and Mechanism Design for Multi-Area Power System Coordination

We study a continuous-time stochastic Stackelberg control problem in which a leader steers a system of strateg

Suited to small data, Deep learning, Transformer

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Neural Parameter Calibration for Finite-State Mean Field Games

Mean field games efficiently approximate a very large population of strategic agents. While these games can ai

Use case: Learning the parameters of mean field games
Difficulty: Hard
Cost: Medium

YUKTI: From Natural-Language Situations to Robust, Verifiable Decisions An Uncertainty-Typed Proposition IR, Assumption-Robust Pareto Frontiers, and a Regret Certificate

Language models turn a worded situation into a numeric plan, and the dominant pipelines (NL4Opt, OptiMUS, ORLM

Deep learning, Model compression/quantization, Text, Audio

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Measuring Behavior Portability in Large Language Models

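This study develops the Behavioral Portability Test, a method that analyzes model behavior and evaluates how well that behavior carries over to other environments.

Use case: Measuring behavior portability in large language models
Difficulty: Hard
Cost: High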

Design and Development of a Neuromorphic Silicon Suite: PVT Sensing, Stochastic LIF Inference, On-Chip STDP Learning, and Crossbar Programming

Edge neuromorphic systems need compact, configurable hardware that combines probabilistic inference, local lea

Sensor/time series, Deep learning, Transformer, Generation

Use case: Generation
Difficulty: Hard
Cost: Medium

Multi-Level Resistive Synapses for On-Chip Neural Networks: A Physics-Based Design of a Memristive Crossbar Fabric with Quasi-Continuous Conductance States

Building on resistive communication, this paper presents a physics-based design of an on-chip neural network w

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

GARIP: A Running-Average Moving Reference for Last-Iterate Self-Play in Two-Player Zero-Sum Games

Self-play with naive gradient ascent cycles in two-player zero-sum games: the last iterate orbits the equilibr

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Risk-Aware Information Theory

We develop a risk-aware information theory by replacing expectation with expectiles, introducing expectile ent

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-20

Distilling a Modular Reservoir Through a Genomic Bottleneck

The intricate structures of biological neural networks largely emerge during development, guided by a comparat

Use case: Generation
Difficulty: Hard
Cost: High

Soliton-like Waves in a Two-Dimensional Recurrent Spiking Neural Network with Weighted Spike-Timing-Dependent Plasticity

We construct a minimal but biologically plausible spiking neuron model operating in discrete time, combining m

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

On the Use of Survival Selection Methods for Evolutionary Diversity Optimisation

Generating a diverse set of high quality solutions for an optimisation problem has been studied extensively in

Quality prediction/anomaly detection, Deep learning, Model compression/quantization, Generation

Use case: Generation
Difficulty: Hard
Cost: Medium

Prophet Inequalities under Local Differential Privacy

Many online decision platforms, from hiring marketplaces to auctions, face a tension between efficient decisio

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

Simultaneously Efficient Allocation of Indivisible Items Across Multiple Dimensions

Many allocation problems are intrinsically multidimensional, since an item may contribute differently to sever

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

OPC UA Shared-Memory: Conceptual Elaboration and Prototypical Implementation Using Iceoryx2

The increasing virtualization of automation software leads to a growing co-location of heterogeneous applicati

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

arxiv, GitHub available, 2026-06-18

Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning

The temporal structure of reward composition in reinforcement learning (RL) is typically hand-designed and hel

Suited to MI, Deep learning, Transformer, Reinforcement learning

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-18

Evolutionary Two-Stage Hyperparameter Optimization Strategies for Physics-Informed Neural Networks

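Proposes evolutionary two-stage hyperparameter optimization strategies for physics-informed neural networks, which build physical laws into the network and use them to improve learning.

Condition optimization, Deep learning, Transformer

Use case: Hyperparameter optimization for physics-informed neural networks
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-18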

Hybrid ANN-SNN Pipeline with Local Plasticity

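A hybrid design approach combining ANNs and SNNs, aimed at neural network design

Deep learning, CNN, Classification, Image

Use case: Neural network design
Difficulty: Hard
Cost: High

arxiv, GitHub available, 2026-06-18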

Weight Adaptation for Improving Parallel Performance of Adaptive Stochastic Natural Gradient

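Probabilistic model-based evolutionary algorithms are powerful tools for black-box optimization. ASNG in particular achieves efficient and stable optimization through adaptation. However, how to control the weights remains an open area

Condition optimization, Deep learning, Model compression/quantization

Use case: Weight adaptation for better parallel performance
Difficulty: Easy
Cost: Medium

Explainable, Sensor/time series, Quality prediction/anomaly detection, Deep learning, RNN / LSTM, Audio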

Adaptive Speech-to-Spike Encoding for Spiking Neural Networks

This study uses spiking neural networks for speech pattern recognition.

Use case: Speech pattern recognition
Difficulty: Hard
Cost: High

Model Merging to Evolution: Parameter Space Exploration for Expert Models

Model merging integrates the capabilities of multiple expert models to create strong models for multiple tasks

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

Attention as Frustrated Synchronization

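Studies the attention mechanism from the perspective of frustrated synchronization. By treating token states as correlated phases, this approach aims to understand how the attention mechanism is computed

Use case: Attention mechanisms
Difficulty: Hard
Cost: High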

FPGA-Accelerated Neuromorphic Vision System for Real-Time Orbital Object Detection

The escalating congestion in orbital space demands advanced monitoring solutions. This work presents a compreh

Use case: Detection
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-16

Dimensionality Controls When Modularity Helps in Continual Learning

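Studies how dimensionality determines when modularity helps in continual learning, with the aim of balancing stability and plasticity for more efficient learning.

Explainable, Deep learning, RNN / LSTM

Use case: Relationship between dimensionality and modularity in continual learning
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-16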

A Neuromorphic Trigger for Efficient Audio Event Detection

Efficient processing of continuous audio streams remains a key challenge for real-time and resource-constraine

Deep learning, Model compression/quantization, Classification, Detection, Audio

Use case: Audio event detection
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-15

Energy-efficient codon optimization on thermodynamic hardware

This study addresses energy-efficient codon optimization on thermodynamic hardware.

Use case: Codon optimization
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-15

From Compression to Deployment: Real-Time and Energy-Efficient FastGRNN on Ultra-Constrained Microcontrollers

This study targets running machine learning on edge devices and provides a practical method for doing so.

Use case: Machine learning on edge devices
Difficulty: Hard
Cost: High

arxiv, Paper only, 2026-06-14

An Integrated System for Real-Time Student Assessment and Career Guidance Using Neural Networks in Computing Disciplines

Many undergraduate students in Computer Science (CS) and Software Engineering (SWE) struggle to identify suita

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Medium

arxiv, Paper only, 2026-06-14

AQ4SViT: An Automated Quantization Framework with Search Gating Policy for Compressing Spiking Vision Transformers

Spiking Vision Transformers (SViTs) have emerged as alternative low-power ViT models, but their large sizes hi

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

arxiv, Paper only, 2026-06-13

Controlled Dynamics Attractor Transformer

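This study proposes the Controlled Dynamics Attractor Transformer (CDAT), a Transformer involving the self-attention mechanism and Associa

Explainable, Quality prediction/anomaly detection, Deep learning, Transformer, Classification, Detection, Anomaly detection

Use case: Proposing the Controlled Dynamics Attractor Transformer (CDAT)
Difficulty: Hard
Cost: Low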

Harnessing cortical geometry, wiring, and function as inductive biases for recurrent neural networks

How the wiring and functional organization of cortex shape recurrent computation remains a central question in

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Test-Time Adaptation of Spiking Neural Networks for Intracortical Neural Decoding using Membrane Potential Alignment

Intracortical brain-computer interfaces suffer from day-to-day neural signal shifts that degrade pretrained de

Deep learning, RNN / LSTM, Unsupervised

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: High

VQ4SNN: Vector Quantization for Memory-Efficient FPGA Spiking Neural Networks

Spiking Neural Networks (SNNs) offer an energy-efficient paradigm for edge AI, making them attractive for hard

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

A Programmer's Guide to Cascaded Adaptive Combiners: Online Learning by Biologically Accurate Models of Multilayer Neuron Networks

Learning in biological multilayer neuronal networks offers insights that extend beyond the classical weighted-

Deep learning, Model compression/quantization, Classification, Image

Use case: Classification
Difficulty: Hard
Cost: Low

Robust Auto-associative Memory via Convolutional Restricted Hopfield Networks

Associative memory models play a fundamental role in pattern retrieval, but their performance often degrades u

Use case: Technical validation and paper-reading aid
Difficulty: Hard
Cost: Low

Sensor/time series, Deep learning, Graph neural network, Prediction, Time series

SpikF-GO: Spiking Fourier Graph Operators for Multivariate Time Series Forecasting

This study shows that Spiking Neural Networks (SNNs) can improve time-series forecasting while accounting for relationships among multiple variables.

Use case: Time-series forecasting
Difficulty: Hard
Cost: Low

ReSCom: A Reconfigurable Spiking Neural Network Accelerator Using Stochastic Computing

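Spiking neural networks are energy-efficient AI models. This study implements a spiking neural network accelerator and tests its performance.

Deep learning, RNN / LSTM, Classification, Image

Use case: Implementing a spiking neural network accelerator
Difficulty: Hard
Cost: Low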

Adaptive-Frequency Resonate-and-Fire Neurons for Spectral Estimation of Streaming Radar Signals

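Frequency estimation for FMCW radar typically relies on conventional Fourier transform methods, but these require buffering and processing of stored samples, which makes them hard to use in low-latency applications. The study proposes an approach based on ARF neurons that enables efficient frequency estimation.

Use case: Frequency estimation for FMCW radar
Difficulty: Hard
Cost: Medium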

Interaction Dynamics MPC for Knee Rehabilitation Exoskeletons: A Series-Elastic Instantiation

Safe rehabilitation is an interaction-dynamics problem: the controller must regulate a prescribed motion while

Use case: Classification
Difficulty: Hard
Cost: Low