生成AI

diffusers — 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

.diffusion モデルのライブラリ。画像・動画・音声生成に利用可能。

生成AI拡散モデル生成画像テキスト

Awesome-Video-Diffusion — A curated list of recent diffusion models for video generation, editing, and various other applications.

Awesome-Video-Diffusionは、Recent Diffusion Models for Video Generation, Editing, and Othersのリストを公開しています。

生成AI拡散モデル生成動画

deepinv — DeepInverse: a PyTorch library for solving imaging inverse problems using deep learning

ピラミードライブラリを使ったイメージインバース問題の解決に使えるライブラリです。

生成AI拡散モデル画像自己教師

未読 31件

diffusers — 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

.diffusion モデルのライブラリ。画像・動画・音声生成に利用可能。

生成AI拡散モデル生成画像テキスト

用途: 画像・動画・音声生成
難易度: Easy
コスト: High

Awesome-Video-Diffusion — A curated list of recent diffusion models for video generation, editing, and various other applications.

Awesome-Video-Diffusionは、Recent Diffusion Models for Video Generation, Editing, and Othersのリストを公開しています。

生成AI拡散モデル生成動画

用途: ビデオ生成や編集の問題を解決する
難易度: Easy
コスト: High

deepinv — DeepInverse: a PyTorch library for solving imaging inverse problems using deep learning

ピラミードライブラリを使ったイメージインバース問題の解決に使えるライブラリです。

生成AI拡散モデル画像自己教師

用途: イメージインバース問題の解決
難易度: Easy
コスト: High

ComfyUI — The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

runanywhere-sdksは、AIをローカルに実行するために使用できるプロダクションレディのツールキットです。

生成AI拡散モデル

用途: 高性能ディフュージョンモデルGUI環境の実現
難易度: Easy
コスト: High

品質予測/異常検知生成AI動画生成生成画像テキスト

GraphVid: Interactive Graph-Controllable Video Generation

GraphVidは、グラフと文本から生成することができ、オブジェクトの複数の移動を正確に制御することができる。グラフではオブジェクトの動きを表す情報を保存し、文から生成の制約を指定することができる。

用途: コントロール可能なビデオ生成
難易度: Hard
コスト: High

ElasticTTT: Prior-Preserving Test-Time Tuning for Video Editing

ElasticTTTは、プログラムがテストのときに動作を調整できるようにした。方法は、テストのときにモデルが前のサンプルの情報と現在の情報を組み合わせて、ビデオを編集する際に正しく動作するようにした。

生成AI拡散モデル生成テキスト動画

用途: ビデオ編集時のテストタイムチューニング
難易度: Hard
コスト: High

品質予測/異常検知生成AIGAN生成画像マルチモーダル

Physics-Informed Deep Learning Model for Cross-Modality Super-Resolution in Fluorescence Microscopy

Cross-modality image translation offers a route to super-resolution fluorescence microscopy from low-resolutio

用途: 生成
難易度: Hard
コスト: High

Causal-AgentIR: Self-Evolving Causal Memory for Adaptive Image Restoration Agents

Image restoration agents have recently emerged as a flexible paradigm for handling diverse and unpredictable d

品質予測/異常検知生成AIGAN画像テキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Low

GuidedAttention: Interpretable and Correctable Visual Attention for OOD-Robust Robot Manipulation via Imitation Learning

視覚モータリティポリシーを学習する際、人間が視覚アタッチメントを理解し、修正できるようにするため、視覚アタッチメントを明示的にしたフレームワークを提案します。

説明可能生成AI拡散モデル異常検知画像

用途: ロボットマニュピュレーションの視覺アタッチメント
難易度: Hard
コスト: High

arxivPaper only2026-07-22

Self-organizing Architecture of Receptron Units: a Hardware-Aware Framework for Edge Intelligence

エッジコンピューティング用ニューロモーフィッククラッサを提案する。

説明可能生成AIGAN分類

用途: エッジコンピューティング用ニューロモーフィッククラッサ
難易度: Hard
コスト: Low

arxivPaper only2026-07-22

The Human-AI Substitution Principle: When will you be replaced by AI in your organization?

Artificial Intelligence (AI) is rapidly transforming organizations, raising a fundamental organizational and e

生成AIGAN

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

arxivPaper only2026-07-22

Generative AI floods and dilutes the market for books

Generative AI can produce book-length works of fiction at near-zero cost. These books are often dismissed as l

品質予測/異常検知生成AI拡散モデル検出生成テキスト

用途: 検出
難易度: Hard
コスト: High

arxivPaper only2026-07-21

MIRA-Ev:A Benchmark for Granular Evidence Detection and Relational Reasoning in Clinical Exams

Clinical NLP evaluation remains dominated by multiple-choice question answering (MCQA), which scores only fina

生成AIGAN分類検出QA

用途: 分類
難易度: Hard
コスト: Low

arxivPaper only2026-07-21

End-to-end Conditional Diffusion for Realistic and Controllable Visual Traffic Scenario Generation

この文書では、閉回路交通シナリオ生成のための変分ベースのアプローチ「E2E-CDiff」を提案しました。これを使用すると、実世界に近い交通ルールを生成したり、交通ルールを操作することができるようになります。

生成AI拡散モデル生成画像

用途: 自動運転データの生成
難易度: Hard
コスト: High

githubGitHubあり2026-07-21

DNA-Diffusion — 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨

人工DNAシーケンスを生成するモデルを提案し、DNAシーケンスを扱える機械学習的手法を開発することを目的としている。

生成AI拡散モデル生成

用途: DNAシーケンスの発生学習
難易度: Easy
コスト: High

arxivPaper only2026-07-20

Using binary silver labels in electronic health records-based computable phenotyping algorithms

Gold-standard phenotype labels are often unavailable at scale in electronic health record (EHR) studies becaus

生成AI拡散モデル回帰テキスト

用途: 回帰
難易度: Hard
コスト: High

arxivPaper only2026-07-20

How Fast Do Signatures Learn? Statistical Theory and Applications for Path Regression

この論文では、パス値学習を行うためにpath signaturesという

MI向き生成AI拡散モデル検出回帰予測

用途: パス値学習を行う
難易度: Hard
コスト: High

huggingfaceHugging Faceあり2026-07-20

Subliminal Clocks: Latent Time Modelling in Diffusion Language Models

Diffusion Language Models (DLMs) have recently emerged as a promising alternative to autoregressive models. Un

説明可能生成AI拡散モデルテキスト

用途: 技術検証・論文読解補助
難易度: Easy
コスト: High

arxivGitHubあり2026-07-18

Twisted Schrödinger Bridge Matching

Over the past few years, diffusion-based Schrödinger bridge models have been proposed to approximate optimal t

生成AI拡散モデル生成

用途: 生成
難易度: Hard
コスト: High

arxivGitHubあり2026-07-18

A Deep Second-Order Stochastic Residual Method for Fully Nonlinear Parabolic PDEs

We introduce the Deep Second-Order Stochastic Residual Method (D2SRM) for high-dimensional, Hessian-dependent

生成AI拡散モデル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

arxivPaper only2026-07-15

Asymptotical Analysis of the $(1+(λ,λ))$ GA Escape Time from Local Optima on Jump Functions

遺伝的アルゴリズムのランタイム分析を実行する。これにより、遺伝的アルゴリズムのパフォーマンスを理解し、改善することができる。

生成AIVAE

用途: 遺伝的アルゴリズムのランタイム分析
難易度: Hard
コスト: Medium

arxivPaper only2026-07-13

Decoupling Corruption and Horizon in Robust Contextual Pricing

We study robust repeated contextual pricing, where valuations depends linearly on the features. At each round

生成AIGANテキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: Medium

githubGitHubあり2026-07-13

Matcha-TTS — [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Matcha-TTSは、高速で条件付き流のマッチングを実現するTTSアーキテクチャであり、話者の特徴を考慮する。

生成AI拡散モデルテキスト音声

用途: TTSアーキテクチャ設計
難易度: Easy
コスト: High

githubGitHubあり2026-07-13

Irodori-TTS — A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control

Emotion-driven Style Controlを使用してテキストから声の変換が実行され、感情のあるテキストをエモタイザブルな声に変換することが可能になります。

生成AI拡散モデル生成テキスト音声

用途: テキスト-to-声の変換
難易度: Easy
コスト: High

arxivPaper only2026-07-11

Conservation Laws for Diffusion Models

While autoregressive models optimize the exact data likelihood via the chain rule, diffusion models are typica

生成AI拡散モデルテキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

githubGitHubあり2026-07-08

VoxCPM — VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

生成AI音声・音楽生成生成テキスト音声

用途: マルチラギングスピーチ生成
難易度: Easy
コスト: Medium

arxivPaper only2026-07-03

Teaming Up with AI: Coordination and Cooperation

Successful diffusion of AI in the workforce hinges on the economic value that AI brings to human endeavors. Br

生成AI拡散モデル

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High

githubGitHubあり2026-07-01

MeanFlow — PyTorch implementation of MeanFlow & iMF (one-step generative modeling).

Operad理論を用いて、モデルが組み合わせ式に対する複合的な回答の合致性を検証する手法が提案された。

生成AI拡散モデル生成

用途: 対象モデルが不正を検知する
難易度: Easy
コスト: High

githubGitHubあり2026-06-30

ComfyUI-LTXVideo — LTX-Video Support for ComfyUI

医療画像分析で、深層學習モデルが実装されている問題に対する解決策を提示します。治療を導くために、批判的結果に影響を与える変化について特に重点が置かれています。

生成AI拡散モデル生成画像テキスト

用途: 医療画像を分析し治療を導く
難易度: Easy
コスト: High

githubGitHubあり2026-06-28

LanPaint — High quality training free inpaint for every stable diffusion model. Supports ComfyUI

画像生成のためのHigh Quality Training Free Inpaintを提供します。このInpaintはStable Diffusionモデルに使用でき、ComfyUIもサポートしています。

品質予測/異常検知生成AI拡散モデル生成画像動画

用途: 画像生成
難易度: Easy
コスト: High

arxivPaper only2026-06-22

EEG Benchmarking Needs a Task Specification Layer: NeuroDoc for Rulebook-Guided, Executable Benchmark Construction

Electroencephalography (EEG) foundation models increasingly rely on multi-dataset training and evaluation, yet

生成AIGANテキスト

用途: 技術検証・論文読解補助
難易度: Hard
コスト: High