Hybrid Robustness Verification for Spatio-Temporal Neural Networks
With AI increasingly deployed in safety-critical systems, providing formal robustness guarantees for the under
- Use
- Category
- Difficulty: Hard
- Cost: High
Search results for "3d"
78 results
Humans rely on spatially dense, geometry and force-aware tactile feedback at high temporal resolution for dext
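Perception for automated vehicles, such as self-driving cars and intelligent transportation systems, requires 3D object detection. Long-range detection on roads is difficult, and the time available for perception and decision-making at this "long range" is about 1-2 seconds. Two main challenges
Proposes EgoTactile, a model that can estimate hand pressure from egocentric video.
Building knowledge graphs from 3D simulation scenes plays an important role in robot task reasoning, but the step of mapping scene objects to a formal taxonomy has not yet been realized in practice. Using an LLM, this mapping
Airborne multispectral point clouds (MPC) provide data that combines 3D spatial and spectral information, but classifying point cloud data has been a difficult task, so a new learning framework is proposed.
Vision-and-Language navigation agents can explore environments by following language instructions. For zero-shot Vision-and-Language navigation agents, safety and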
Video world models that maintain 3D spatial consistency across generated frames typically rely on explicit poi
Text-driven indoor scene generation and editing require an intermediate representation that language models ca
The vascular network in the human body is characterized by blood vessels exhibiting drastic structural variati
Reliable motion classification is critical for autonomous driving, as false dynamic predictions of static obje
3D semantic scene generation is crucial for autonomous driving applications, yet most methods rely on complex
With the growing demand for realistic virtual humans, parametric body models have become a cornerstone of mode
During warhead detonation, high-density, high-speed, and mutually occluded fragments are generated. Their mech
4D generation (i.e., dynamic 3D generation) has recently emerged as a rapidly growing research fronti
Multimodal 3D object detection based on LiDAR and cameras has demonstrated excellent performance in ground-veh
Despite the rapid advancements in event-based motion estimation, current geometric methods primarily focus on
Unmanned Aerial Vehicle (UAV) multispectral point clouds (MPC) provide high-dimensional spatial-spectral data
Existing pruning methods for 3D Gaussian splatting (3DGS) suffer from either severe quality degradation or pro
Neural radiance field (NeRF) and 3D Gaussian splatting (3DGS) are two mainstream approaches for novel view syn
This paper introduces EPS3D, a new end-to-end feed-forward framework for open-vocabulary 3D panoptic segmentat
Diffusion models have advanced 3D shape generation, yet most methods still denoise in high-cardinality spaces
Vision-Language-Action (VLA) models demonstrate strong performance on language-conditioned robotic manipula
Autonomous mobile robots operating in tight environments require motion planning frameworks that account for t
Generating high-quality dexterous grasps remains challenging for learning-based methods, which often depend on
Reliable robotic navigation necessitates the seamless integration of accurate global localization and dense, m
Simulation plays a key role in automated robotics research supported by large language models (LLMs). However,
While global data-driven models excel at predicting continuous atmospheric variables, three-dimensional hydrom
Modernization of legacy scientific codes is often necessary to keep up with the ever-evolving changes in the c
As autonomous systems expand from capital-intensive robotaxis to cost-sensitive logistics, sensor configuratio
Accurate quantification and uptake measurement in PET are critical for assessing disease progression and suppo
Global LiDAR localization is a fundamental task for autonomous navigation systems. Recent methods perform Scen
Achieving fully automated, physically plausible 3D motion synthesis is a core objective in graphics and genera
Fisheye cameras are widely deployed in autonomous driving perception suites for their low cost and full-covera
Large and demographically balanced datasets are essential for reliable neuroimaging biomarkers. Full-resolutio
To perform a wide range of daily tasks, robots need to construct a 3D representation that is semantically rich
Modeling high-frequency outgoing radiance distributions remains a fundamental challenge in global illumination
Robotic grasping is a fundamental capability in robotic manipulation. Yet grasping remains challenging under p
Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object rela
Human manipulation videos are a convenient and intuitive source for robot learning. However, directly transfer
Robots deployed in human-centric environments routinely receive natural-language descriptions of spatial infor
Recent progress in robot manipulation has been largely driven by learning from large-scale demonstrations. For
Vision-Language-Action (VLA) models achieve strong benchmark performance but still struggle in real-world depl
Long-horizon robot operation requires spatio-temporal memory to record the environment state and recall it for
Understanding and comparing structures in scalar fields is a central challenge in scientific visualization, wi
We present SegmentAnyTreeV2, a sensor- and platform-agnostic framework for semantic and instance segmentation
Feed-forward 3D reconstruction models have recently shown strong generalization across diverse scenes, yet mos
Neural fields parameterize data as functions from coordinates to values, providing a unified framework for rep
MRI preprocessing defines the input distribution seen by brain MRI foundation models, yet it is usually treate
Designing 3D metamaterial microstructures that meet the intended functions remains a major challenge, as it ty
Ground contact forces acting on the human body are crucial for biomechanics studies or sport performance anal
Facial rigging - creating FACS-based blendshapes together with inner-mouth geometry (teeth, gums, and tongue)
Facial hair is a defining trait of personal identity, yet remains a critical bottleneck for digital avatars. R
Enabling humanoid robots to operate in complex, dynamic environments remains a critical challenge, fundamental
Flexible robotic automation requires systems that interpret operator intent, verify physical feasibility, and
Monopedal hopping robots are conceptually simple but highly dynamic and inherently unstable. Achieving robust
Object navigation requires a robot to search for an unobserved target in an unknown environment by deciding wh
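This paper proposes a machine learning framework for driving simulation. The framework needs to handle large amounts of data
In 3D Multi-Object Tracking (MOT), continuously tracking human motion requires inferring 3D human pose from 3D point cloud data, which relies mainly on geometric information, but depending on the situation, telling people apart
This paper proposes QuadVerse, a simulation framework for quadruped robots. QuadVerse uses simulation that accounts for visual, physical, and dynamic gaps, and integrates the quadruped robot's experimental environment with the simulation.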
3D visuomotor policies offer a promising direction for complex robotic manipulation, as depth maps and point c
This paper proposes Cross-view Fusion, a new framework for 6-DoF grasp pose estimation.
Aquatic robots have expanded human access to underwater environments, yet many underwater spaces contain obsta
This work extends LAAT (Locally Aligned Ant Technique); Hub-Aware Hybrid Search, a filtering algorithm designed to handle noisy, high-dimensional data,
Robots that operate over extended periods should not merely visit space; they should progressively understand
Human video datasets used for cotraining robot manipulation policies largely consist of curated demonstrations
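This work develops a method that accelerates point sampling.
This work proposes G-solver, a fully Gaussian and distributed framework.
This repository proposes a model that aims to give image recognition models the ability to generate actions. The model uses a pretrained image recognition model to generate complex actions.
This work proposes a modular architecture that improves 3D sonar simulation by accounting for real acoustic phenomena.
This work proposes a framework suited to path planning for underwater locomotion and improves path planning accuracy.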
Generalist robot intelligence is often framed as a policy-scaling problem: collect more robot demonstrations,
Open-vocabulary 3D functionality segmentation enables robots to localize functional object components in 3D sc
We introduce Convex Distance Operator Transport (CDOT), the first convex optimal transport framework that alig
Transformers trained on modular arithmetic exhibit sharp transitions between memorization, generalization, and
Surrogate Safety Measures (SSMs) are extensively utilised in the evaluation of traffic risk in automated drivi
The space L of linear value maps on a finite-player cooperative game G^N is finite-dimensional, and admits a c
The spatial and functional organization of the primate visual cortex is a fundamental problem in neuroscience.