2512.20871

#1 NeRV360: Neural Representation for 360-Degree Videos with a Viewport Decoder

Authors: Daichi Arai, Kyohei Unno, Yasuko Sugito, Yuichi Kusakabe

Implicit neural representations for videos (NeRV) have shown strong potential for video compression. However, applying NeRV to high-resolution 360-degree videos causes high memory usage and slow decoding, making real-time applications impractical. We propose NeRV360, an end-to-end framework that decodes only the user-selected viewport instead of reconstructing the entire panoramic frame. Unlike conventional pipelines, NeRV360 integrates viewport extraction into decoding and introduces a spatial-temporal affine transform module for conditional decoding based on viewpoint and time. Experiments on 6K-resolution videos show that NeRV360 achieves a 7-fold reduction in memory consumption and a 2.5-fold increase in decoding speed compared to HNeRV, a representative prior work, while delivering better image quality in terms of objective metrics.
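For context, the viewport extraction step that a conventional pipeline applies after decoding the full panorama can be sketched as a perspective (gnomonic) lookup into an equirectangular frame. This is a minimal illustration of that standard mapping, not the NeRV360 decoder or its spatial-temporal affine transform module. The function name, its parameters, the axis conventions, and the 6144x3072 panorama size used in the usage note are assumptions made for the example.

```python
import math


def viewport_to_equirect(u, v, vp_w, vp_h, fov_deg, yaw_deg, pitch_deg, pano_w, pano_h):
    """Map viewport pixel (u, v) to fractional equirectangular coordinates (x, y).

    Hypothetical helper: camera space uses x right, y down, z forward;
    positive yaw turns right, positive pitch looks up.
    """
    # Pinhole focal length in pixels, derived from the horizontal field of view.
    f = (vp_w / 2.0) / math.tan(math.radians(fov_deg) / 2.0)

    # Unit ray through the pixel, measured from the viewport center.
    dx = u - (vp_w - 1) / 2.0
    dy = v - (vp_h - 1) / 2.0
    dz = f
    norm = math.sqrt(dx * dx + dy * dy + dz * dz)
    dx, dy, dz = dx / norm, dy / norm, dz / norm

    # Rotate by pitch about the x axis, then by yaw about the y axis.
    p = math.radians(pitch_deg)
    dy, dz = dy * math.cos(p) - dz * math.sin(p), dy * math.sin(p) + dz * math.cos(p)
    yw = math.radians(yaw_deg)
    dx, dz = dx * math.cos(yw) + dz * math.sin(yw), -dx * math.sin(yw) + dz * math.cos(yw)

    # Convert the ray to longitude/latitude (y points down, so "up" is -dy).
    lon = math.atan2(dx, dz)
    lat = math.asin(max(-1.0, min(1.0, -dy)))

    # Equirectangular mapping: longitude spans the width, latitude the height.
    x = ((lon / (2.0 * math.pi) + 0.5) * pano_w) % pano_w
    y = (0.5 - lat / math.pi) * pano_h
    return x, y
```

With an assumed 6144x3072 panorama and a 101x101 viewport, calling `viewport_to_equirect(50, 50, 101, 101, 90, 0, 0, 6144, 3072)` maps the viewport center to the panorama center. In the conventional pipeline this lookup runs over every viewport pixel after the whole panoramic frame has been reconstructed, which is the cost the abstract says NeRV360 avoids by decoding only the selected viewport.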

Subjects: Computer Vision and Pattern Recognition, Multimedia, Image and Video Processing

Published: 2025-12-24 01:21:25 UTC