ArXiv TLDR

WildLIFT: Lifting monocular drone video to 3D for species-agnostic wildlife monitoring

🐦 Tweet
2604.24718

Vandita Shukla, Fabio Remondino, Blair Costelloe, Benjamin Risse

cs.CV

TLDR

WildLIFT uses monocular drone video to create 3D detections and tracks for species-agnostic wildlife monitoring, enhancing ecological analysis.

Key contributions

  • Integrates 3D geometry from monocular drone video with open-vocabulary 2D segmentation.
  • Enables species-agnostic 3D detection and tracking of wildlife from standard drone footage.
  • Generates oriented 3D bounding boxes with semantic face info for viewpoint and occlusion analysis.
  • Validated on 2,581 frames across four mammal species, reducing manual 3D annotation effort.

Why it matters

WildLIFT transforms standard 2D drone footage into structured 3D and viewpoint-aware representations. This significantly enhances the analytical utility of aerial wildlife datasets for behavioral research and population monitoring, unlocking new insights from existing data.

Original Abstract

Monocular RGB cameras mounted on drones are widely used for wildlife monitoring, yet most analytical pipelines remain confined to two-dimensional image space, leaving geometric information in video underexploited. We present WildLIFT, a computational framework that integrates three-dimensional scene geometry from monocular drone video with open-vocabulary 2D instance segmentation to enable species-agnostic 3D detection and tracking. Oriented 3D bounding box labels with semantic face information enable quantitative assessment of viewpoint coverage and inter-animal occlusion, producing structured metadata for downstream ecological analyses. We validate the framework on 2,581 manually curated frames comprising over 6,700 3D detections across four large mammal species. WildLIFT maintains high identity consistency in multi-animal scenes and substantially reduces manual 3D annotation effort through keyframe-based refinement. By transforming standard drone footage into structured 3D and viewpoint-aware representations, WildLIFT extends the analytical utility of aerial wildlife datasets for behavioural research and population monitoring.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.