-
Foundation Models for Generalist Geospatial Artificial Intelligence
Paper ⢠2310.18660 ⢠Published ⢠11 -
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization
Paper ⢠2309.16020 ⢠Published ⢠1 -
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper ⢠2312.02155 ⢠Published ⢠14 -
GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding
Paper ⢠2407.13519 ⢠Published
Collections
Discover the best community collections!
Collections including paper arxiv:2310.18660
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper ⢠2409.02097 ⢠Published ⢠34 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper ⢠2409.11406 ⢠Published ⢠27 -
Diffusion Models Are Real-Time Game Engines
Paper ⢠2408.14837 ⢠Published ⢠126 -
Segment Anything with Multiple Modalities
Paper ⢠2408.09085 ⢠Published ⢠22
-
Attention Is All You Need
Paper ⢠1706.03762 ⢠Published ⢠108 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ⢠2307.08691 ⢠Published ⢠9 -
Mixtral of Experts
Paper ⢠2401.04088 ⢠Published ⢠160 -
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠56
-
ibm-granite/granite-geospatial-biomass
Image Feature Extraction ⢠Updated ⢠172 ⢠46 -
ibm-granite/granite-geospatial-wxc-downscaling
Image-to-Image ⢠Updated ⢠103 ⢠34 -
ibm-granite/granite-geospatial-canopyheight
Image Feature Extraction ⢠Updated ⢠14 ⢠18 -
ibm-granite/granite-geospatial-land-surface-temperature
Image Feature Extraction ⢠Updated ⢠98 ⢠19
-
Visual Instruction Tuning
Paper ⢠2304.08485 ⢠Published ⢠20 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper ⢠2311.05437 ⢠Published ⢠51 -
Improved Baselines with Visual Instruction Tuning
Paper ⢠2310.03744 ⢠Published ⢠39 -
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper ⢠2309.14525 ⢠Published ⢠31
-
Foundation Models for Generalist Geospatial Artificial Intelligence
Paper ⢠2310.18660 ⢠Published ⢠11 -
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization
Paper ⢠2309.16020 ⢠Published ⢠1 -
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper ⢠2312.02155 ⢠Published ⢠14 -
GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding
Paper ⢠2407.13519 ⢠Published
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper ⢠2409.02097 ⢠Published ⢠34 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper ⢠2409.11406 ⢠Published ⢠27 -
Diffusion Models Are Real-Time Game Engines
Paper ⢠2408.14837 ⢠Published ⢠126 -
Segment Anything with Multiple Modalities
Paper ⢠2408.09085 ⢠Published ⢠22
-
ibm-granite/granite-geospatial-biomass
Image Feature Extraction ⢠Updated ⢠172 ⢠46 -
ibm-granite/granite-geospatial-wxc-downscaling
Image-to-Image ⢠Updated ⢠103 ⢠34 -
ibm-granite/granite-geospatial-canopyheight
Image Feature Extraction ⢠Updated ⢠14 ⢠18 -
ibm-granite/granite-geospatial-land-surface-temperature
Image Feature Extraction ⢠Updated ⢠98 ⢠19
-
Visual Instruction Tuning
Paper ⢠2304.08485 ⢠Published ⢠20 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper ⢠2311.05437 ⢠Published ⢠51 -
Improved Baselines with Visual Instruction Tuning
Paper ⢠2310.03744 ⢠Published ⢠39 -
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper ⢠2309.14525 ⢠Published ⢠31
-
Attention Is All You Need
Paper ⢠1706.03762 ⢠Published ⢠108 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ⢠2307.08691 ⢠Published ⢠9 -
Mixtral of Experts
Paper ⢠2401.04088 ⢠Published ⢠160 -
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠56