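Apple has shared details of its participation in this year's IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Here is what to expect.
Apple researchers to present work at CVPR
Apple today published its schedule and details for this year's CVPR, which the company is also sponsoring.
This year's CVPR runs from June 3 to June 7 at the Colorado Convention Center in Denver. Apple will take part through poster sessions, oral presentations, invited talks, a keynote, and related events.
Below are the research projects Apple will present at this year's CVPR, some of which we have covered previously: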
- AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding
- AToken: A Unified Tokenizer for Vision
- Bootstrapping Sign Language Annotations with Sign Language Models
- DSO: Direct Steering Optimization for Bias Mitigation
- From Where Things Are to What They’re For: Benchmarking Spatial–Functional Intelligence for Multimodal LLMs
- Learning Long-Term Motion Embeddings for Efficient Kinematics Generation
- Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
- SO-Bench: A Structural Output Evaluation of Multimodal LLMs
- STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
- TrajTok: Learning Trajectory Tokens enables better Video Understanding
- UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning
- Velox: Learning Representations of 4D Geometry and Appearance
- VSAS-Bench: Real-Time Evaluation of Visual Streaming Assistant Models
- What Matters in Practical Learned Image Compression
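Apple researcher Colin Lea will also deliver a keynote at the Generative AI for Sign Language (GenSign) workshop, followed by invited talks from three Apple engineers on June 3 and June 4.
The company also confirmed that researchers Hsin-Ping (Cindy) Huang and Maggie Xiao will represent Apple at the Women in Computer Vision (WiCV) mentoring dinner.
To see Apple's full schedule for this year's CVPR, follow this link.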


















