georgehu学习生活 - LLM tag

Articles tagged with LLM

论文阅读-Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

CoT综述

2025-09-08发布于论文随笔

论文阅读-fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models

本文从“CLIP缺少细粒度的语义和上下文信息”的观察出发，针对医疗视频三元组识别任务对CLIP进行了改进，并提供了一个经过大量训练的预训练模型。

2025-03-30发布于论文随笔

论文阅读-LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Lecture Learning

本文主要目标是建立一个在手术视频领域上的视频-文本对的数据集，并且设计了一种较为“资源节约”的能够在本地为此训练一个LLM的方案。

2025-03-30发布于论文随笔