| 中文标题 | 作者 | 论文ID | 分类简称 | 发布时间 |
|---|---|---|---|---|
| 适应性视频流媒体中基于场景的可察觉差异感知码率分层 | Vignesh V Menon, Jingwen Zhu, Prajit T Rajendran, Hadi Amirpour, Patrick Le Callet, Christian Timmerer | 2305.00225 | cs.MM | 2023-05-02 |
| 采用层次特征融合和迭代混合数据库训练的野外图像盲质量评估 | Wei Sun and Xiongkuo Min and Danyang Tu and Guangtao Zhai and Siwei Ma | 2105.14550 | cs.MM | 2023-04-28 |
| 基于嵌入域选择和自适应误差修正的鲁棒图像隐写术抵抗有损JPEG压缩 | Xiaolong Duan, Bin Li, Zhaoxia Yin, Xinpeng Zhang, Bin Luo | 2304.13297 | cs.MM | 2023-04-27 |
| 对象和关系场景图无损压缩的自适应预测 | Yufeng Zhang and Weiyao Lin and Wenrui Dai and Huabin Liu and Hongkai Xiong | 2304.13359 | cs.MM | 2023-04-27 |
| 自适应视频流中的高效编码的绿色视频复杂性分析 | Vignesh V Menon, Christian Feldmann, Klaus Schoeffmann, Mohammad Ghanbari, Christian Timmerer | 2304.12384 | cs.MM | 2023-04-26 |
| GA2MIF:基于图和注意力的两阶段多源信息融合在对话情感检测中的应用 | Jiang Li, Xiaoping Wang, Guoqing Lv, Zhigang Zeng | 2207.11900 | cs.MM | 2023-04-25 |
| 自适应视频流的转码质量预测 | Vignesh V Menon, Reza Farahani, Prajit T Rajendran, Mohammed Ghanbari, Hermann Hellwagner, Christian Timmerer | 2304.10234 | cs.MM | 2023-04-21 |
| 神经-OSVETA:鲁棒的3D网格水印 | Bata Vasc, Nithin Raveendran and Bane Vasic | 2304.10348 | cs.MM | 2023-04-21 |
| 超高场下使用扩散模型的联合概率分布将脑PET从MRI中合成 | Taofeng Xie, Chentao Cao, Zhuoxu Cui, Fanshi Li, Zidong Wei, Yanjie Zhu, Ye Li, Dong Liang, Qiyu Jin, Guoqing Chen and Haifeng Wang | 2211.08901 | cs.MM | 2023-04-18 |
| 加权对抗学习的跨领域食物图像到菜谱检索 | Bin Zhu, Chong-Wah Ngo, Jingjing Chen, Wing-Kwong Chan | 2304.07387 | cs.MM | 2023-04-18 |
| 评估图像编解码强幂等性 | Qian Zhang, Tongda Xu, Yanghao Li, Yan Wang | 2304.08269 | cs.MM | 2023-04-18 |
| 视口相关的360度视频流媒体的最佳SVC比特流模式 | Gang Shen, Mingyang Ma, Guangxin Xu | 2304.05654 | cs.MM | 2023-04-13 |
| Android和iOS的多媒体分发过程跟踪 | Yu-Min Jeon, Won-Mu Heo, Jong-Min Kim, Kyounggon Kim | 2304.03848 | cs.MM | 2023-04-11 |
| 深度强化学习与重要性加权A3C在视频传输服务中提高用户体验的研究 | Mandan Naresh, Paresh Saxena, Manik Gupta | 2304.04527 | cs.MM | 2023-04-11 |
| GraphMFT:基于图网络的对话情感识别的多模态融合技术 | Jiang Li, Xiaoping Wang, Guoqing Lv, Zhigang Zeng | 2208.00339 | cs.MM | 2023-03-28 |
| ETMA:高效基于Transformer的多层次注意力框架用于多模态假新闻检测 | Ashima Yadav, Shivani Gaba, Haneef Khan, Ishan Budhiraja, Akansha Singh, and Krishan Kant Singh | 2206.07331 | cs.MM | 2023-03-14 |
| 基于多模态信息的语音处理(MISP)2022挑战:音频-视觉日程安排与识别 | Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu | 2303.06326 | cs.MM | 2023-03-14 |
| 基于置信度的事件中心在线视频问答:基于新构建的ATBS数据集 | Weikai Kong, Shuhong Ye, Chenglin Yao, Jianfeng Ren | 2303.03105 | cs.MM | 2023-03-08 |
| 构建具有语义重建的模态平衡区块链 | Zhijie Tan, Xiang Yuan, Shengwei Meng, Yakun Huang, Weiping Li, Zhonghai Wu, Tong Mo | 2303.02428 | cs.MM | 2023-03-07 |
| 利用语义相似度度量改进音频字幕生成 | Rehana Mahfuz, Yinyi Guo, Erik Visser | 2210.16470 | cs.MM | 2023-03-06 |
| 在流媒体应用中基于纹理信息融合的视频质量评估 | Vignesh V Menon, Prajit T Rajendran, Reza Farahani, Klaus Schoeffmann, Christian Timmerer | 2302.14465 | cs.MM | 2023-03-01 |
| 增强记忆的对比学习在说话头生成中的应用 | Jianrong Wang, Yaxin Zhao, Li Liu, Hongkai Fan, Tianyi Xu, Qi Li, Sen Li | 2302.13469 | cs.MM | 2023-02-28 |
| 小即大:重新思考深度隐藏的嵌入率 | Han Li, Hangcheng Liu, Shangwei Guo, Mingliang Zhou, Ning Wang, Tao Xiang, Tianwei Zhang | 2302.11918 | cs.MM | 2023-02-24 |
| 常见社交媒体平台和照片存储服务处理上传图像的实际分析 | Duc-Tien Dang-Nguyen, Vegard Velle Sj{o}en, Dinh-Hai Le, Thien-Phu Dao, Anh-Duy Tran, and Minh-Triet Tran | 2302.12133 | cs.MM | 2023-02-24 |
| 用仅自动提取的亮度曲线为电影创作音乐的计算创造力 | Felipe Ariani (PRISM), Marcelo Caetano (PRISM), Javier Elipe Gimeno (PRISM), Ivan Magrin-Chagnolleau (PRISM) | 2302.09857 | cs.MM | 2023-02-21 |