Title Authors Paper ID Category Published
Just Noticeable Difference-Aware Per-Scene Bitrate-Laddering for Adaptive Video Streaming Vignesh V Menon, Jingwen Zhu, Prajit T Rajendran, Hadi Amirpour, Patrick Le Callet, Christian Timmerer 2305.00225 cs.MM 2023-05-02
Blind Quality Assessment for In-the-Wild Images via Hierarchical Feature Fusion and Iterative Mixed Database Training Wei Sun and Xiongkuo Min and Danyang Tu and Guangtao Zhai and Siwei Ma 2105.14550 cs.MM 2023-04-28
Robust Image Steganography Against Lossy JPEG Compression Based on Embedding Domain Selection and Adaptive Error Correction Xiaolong Duan, Bin Li, Zhaoxia Yin, Xinpeng Zhang, Bin Luo 2304.13297 cs.MM 2023-04-27
Adaptive Prediction for Lossless Compression of Object and Relation Scene Graphs Yufeng Zhang and Weiyao Lin and Wenrui Dai and Huabin Liu and Hongkai Xiong 2304.13359 cs.MM 2023-04-27
Green Video Complexity Analysis for Efficient Encoding in Adaptive Video Streaming Vignesh V Menon, Christian Feldmann, Klaus Schoeffmann, Mohammad Ghanbari, Christian Timmerer 2304.12384 cs.MM 2023-04-26
GA2MIF: Graph and Attention Based Two-Stage Multi-Source Information Fusion for Conversational Emotion Detection Jiang Li, Xiaoping Wang, Guoqing Lv, Zhigang Zeng 2207.11900 cs.MM 2023-04-25
Transcoding Quality Prediction for Adaptive Video Streaming Vignesh V Menon, Reza Farahani, Prajit T Rajendran, Mohammed Ghanbari, Hermann Hellwagner, Christian Timmerer 2304.10234 cs.MM 2023-04-21
Neural-OSVETA: Robust 3D Mesh Watermarking Bata Vasc, Nithin Raveendran and Bane Vasic 2304.10348 cs.MM 2023-04-21
Synthesizing Brain PET from MRI at Ultra-High Field Using the Joint Probability Distribution of a Diffusion Model Taofeng Xie, Chentao Cao, Zhuoxu Cui, Fanshi Li, Zidong Wei, Yanjie Zhu, Ye Li, Dong Liang, Qiyu Jin, Guoqing Chen and Haifeng Wang 2211.08901 cs.MM 2023-04-18
Cross-Domain Food Image-to-Recipe Retrieval by Weighted Adversarial Learning Bin Zhu, Chong-Wah Ngo, Jingjing Chen, Wing-Kwong Chan 2304.07387 cs.MM 2023-04-18
Evaluating Strong Idempotence of Image Codecs Qian Zhang, Tongda Xu, Yanghao Li, Yan Wang 2304.08269 cs.MM 2023-04-18
Optimal SVC Bitstream Schema for Viewport-Dependent 360-Degree Video Streaming Gang Shen, Mingyang Ma, Guangxin Xu 2304.05654 cs.MM 2023-04-13
Tracking the Multimedia Distribution Process on Android and iOS Yu-Min Jeon, Won-Mu Heo, Jong-Min Kim, Kyounggon Kim 2304.03848 cs.MM 2023-04-11
Deep Reinforcement Learning with Importance-Weighted A3C for Improving User Experience in Video Delivery Services Mandan Naresh, Paresh Saxena, Manik Gupta 2304.04527 cs.MM 2023-04-11
GraphMFT: A Graph Network Based Multimodal Fusion Technique for Emotion Recognition in Conversation Jiang Li, Xiaoping Wang, Guoqing Lv, Zhigang Zeng 2208.00339 cs.MM 2023-03-28
ETMA: Efficient Transformer-Based Multilevel Attention Framework for Multimodal Fake News Detection Ashima Yadav, Shivani Gaba, Haneef Khan, Ishan Budhiraja, Akansha Singh, and Krishan Kant Singh 2206.07331 cs.MM 2023-03-14
The Multimodal Information Based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition Zhe Wang, Shilong Wu, Hang Chen, Mao-Kui He, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Siniscalchi, Odette Scharenborg, Diyuan Liu, Baocai Yin, Jia Pan, Jianqing Gao, Cong Liu 2303.06326 cs.MM 2023-03-14
Confidence-Based Event-Centric Online Video Question Answering on a Newly Constructed ATBS Dataset Weikai Kong, Shuhong Ye, Chenglin Yao, Jianfeng Ren 2303.03105 cs.MM 2023-03-08
Building a Modality-Balanced Blockchain with Semantic Reconstruction Zhijie Tan, Xiang Yuan, Shengwei Meng, Yakun Huang, Weiping Li, Zhonghai Wu, Tong Mo 2303.02428 cs.MM 2023-03-07
Improving Audio Captioning Using Semantic Similarity Metrics Rehana Mahfuz, Yinyi Guo, Erik Visser 2210.16470 cs.MM 2023-03-06
Video Quality Assessment with Texture Information Fusion for Streaming Applications Vignesh V Menon, Prajit T Rajendran, Reza Farahani, Klaus Schoeffmann, Christian Timmerer 2302.14465 cs.MM 2023-03-01
Memory-Augmented Contrastive Learning for Talking Head Generation Jianrong Wang, Yaxin Zhao, Li Liu, Hongkai Fan, Tianyi Xu, Qi Li, Sen Li 2302.13469 cs.MM 2023-02-28
Small Is Big: Rethinking the Embedding Rate of Deep Hiding Han Li, Hangcheng Liu, Shangwei Guo, Mingliang Zhou, Ning Wang, Tao Xiang, Tianwei Zhang 2302.11918 cs.MM 2023-02-24
Practical Analysis of How Common Social Media Platforms and Photo Storage Services Handle Uploaded Images Duc-Tien Dang-Nguyen, Vegard Velle Sjøen, Dinh-Hai Le, Thien-Phu Dao, Anh-Duy Tran, and Minh-Triet Tran 2302.12133 cs.MM 2023-02-24
Computational Creativity in Composing Music for Films Using Only Automatically Extracted Brightness Curves Felipe Ariani (PRISM), Marcelo Caetano (PRISM), Javier Elipe Gimeno (PRISM), Ivan Magrin-Chagnolleau (PRISM) 2302.09857 cs.MM 2023-02-21