2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI) — Latest Publications

Lifelog Semantic Annotation using deep visual features and metadata-derived descriptors
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500247
Authors: Bahjat Safadi, P. Mulhem, G. Quénot, J. Chevallet
Abstract: This paper describes a method for querying lifelog data from visual content and from metadata associated with the recorded images. Our approach mainly relies on mapping the query terms to visual concepts computed on the lifelog images according to two separate learning schemes based on deep visual features. A post-processing step is then performed if the topic is related to time, location, or activity information associated with the images. This work was evaluated in the context of the Lifelog Semantic Access sub-task of NTCIR-12 (2016). The results obtained are promising for a first participation in such a task, with an event-based MAP above 29% and an event-based nDCG value close to 39%.
Citations: 0
A Demo of multimodal medical retrieval
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500263
Authors: Ranveer Joyseeree, Roger Schaer, H. Müller
Abstract: Providing personalized medical care based on a patient's specific characteristics (diagnostic-image content, age, sex, weight, and so on) is an important aspect of modern medicine. This paper describes tools that aim to facilitate this process by providing clinicians with information regarding the diagnosis and treatment of past patients with similar characteristics. The additional information thus provided can help make better-informed decisions regarding the diagnosis and treatment planning of new patients. Two existing tools, Shambala and Shangri-La, can be combined for use within a clinical environment. Deployment inside healthcare facilities can become possible via the MD-Paedigree project.
Citations: 0
Experimenting with musically motivated convolutional neural networks
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500246
Authors: Jordi Pons, T. Lidy, Xavier Serra
Abstract: A common criticism of deep learning relates to the difficulty in understanding the underlying relationships that the neural networks are learning, thus behaving like a black box. In this article we explore various architectural choices of relevance for music signal classification tasks in order to start understanding what the chosen networks are learning. We first discuss how convolutional filters with different shapes can fit specific musical concepts, and based on that we propose several musically motivated architectures. These architectures are then assessed by measuring the accuracy of the deep learning model in the prediction of various music classes using a known dataset of audio recordings of ballroom music. The classes in this dataset have a strong correlation with tempo, which allows assessing whether the proposed architectures are learning frequency and/or time dependencies. Additionally, a black-box model is proposed as a baseline for comparison. With these experiments we have been able to understand what some deep learning based algorithms can learn from a particular set of data.
Citations: 130
Real-time multilevel sequencing of cataract surgery videos
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500245
Authors: K. Charrière, G. Quellec, M. Lamard, D. Martiano, G. Cazuguel, G. Coatrieux, B. Cochener
Abstract: Data recorded and stored during video-monitored surgeries are a relevant source of information for surgeons, especially during their training period. But today, this data is virtually unexploited. In this paper, we propose to reuse videos recorded during cataract surgeries to automatically analyze the surgical process under a real-time constraint, with the aim of assisting the surgeon during the surgery. We propose to automatically recognize, in real time, what the surgeon is doing: what surgical phase or, more precisely, what surgical step he or she is performing. This recognition relies on the inference of a multilevel statistical model which uses 1) the conditional relations between levels of description (steps and phases) and 2) the temporal relations among steps and among phases. The model accepts two types of inputs: 1) the presence of surgical instruments, manually provided by the surgeons, or 2) motion in videos, automatically analyzed through the CBVR paradigm. A dataset of 30 cataract surgery videos was collected at Brest University Hospital. The system was evaluated in terms of mean area under the ROC curve. Promising results were obtained using either motion analysis (Az = 0.759) or the presence of surgical instruments (Az = 0.983).
Citations: 8
Static and dynamic autopsy of deep networks
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500267
Authors: Titouan Lorieul, Antoine Ghorra, B. Mérialdo
Abstract: Although deep learning has been a major breakthrough in recent years, Deep Neural Networks (DNNs) are still the subject of intense research, and many issues remain on how to use them efficiently. In particular, training a deep network remains a difficult process, which requires extensive computation, and for which very precise care has to be taken to avoid overfitting, a high risk because of the extremely large number of parameters. The purpose of our work is to perform an autopsy of pre-trained deep networks, with the objective of collecting information about the values of the various parameters, and their possible relations and correlations. The motivation is that some of these observations could later be used as a priori knowledge to facilitate the training of new networks, by guiding the exploration of the parameter space into more probable areas. In this paper, we first present a static analysis of the AlexNet deep network by computing various statistics on the existing parameter values. Then, we perform a dynamic analysis by measuring the effect of certain modifications of those values on the performance of the network. For example, we show that quantizing the values of the parameters to a small adequate set of values leads to similar performance as the original network. These results suggest that pursuing such studies could lead to the design of improved training procedures for deep networks.
Citations: 3
Crowdsourcing as self-fulfilling prophecy: Influence of discarding workers in subjective assessment tasks
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500256
Authors: M. Riegler, V. Reddy, M. Larson, Ragnhild Eg, P. Halvorsen, C. Griwodz
Abstract: Crowdsourcing has established itself as a powerful tool for multimedia researchers, and is commonly used to collect human input for various purposes. It is also a fairly widespread practice to control the contributions of workers based on the quality of their input. This paper points to the fact that applying this practice in subjective assessment tasks may lead to an undesired, negative outcome. We present a crowdsourcing experiment and a discussion of the ways in which control in crowdsourcing studies can lead to a phenomenon akin to a self-fulfilling prophecy. This paper is intended to trigger discussion and lead to more deeply reflective crowdsourcing practices in the multimedia context.
Citations: 15
Prediction of visual attention with Deep CNN for studies of neurodegenerative diseases
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500243
Authors: S. Chaabouni, F. Tison, J. Benois-Pineau, C. Amar
Abstract: As part of the automatic study of visual attention in populations affected by neurodegenerative diseases, and to predict whether new gaze records indicate such diseases, we should design an automatic model that predicts salient areas in video. Past research showed that people suffering from dementia are not reactive with regard to degradations in still images. In this paper we study the reaction of healthy normal control subjects to degraded areas in videos. Furthermore, with the goal of building an automatic prediction model for salient areas in intentionally degraded videos, we design a deep learning architecture and measure its performance when predicting salient regions on completely unseen data. The obtained results are interesting regarding the reaction of normal control subjects to degraded areas in video.
Citations: 6
Temporal segmentation of laparoscopic videos into surgical phases
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500249
Authors: Manfred Jürgen Primus, Klaus Schöffmann, L. Böszörményi
Abstract: Videos of laparoscopic surgeries need to be segmented temporally into phases so that surgeons can use the recordings efficiently in their everyday work. In this paper we investigate the performance of an automatic phase segmentation method based on instrument detection and recognition. Contrary to known methods that dynamically align phases to an annotated dataset, our method is not limited to standardized or unvarying endoscopic procedures. Phases of laparoscopic procedures show a high correlation with the presence of one or a group of certain instruments. Therefore, the first step of our procedure is the definition of a set of rules that describe these correlations. The next step is the spatial detection of instruments using a color-based segmentation method and a rule-based interpretation of image moments for the refinement of the detections. Finally, the detected regions are recognized with SVM classifiers and ORB features. The evaluation shows that the proposed technique finds phases in laparoscopic videos of cholecystectomies reliably.
Citations: 27
Interactive exploration of healthcare queries
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500275
Authors: A. Bampoulidis, M. Lupu, João Palotti, S. Metallidis, J. Brassey, A. Hanbury
Abstract: Healthcare-related queries are a treasure trove of information about the information needs of domain users, be they patients or doctors. However, unlike general queries, in order to make the most of the information therein, such queries have to be processed within a medical terminology annotation pipeline. We show how this has been done in the context of the KConnect project and demonstrate an interactive query log exploration interface that allows data analysts and search engineers to better understand their users and design a better search experience.
Citations: 2
A dataset of multimedia material about classical music: PHENICX-SMM
Pub Date: 2016-06-15 | DOI: 10.1109/CBMI.2016.7500240
Authors: M. Schedl, D. Hauger, M. Tkalcic, M. Melenhorst, Cynthia C. S. Liem
Abstract: We present a freely available dataset of multimedia material that can be used to build enriched browsing and retrieval systems for music. It is one result of the EU-FP7 funded project "Performances as Highly Enriched aNd Interactive Concert experiences" (PHENICX), which aims at enhancing the listener experience when enjoying classical music. The presented PHENICX-SMM dataset includes in total more than 50,000 multimedia items (text, image, audio) about composers, performers, pieces, and instruments. In addition to presenting the dataset, we detail one possible use case, that of building a personalized music information system that suggests certain types and quantities of multimedia material based on personality traits and the musical experience of its users. We evaluate the system via a user study and show that people generally prefer the personalized results over non-personalized ones.
Citations: 3