blocks|key|2809053|text|2016年，麻省理工学院CSAIL小组的一些人“在预测视觉方面取得了重大突破，开发了一种能够比以往任何时候更准确地预测交互作用的算法。”他们写了一篇文章，教学机器预测未来。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|2809054|他们做了一个很棒的视频，它展示了他们的结果。他们在YouTube视频和电视节目上训练了一种算法来预测两个人何时握手、拥抱、亲吻或打五下。|2809055|📷|atomic|2809056|研究人员卡尔·冯德里克(+Carl+)、哈米德·皮尔西亚瓦什(+Hamed+)和安东尼奥·托拉尔巴(+Antonio+)发表了一篇题为“用未标记视频预测视觉表示”的论文。|2809057|贾文成有一个在TensorFlow中实现的基于LSTM的模型的存储库。|2809058|有关预测行为或活动的最新论文列表，请参阅此列表。|entityMap|0|LINK|mutability|MUTABLE|url|http://news.mit.edu/2016/teaching-machines-to-predict-the-future-0621|1|https://www.youtube.com/watch?v=AR3hY9iB5-I|2|IMAGE|IMMUTABLE|imageUrl|https://i.stack.imgur.com/9hSmT.png|imageAlt|3|http://www.cs.columbia.edu/~vondrick/prediction/|4|https://github.com/chiawen/activity-anticipation|5|6|https://github.com/chinancheng/awesome-activity-prediction^0|25|8|0|0|9|2|1|0|0|1|2|0|1W|C|3|0|0|3|4|V|3|5|0|L|2|6^^$0|@$1|2|3|4|5|6|7|1A|8|@]|9|@$A|1B|B|1C|1|1D]]|C|$]]|$1|D|3|E|5|6|7|1E|8|@]|9|@$A|1F|B|1G|1|1H]]|C|$]]|$1|F|3|G|5|H|7|1I|8|@]|9|@$A|1J|B|1K|1|1L]]|C|$]]|$1|I|3|J|5|6|7|1M|8|@]|9|@$A|1N|B|1O|1|1P]]|C|$]]|$1|K|3|L|5|6|7|1Q|8|@]|9|@$A|1R|B|1S|1|1T]|$A|1U|B|1V|1|1W]]|C|$]]|$1|M|3|N|5|6|7|1X|8|@]|9|@$A|1Y|B|1Z|1|20]]|C|$]]]|O|$P|$5|Q|R|S|C|$T|U]]|V|$5|Q|R|S|C|$T|W]]|X|$5|Y|R|Z|C|$10|11|12|-4]]|13|$5|Q|R|S|C|$T|14]]|15|$5|Q|R|S|C|$T|16]]|17|$5|Q|R|S|C|$T|16]]|18|$5|Q|R|S|C|$T|19]]]]

In 2016 some people at MIT's CSAIL group "made an important new breakthrough in predictive vision, developing an algorithm that can anticipate interactions more accurately than ever before." They wrote an article, <a href="http://news.mit.edu/2016/teaching-machines-to-predict-the-future-0621" rel="nofollow noreferrer">Teaching machines to predict the future</a>.

They made a great <a href="https://www.youtube.com/watch?v=AR3hY9iB5-I" rel="nofollow noreferrer">video</a> which shows their results. They trained an algorithm on YouTube videos and TV shows to predict when two individuals will shake hands, hug, kiss, or slap five.

<a href="https://i.stack.imgur.com/9hSmT.png" rel="nofollow noreferrer"><img src="https://i.stack.imgur.com/9hSmT.png" alt="enter image description here"></a>

The researchers, Carl Vondrick, Hamed Pirsiavash, and Antonio Torralba, published a paper at CVPR 2016 entitled <a href="http://www.cs.columbia.edu/~vondrick/prediction/" rel="nofollow noreferrer">Anticipating Visual Representations with Unlabeled Video</a>. 

<a href="https://github.com/chiawen/activity-anticipation" rel="nofollow noreferrer">Chia-Wen Cheng</a> has a <a href="https://github.com/chiawen/activity-anticipation" rel="nofollow noreferrer">repository</a> of an LSTM-based model implemented in TensorFlow.

For a more up to date list of papers on predicting actions or activities see this <a href="https://github.com/chinancheng/awesome-activity-prediction" rel="nofollow noreferrer">list</a>.

blocks|key|4331|text|是的有。实际上，在序列上使用NNs是深度学习的一个重要部分，其中最强大的NNs来自于这个领域的研究。它们被称为递归神经网络或RNN。我建议您阅读这的文章或观看这斯坦福的讲座，以了解更多关于它们的信息。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|4332|特别是RNN的一个变体，即所谓的LSTM，它代表长期内存，使网络能够记忆早期的输入。|4333|根据您的解释，我建议您有一个多对一的任务，您输入许多帧，并输出一个类。这些类型的任务，您输入图像最常解决的组合使用CNN和RNN。您可以阅读这的文章，以获得更多的洞察力，对这种技术。|entityMap|0|LINK|mutability|MUTABLE|url|http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/|1|https://www.youtube.com/watch?v=6niqTuYFZLQ|2|https://blog.coast.ai/continuous-video-classification-with-tensorflow-inception-and-recurrent-nets-250ba9ff6b85^0|20|1|0|27|1|1|0|0|1Y|1|2^^$0|@$1|2|3|4|5|6|7|S|8|@]|9|@$A|T|B|U|1|V]|$A|W|B|X|1|Y]]|C|$]]|$1|D|3|E|5|6|7|Z|8|@]|9|@]|C|$]]|$1|F|3|G|5|6|7|10|8|@]|9|@$A|11|B|12|1|13]]|C|$]]]|H|$I|$5|J|K|L|C|$M|N]]|O|$5|J|K|L|C|$M|P]]|Q|$5|J|K|L|C|$M|R]]]]

Yes there is. Actually using NNs on sequences is a big part of Deep Learning and one of the most powerful NNs come from this field of research. They are called Recurrent Neural Networks or RNNs. I would recommend you to read <a href="http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/" rel="nofollow noreferrer">this</a> article or watch <a href="https://www.youtube.com/watch?v=6niqTuYFZLQ" rel="nofollow noreferrer">this</a> Stanford lecture, to learn more about them. 

Especially a variant of RNNs, the so called LSTM that stands for Long Short Term Memory enables networks to memorize earlier inputs.

Based on your explanation I would suggest you have a many-to-one task where you input many frames and output one class. These types of tasks where you input images is most often solved using a combination of CNN and RNN. You could read <a href="https://blog.coast.ai/continuous-video-classification-with-tensorflow-inception-and-recurrent-nets-250ba9ff6b85" rel="nofollow noreferrer">this</a> article to get more insights, to that technique.

Is there any work done on analyzing sequence of frames from a video using Deep Learning techniques?

By "analyzing" I mean like memorizing them in order to classify or predict something (e.g. by taking into account first 10 frames of a video the model can make some sort of conclusion).

Analyzing Videos using Deep Learning

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

有没有做过用深度学习技术分析视频帧序列的工作？我的意思是“分析”，我的意思是，为了分类或预测某件事而记住它们(例如，考虑到视频的前10帧，模型就能得出某种结论)。

问利用深度学习分析视频
EN

回答 2

Data Science用户

Data Science用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问利用深度学习分析视频EN

回答 2

Data Science用户

Data Science用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问利用深度学习分析视频
EN