blocks|key|1055011|text|您可以浏览ffmpeg的源代码(可通过svn获取)或其API+documentation。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1055012|entityMap|0|LINK|mutability|MUTABLE|url|https://git.ffmpeg.org/gitweb/ffmpeg.git|1|https://ffmpeg.org/doxygen/trunk/index.html^0|5|6|0|R|H|1|0^^$0|@$1|2|3|4|5|6|7|N|8|@]|9|@$A|O|B|P|1|Q]|$A|R|B|S|1|T]]|C|$]]|$1|D|3|-4|5|6|7|U|8|@]|9|@]|C|$]]]|E|$F|$5|G|H|I|C|$J|K]]|L|$5|G|H|I|C|$J|M]]]]

You can browse source code of <a href="https://git.ffmpeg.org/gitweb/ffmpeg.git" rel="nofollow noreferrer">ffmpeg</a> (available through svn), or its <a href="https://ffmpeg.org/doxygen/trunk/index.html" rel="nofollow noreferrer">API documentation</a>.

blocks|key|374921|text|从工作正常的编解码器中读取源代码似乎是正确的方法。我的建议如下：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|374922|http://www.mpeg.org/MPEG/video/mssg-free-mpeg-software.html|offset|length|374923|鉴于mpeg.org网站上提到了它，我想说你会在这里找到你需要的东西。|374924|在过去，我花了一些时间对mpeg视频进行解码(虽然没有音频)，而且原理非常简单。其中包含一些纯图像，一些中间图像相对于最接近的主图像进行描述，其余的使用最接近的主/中间图像进行描述。|374925|一个时隙，一个图像。但我想，最近的编解码器要复杂得多！|374926|编辑:同步|374927|我不是同步音频和视频的专家，但这个问题似乎是使用同步层解决的(参见there的定义)。|374928|entityMap|0|LINK|mutability|MUTABLE|url|1|https://mpeg.chiariglione.org/faq/which-layers-are-passed-mpeg-4-objects-are-composed^0|0|0|1N|0|0|0|0|0|0|X|5|1|0^^$0|@$1|2|3|4|5|6|7|Y|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|Z|8|@]|9|@$D|10|E|11|1|12]]|A|$]]|$1|F|3|G|5|6|7|13|8|@]|9|@]|A|$]]|$1|H|3|I|5|6|7|14|8|@]|9|@]|A|$]]|$1|J|3|K|5|6|7|15|8|@]|9|@]|A|$]]|$1|L|3|M|5|6|7|16|8|@]|9|@]|A|$]]|$1|N|3|O|5|6|7|17|8|@]|9|@$D|18|E|19|1|1A]]|A|$]]|$1|P|3|-4|5|6|7|1B|8|@]|9|@]|A|$]]]|Q|$R|$5|S|T|U|A|$V|C]]|W|$5|S|T|U|A|$V|X]]]]

Reading source code from a codec that works seems the right way to go.
I suggest the following :
<a href="http://www.mpeg.org/MPEG/video/mssg-free-mpeg-software.html" rel="nofollow noreferrer">http://www.mpeg.org/MPEG/video/mssg-free-mpeg-software.html</a>
Given that it's mentionned on the mpeg.org website, i'd say you'll find what you need here.
In the past i've had some time to work on decoding mpeg videos (no audio though), and the principles are quite simple. There are some pure images included, some intermediary images that are described relatively to the closest main ones, and the rest are described using the closest main/intermediary images.
One time slot, one image. But recent codecs are much more complicated, I guess !
EDIT : synchronization
I am no expert in synchronizing audio and video, but the issue seems to be dealt with using a sync layer (see <a href="https://mpeg.chiariglione.org/faq/which-layers-are-passed-mpeg-4-objects-are-composed" rel="nofollow noreferrer">there</a> for a definition).

blocks|key|4439176|text|对于音视频同步，基本上每个视频帧和音频帧都要打上时间戳。时间戳通常被称为PTS+(演示时间戳)。一旦视频/音频被解码器解码，音频/视频渲染器应该安排帧在正确的时间显示，以便音频/视频同步。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|4439177|我想你可以参考MPEG2+Tutorial的"Timing+Model“一章来了解细节。|offset|length|4439178|entityMap|0|LINK|mutability|MUTABLE|url|http://www.bretl.com/mpeghtml/MPEGindex.htm|1|http://www.bretl.com/mpeghtml/timemdl.HTM^0|0|7|E|0|N|C|1|0^^$0|@$1|2|3|4|5|6|7|P|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|Q|8|@]|9|@$D|R|E|S|1|T]|$D|U|E|V|1|W]]|A|$]]|$1|F|3|-4|5|6|7|X|8|@]|9|@]|A|$]]]|G|$H|$5|I|J|K|A|$L|M]]|N|$5|I|J|K|A|$L|O]]]]

For audio/video synchronization, basically, every video and audio frame should be time-stamped. The timestamp is typically known as PTS (Presentation Time Stamp). Once a video/audio is decoder by decoder, the audio/video renderer should schedule the frame to be displayed at the right time so that audio/video is synchronized.

I think you can refer to chapter "<a href="http://www.bretl.com/mpeghtml/timemdl.HTM" rel="nofollow noreferrer">Timing Model</a>" of <a href="http://www.bretl.com/mpeghtml/MPEGindex.htm" rel="nofollow noreferrer">MPEG2 Tutorial</a> for details.

blocks|key|3592607|text|根据您对MPEG-2格式的了解程度，您可能希望通过先阅读一篇关于MPEG-2格式的文章来获得广泛的概述。我的意思是这样的：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|3592608|A+Beginners+Guide+for+MPEG-2+Standard|offset|length|3592609|MPEG-2+VIDEO+COMPRESSION|3592610|entityMap|0|LINK|mutability|MUTABLE|url|http://www.fh-friedberg.de/fachbereiche/e2/telekom-labor/zinke/mk/mpeg2beg/beginnzi.htm|1|http://www.bbc.co.uk/rd/pubs/papers/paper_14/paper_14.shtml^0|0|0|11|0|0|0|O|1|0^^$0|@$1|2|3|4|5|6|7|R|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|S|8|@]|9|@$D|T|E|U|1|V]]|A|$]]|$1|F|3|G|5|6|7|W|8|@]|9|@$D|X|E|Y|1|Z]]|A|$]]|$1|H|3|-4|5|6|7|10|8|@]|9|@]|A|$]]]|I|$J|$5|K|L|M|A|$N|O]]|P|$5|K|L|M|A|$N|Q]]]]

Depending on how much you know about MPEG-2 format, you might want to get a broad overview by reading an article about it first. I mean something like these:

<a href="http://www.fh-friedberg.de/fachbereiche/e2/telekom-labor/zinke/mk/mpeg2beg/beginnzi.htm" rel="nofollow noreferrer">A Beginners Guide for MPEG-2 Standard</a>

<a href="http://www.bbc.co.uk/rd/pubs/papers/paper_14/paper_14.shtml" rel="nofollow noreferrer">MPEG-2 VIDEO COMPRESSION</a>

blocks|key|3592737|text|@+Patric和Nils|type|unstyled|depth|inlineStyleRanges|entityRanges|data|3592738|所以你说有时间戳，海因...这些只是视频部分，我猜。对于音频，我猜头部中有足够的信息(比如“每秒采样数”)。需要这些时间戳的频率是多少？我想，音频和视频数据包的交错确保了视频数据总是在音频数据之前，或者别的什么？|3592739|编辑:找到我需要的：http://www.dranger.com/ffmpeg/tutorial01.html|offset|length|3592740|entityMap|0|LINK|mutability|MUTABLE|url|http://www.dranger.com/ffmpeg/tutorial01.html^0|0|0|A|19|0|0^^$0|@$1|2|3|4|5|6|7|P|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|Q|8|@]|9|@]|A|$]]|$1|D|3|E|5|6|7|R|8|@]|9|@$F|S|G|T|1|U]]|A|$]]|$1|H|3|-4|5|6|7|V|8|@]|9|@]|A|$]]]|I|$J|$5|K|L|M|A|$N|O]]]]

@ Patric and Nils

So you say that there are timestamps, hein... These are for the video part only I guess. For audio I guess there is enough information in the header (like "samples per second"). How often these time stamps are needed? I imagine that interlacing of audio and video packets ensures that video data is always ahead of audio data or something?

EDIT: Found what I needed: 
<a href="http://www.dranger.com/ffmpeg/tutorial01.html" rel="nofollow noreferrer">http://www.dranger.com/ffmpeg/tutorial01.html</a>

blocks|key|3592761|text|地狱之音|type|unstyled|depth|inlineStyleRanges|entityRanges|data|3592762|音频数据的时间戳仍然是必要的，因为音频和视频帧可能不在同一位置对齐。例如：|3592763|电话:1000104010801120...A:+990+1013+1036+(丢失)+1082|3592764|您可能需要补偿第一个视频/音频帧之间的偏移量。此外，如果(在视频流过程中)可能出现丢包，您需要视频/音频的时间戳来保持准确的同步。|3592765|entityMap^0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|J|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|K|8|@]|9|@]|A|$]]|$1|D|3|E|5|6|7|L|8|@]|9|@]|A|$]]|$1|F|3|G|5|6|7|M|8|@]|9|@]|A|$]]|$1|H|3|-4|5|6|7|N|8|@]|9|@]|A|$]]]|I|$]]

Helltone,

Timestamps for audio data are still necessary because the audio and video frame may not be aligned at the same place. For example:

V: 1000 1040 1080 1120 ...
A: 990 1013 1036 (lost) 1082

You may need to compensate the offset between the first video/audio frame. Besides, if it is possible that there are packet loss (during video streaming), you need the timestamps of both video/audio to keep accurate synchronization.

I want to understand how video and audio decoding works, specially the timing synchronization (how to get 30fps video, how to couple that with audio, etc.). I don't want to know ALL the details, just the essence of it. I want to be able to write a high level simplification of an actual video/audio decoder.

Could you provide pointers to me? An actual C/C++ source code of a MPEG2 video/audio decoder would be the fastest way to understand those things I think.

mpeg 2 decoding

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

我想了解视频和音频解码是如何工作的，特别是定时同步(如何获得30fps视频，如何将其与音频耦合，等等)。我不想知道所有的细节，只想知道它的本质。我希望能够编写一个实际的视频/音频解码器的高度简化。你能给我指点一下吗？我认为，一个实际的MPEG2视频/音频解码器的C/C++源代码将是理解这些东西的最快方法。

问mpeg -2解码
EN

回答 6

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问mpeg -2解码EN

回答 6

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问mpeg -2解码
EN