In the previous article we covered how to download the phoneME virtual machine source code, configure the build scripts, and compile and verify its basic functionality. But to get it to display and run Java mini-games on Android we still have some work to do, so let's port the interfaces together. 1) Where to draw: the Java layer creates a Bitmap, obtains the actual FrameBuffer pointer through JNI, and passes it to phoneME, which gives it somewhere to paint. 2) How to refresh: normal refresh or rotated refresh. 3) Key input: on Android we simulate the key device in the Java layer by creating a stream (via BufferedOutputStream); key values captured in the onKeyDown event are translated and written into the stream, and the phoneME lower layer reads this stream device to obtain the key value and dispatch the corresponding event. So let's take on the challenge. Looking back, the phoneME JVM powered plenty of Java mini-games on old Nokia phones, Snake being the classic (don't tell me you never played it). Slightly off topic, but my point is that those games all had sound.
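The key-input path described above (the Java layer translates Android key codes and writes them into a stream; the phoneME lower layer reads that stream device to recover the key value) can be sketched as follows. This is a rough illustration only: the key-code mapping and the 4-byte little-endian framing are assumptions made for the sketch, not the port's actual protocol.

```python
import struct
from io import BytesIO

# Hypothetical key-code mapping; the real one depends on the MIDP key
# constants used by the phoneME port.
KEY_MAP = {"KEYCODE_DPAD_UP": 1, "KEYCODE_DPAD_DOWN": 2, "KEYCODE_DPAD_CENTER": 8}

def write_key_event(stream, android_keycode):
    """Java-layer side: translate an Android key code and write it to the stream."""
    midp_key = KEY_MAP[android_keycode]
    stream.write(struct.pack("<i", midp_key))  # 4-byte little-endian key value

def read_key_event(stream):
    """phoneME side: read the next 4 bytes from the stream device and decode them."""
    data = stream.read(4)
    return struct.unpack("<i", data)[0]

# Round-trip through an in-memory stream standing in for the pipe/file device
pipe = BytesIO()
write_key_event(pipe, "KEYCODE_DPAD_CENTER")
pipe.seek(0)
print(read_key_event(pipe))  # -> 8
```

The same framing works over any byte stream, which is why a plain file or pipe is enough to bridge the Java layer and the native VM.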
Once again we turn to open source to get the job done. A Google search turned up a good open-source JVM: phoneME, Sun's open-source Java virtual machine. About phoneME: phoneME Feature software is an optimized Java ME stack. At its core is a multitasking implementation of the MIDP 2.1 specification. When phoneME Feature software runs multiple MIDlets, it uses only a single system process, because one Java VM instance can execute several applications while giving each an isolated runtime space. phoneME Feature software targets developers working with MIDP and mobile information devices. The MR2 release of phoneME Feature software includes:
- A high-performance Java ME platform architecture
- Modular implementations of each functional area (storage, networking, user interface, etc.)
Phonemes and allophones. This video introduces the notion of the phoneme as a basic unit of phonological analysis. Phonologists sometimes formalize the relationship between a phoneme and its allophones in a rule; the blank shows where the phoneme occurs in order for the rule to apply. In order to fully define a phoneme, we first need to observe the surface forms that occur. The phoneme inventory is a design choice when we build a TTS or ASR system.
Viewed as a whole, speech recognition is a Seq2Seq transformation problem. What are the choices for the output token? Briefly, there are five. Phoneme: the smallest unit of pronunciation. After converting the speech signal into phonemes, a further step is needed to turn the phoneme sequence into text, so this approach is not end-to-end and requires post-processing. How do we convert phonemes to text? Taking English as an example: a lexicon lists the phoneme representation of every word, one row per word, so the table has as many entries as the vocabulary, which is clearly a lot. Only by looking up this table can we convert phonemes into text. This choice of token works for both English and Chinese: English has its phonetic alphabet and Chinese has pinyin, though the two languages' phoneme sets and lexicons differ. Grapheme: the smallest unit of writing. For English, the graphemes are the 26 letters; for Chinese, roughly the 4000+ common characters.
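The lexicon lookup step can be sketched as follows. The three-entry lexicon and the greedy longest-match decoding are illustrative assumptions; real systems use full dictionaries such as CMUdict and probabilistic decoding.

```python
# Toy lexicon mapping phoneme sequences to words. A real lexicon (e.g.
# CMUdict) has one row per word in the vocabulary; this is a three-entry
# stand-in for illustration.
LEXICON = {
    ("K", "AE", "T"): "cat",
    ("D", "AO", "G"): "dog",
    ("HH", "EH", "L", "OW"): "hello",
}

def phonemes_to_text(phoneme_seq, lexicon=LEXICON):
    """Greedy longest-match decoding of a phoneme sequence into words."""
    words, i = [], 0
    max_len = max(len(k) for k in lexicon)
    while i < len(phoneme_seq):
        for n in range(min(max_len, len(phoneme_seq) - i), 0, -1):
            key = tuple(phoneme_seq[i:i + n])
            if key in lexicon:
                words.append(lexicon[key])
                i += n
                break
        else:
            i += 1  # skip a phoneme no lexicon entry matches
    return " ".join(words)

print(phonemes_to_text(["HH", "EH", "L", "OW", "K", "AE", "T"]))  # -> hello cat
```

This also makes the post-processing cost concrete: the decoder's work grows with the lexicon, which is why grapheme or word tokens remove this step entirely.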
Sentence-level overall score.
words: array of per-word scores
- word: the word
- start: word start time, in seconds
- end: word end time, in seconds
- pronunciation: word accuracy score
- phonemes: phoneme array
  - phoneme: the phoneme (meaningless for consonants)

Example fragment:

```
'pronunciation': 50.640331,  // phoneme accuracy score
'stress_detect': False,      // the user did not stress this phoneme within the word
'phonemes': [{
    'stress_ref': False,
    'pronunciation': 79.084282,
    'stress_detect': False,
    'phoneme': 0.944885
}, {
    'stress_ref': True,
    'pronunciation': 74.536934,
    'stress_detect': True,
    'phoneme': 0.838557
}, {
    'stress_ref': True,
    'pronunciation': 63.982838,
    'stress_detect': True,
    'phoneme
```
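A minimal sketch of consuming such a response. The field names follow the snippet above; the sample values are copied from it, and the score threshold is an arbitrary assumption for illustration.

```python
# Sample word entry using the field names from the scoring payload above;
# values copied from the snippet, structure otherwise hypothetical.
word_result = {
    "word": "example",
    "pronunciation": 50.640331,
    "phonemes": [
        {"stress_ref": False, "pronunciation": 79.084282, "stress_detect": False},
        {"stress_ref": True,  "pronunciation": 74.536934, "stress_detect": True},
        {"stress_ref": True,  "pronunciation": 63.982838, "stress_detect": True},
    ],
}

def weak_phonemes(word, threshold=60.0):
    """Indices of phonemes scored below the threshold, plus stress mismatches
    (where the detected stress differs from the reference)."""
    weak = [i for i, p in enumerate(word["phonemes"])
            if p["pronunciation"] < threshold]
    stress_errors = [i for i, p in enumerate(word["phonemes"])
                     if p["stress_ref"] != p["stress_detect"]]
    return weak, stress_errors

print(weak_phonemes(word_result, threshold=70.0))  # -> ([2], [])
```

Feedback UIs typically highlight exactly these two error classes: low per-phoneme accuracy and reference/detected stress disagreement.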
...text and linguistic information to build a broad-coverage, fine-grained, multi-dimensional Thai corpus. Next, LLM-enhanced pause prediction, word segmentation, and hybrid G2P robustly convert raw text into a structured phoneme-tone sequence; finally, on top of this refined input, a tone-aware Phoneme-Tone model is introduced. The data falls into three categories: speech, text, and annotations. Speech data: 500 hours drawn from news, social media, podcasts, and other domains, plus 40 hours from vertical domains such as finance, medicine, education, and law, covering both general synthesis and the pronunciation of specialist terms. Text data: a corpus of 1,000,000 sentences used for training. The Phoneme-Tone model combines three design elements: multi-source features, tone awareness, and zero-shot cloning. First, a multilingual pretrained model extracts robust features such as duration, pitch, and energy, and a style encoder compresses speaker/emotion information, laying the groundwork for zero-shot cloning; second, through Phoneme-Tone
The source-filter model: we find the impulse response/frequency response of the original sounds, and the model can then generate any speech sound, that is, any phoneme.
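A minimal sketch of the source-filter idea: a periodic impulse-train source convolved with a vocal-tract impulse response produces the output sound. The 4-tap response here is a made-up stand-in for a measured one.

```python
# Source-filter sketch: output = source (impulse train) * filter (impulse
# response), where * is convolution. Pure-Python convolution for clarity.
def convolve(source, impulse_response):
    out = [0.0] * (len(source) + len(impulse_response) - 1)
    for i, s in enumerate(source):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out

period = 5  # pitch period in samples (toy value)
source = [1.0 if n % period == 0 else 0.0 for n in range(15)]  # impulse train
h = [1.0, 0.6, 0.3, 0.1]  # made-up vocal-tract impulse response

output = convolve(source, h)
# Each source impulse is replaced by a copy of the filter's impulse response:
print(output[:4])  # -> [1.0, 0.6, 0.3, 0.1]
```

Swapping in a different impulse response (a different vocal-tract shape) changes which phoneme the same source produces, which is exactly the model's claim.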
The model is designed to provide the following feedback types. Pronunciation Correction: based on phoneme-level accuracy, point out errors in the user's word pronunciation, stress, and intonation. Challenge for the pronunciation-correction module (pitch/phoneme analysis): an ASR score alone is not enough; a dedicated speech-assessment API or an in-house module must be integrated to analyze the user's speech at the pitch and phoneme level.
Experiments show that the performance of the universal phoneme-based CTC system can be improved by applying alignment mode and multiple recognition passes searching for the substitution and deletion of each expected phoneme.
(From "Design Details to Product Philosophy") 2.3 Technical feasibility verification. For the core interaction features, we validated the implementation approach. Speech playback: the Web Speech API supports English pronunciation playback; test code:

```javascript
function playPhonetic(phoneme) {
  const utterance = new SpeechSynthesisUtterance(phoneme);
  utterance.lang = 'en-US';
  speechSynthesis.speak(utterance);
}

// `icons` is the collection of pronunciation icons on the page
icons.forEach(icon => {
  icon.addEventListener('click', () => {
    const phoneme = icon.nextElementSibling.textContent; // grab the phonetic text
    playPhonetic(phoneme);
  });
});
```

(2) Learning-progress storage and sync uses localStorage.
This leads us to revisit the concept of the phoneme. We use a decision tree when we want to learn rules from data.
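A minimal sketch of learning a rule from data with a one-level decision tree (a decision stump). The /t/-flapping data and feature names are invented for illustration; real systems grow deeper trees over many phonetic-context features.

```python
from collections import Counter

# Toy data: does /t/ surface as plain [t] or as a flap? Each example lists
# binary context features and the observed allophone (invented data).
data = [
    ({"between_vowels": True,  "word_final": False}, "flap"),
    ({"between_vowels": True,  "word_final": True},  "flap"),
    ({"between_vowels": False, "word_final": False}, "t"),
    ({"between_vowels": False, "word_final": True},  "t"),
    ({"between_vowels": True,  "word_final": False}, "flap"),
]

def best_split(examples):
    """Pick the feature whose yes/no split misclassifies the fewest examples
    when each side predicts its majority label."""
    def errors(feature):
        sides = {True: Counter(), False: Counter()}
        for feats, label in examples:
            sides[feats[feature]][label] += 1
        return sum(sum(c.values()) - max(c.values())
                   for c in sides.values() if c)
    return min(examples[0][0].keys(), key=errors)

print(best_split(data))  # -> between_vowels
```

On this toy data the stump recovers the intervocalic-flapping rule: splitting on `between_vowels` classifies every example correctly, while `word_final` does not.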
Coarticulation is the overlapping of adjacent articulations or the influence of the target phoneme on
```
"SessionId": "session-1234",
"Subtitles": [
    {"BeginIndex": 0, "BeginTime": 250, "EndIndex": 1, "EndTime": 430, "Phoneme": "ni2", "Text": "你"},
    {"BeginIndex": 1, "BeginTime": 430, "EndIndex": 2, "EndTime": 670, "Phoneme": "hao3
```
So we need to change the modeling from $P(X|Y)$ to $P(X|S)$, where S is the state: a human-defined unit even smaller than the phoneme. Every phoneme in a sequence is influenced by the phonemes before and after it. That is why we use units smaller than the phoneme to represent states: we want to assume that the distribution each state emits is stationary. Why not use characters as states? Because the pronunciation of a letter like "c" is not fixed.
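The phoneme-to-state expansion can be sketched as follows, assuming the common convention of three sub-phonetic states per phoneme (begin/middle/end), so that each state's emission distribution can plausibly be treated as stationary.

```python
# Expand a phoneme sequence into HMM states: each phoneme becomes several
# sub-phonetic states (3 is the conventional choice).
def phonemes_to_states(phonemes, states_per_phoneme=3):
    return [f"{p}_{i}" for p in phonemes for i in range(states_per_phoneme)]

print(phonemes_to_states(["k", "ae", "t"]))
# -> ['k_0', 'k_1', 'k_2', 'ae_0', 'ae_1', 'ae_2', 't_0', 't_1', 't_2']
```

In practice the states are usually context-dependent (triphones) rather than plain sub-phoneme indices, but the expansion step has the same shape.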
Transport protocol (40% lower bandwidth); dynamic command-queue buffering (resilient to network jitter). Multimodal driving fusion: the lip-sync system architecture:

```python
# Lip-sync driving priority algorithm (example)
def lip_sync_priority(text, emotion):
    phoneme = analyze_phoneme(text)
    weight = emotion_dict[emotion]['lip_weight']
    return phoneme * weight
```

Emotion state determines lip amplitude.
Although a phoneme classifier can be used for KWS, exploiting a large amount of transcribed data for automatic speech recognition (ASR), there is a mismatch between the training criterion (phoneme recognition) and the target task. In this approach, the output of an acoustic model is split into two branches for the two tasks, one for phoneme
Improving Word Recognition in Speech Transcriptions by Decision-level Fusion of Stemming and Two-way Phoneme Pruning. The work addresses correcting highly imperfect speech transcriptions based on a decision-level fusion of stemming and two-way phoneme pruning. In our approach we tried to improve the baseline accuracy from 9.34% by using stemming and phoneme extraction. A two-way phoneme pruning is proposed that comprises two non-sequential steps: 1) filtering and ... After obtaining results of stemming and two-way phoneme pruning, we applied decision-level fusion and
Highlights: high-fidelity real-time conversation practice with phoneme-level pronunciation correction. The trend for 2026 is seamless Chinese-English switching: when you get stuck, just ask in Chinese "how do I say this?", and the AI immediately supplies an idiomatic expression.
Abstract: This paper proposes a multi-task learning network with phoneme-aware and channel-wise attentive learning. The phoneme-aware attentive pooling is exploited on frame-level features in the main network for the speaker classifier, with the corresponding posterior probability for the phoneme distribution in the auxiliary