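I am a beginner in AI. I can use GATE or any other NLP program, but I don't have an answer for this:
Do you know how to evaluate how close two sentences are? Even with a large data set?
Do you have any advice? I could use the number of permutations, the length, the number of tokens, metaphone, and so on, but I don't know which test to use.
My goal is: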
- "Hello arvis"
- "Hello Romain, how are you"
- "Hello mister Swift"
- I don't know what you are expecting, is this like "Hello Jarvis" ?
- Yes
- Ok, Hello Romain, How are you?
- "Hello mister swift, how are you?"
- I don't know what you are expecting.
Example
1, 2, 3 or n are just examples of a similarity scale.
Basic
- "Hello IA" is close to
  - "Hello IA" by 0
  - "Hello AI" by 1
- "Hello Jarvis" is close to
  - "Hello AI" by 2
  - "Hello IA" by 2
- "Hello! mister Swift" is close to
  - "Hello AI" by 3
  - "Hello IA" by 3
  - "Hello Jarvis" by 2
Less basic
- "Hello IA" is (token length, token word, grammatically, syntactically) close to
  - "Hello IA" by (0,0,0,0)
  - "Hello AI" by (0,1,0,0)
- "Hello Jarvis" is close to
  - "Hello AI" by (0,2,1,1)
  - "Hello IA" by (0,2,1,1)
- "Hello! mister Swift" is close to
  - "Hello AI" by (1,2,2,2)
  - "Hello IA" by (1,2,2,2)
  - "Hello Jarvis" by (1,2,2,2)
Posted on 2014-09-07 14:00:51
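If you are ready to get into hardcore NLP, you can use a classifier for this task. For example, have a look at Stanford NLP (Java) or NLTK (Python).
If you want to keep it simple and use an out-of-the-box solution, have a look at the Wit.ai API, which does exactly what you need, and more.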
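To make the classifier idea concrete, here is a minimal sketch that labels a sentence by its closest known phrase. It does not use Stanford NLP, NLTK, or Wit.ai; it relies only on `difflib` from the Python standard library as a stand-in similarity measure. The phrase table, the labels, and the 0.8 threshold are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical known phrases mapped to intent labels (illustration only).
KNOWN_PHRASES = {
    "hello jarvis": "greeting",
    "what time is it": "ask_time",
}


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the normalized strings are identical."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def classify(sentence: str, threshold: float = 0.8):
    """Return (label, closest_phrase, score); label is None below the threshold."""
    best_phrase, best_score = max(
        ((phrase, similarity(sentence, phrase)) for phrase in KNOWN_PHRASES),
        key=lambda pair: pair[1],
    )
    label = KNOWN_PHRASES[best_phrase] if best_score >= threshold else None
    return label, best_phrase, best_score


if __name__ == "__main__":
    for text in ("Hello arvis", "Hello mister Swift"):
        print(text, "->", classify(text))
```

When the label is `None`, the closest phrase is still returned, so a dialogue system could ask something like: is this like "Hello Jarvis"? A trained classifier would replace the fixed threshold with a learned decision.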
Posted on 2014-09-10 18:57:59
One way to determine string similarity is to use string kernels. Lodhi et al. wrote a good paper explaining how this works:
档案/LodhiSSCW02.pdf
To create a classifier with CoreNLP, you have to create features for the strings, such as n-grams, lemmas, or similar.
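As a rough illustration of n-gram features and of a kernel built on them, the sketch below counts character n-grams and compares two strings by the normalized dot product of those counts. This is a simple spectrum-style kernel over contiguous n-grams, not the gapped subsequence kernel described by Lodhi et al., and it is unrelated to the CoreNLP API. The function names and the default `n = 3` are assumptions.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count the overlapping character n-grams of the lower-cased text."""
    s = text.lower()
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


def spectrum_kernel(a: str, b: str, n: int = 3) -> float:
    """Unnormalized kernel: dot product of the two n-gram count vectors."""
    ca, cb = char_ngrams(a, n), char_ngrams(b, n)
    return float(sum(count * cb[gram] for gram, count in ca.items()))


def normalized_kernel(a: str, b: str, n: int = 3) -> float:
    """Cosine-normalized kernel in [0, 1]; 0.0 if either string has no n-grams."""
    denom = sqrt(spectrum_kernel(a, a, n) * spectrum_kernel(b, b, n))
    return spectrum_kernel(a, b, n) / denom if denom else 0.0


if __name__ == "__main__":
    print(normalized_kernel("Hello Jarvis", "Hello arvis"))
    print(normalized_kernel("Hello Jarvis", "Hello mister Swift"))
```

The same `char_ngrams` counts could also be fed to a classifier as features; word n-grams or lemmas would follow the same pattern with a tokenizer in front.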
https://stackoverflow.com/questions/25701691