FR3085785B1 - Methods and apparatus to fingerprint an audio signal via normalization - Google Patents
Methods and apparatus to fingerprint an audio signal via normalization
- Publication number
- FR3085785B1 (application FR1858041A)
- Authority
- FR
- France
- Prior art keywords
- audio signal
- normalization
- generating
- characteristic
- frequency component
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/54—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Stereophonic System (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Circuit For Audible Band Transducer (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
- Compounds Of Alkaline-Earth Elements, Aluminum Or Rare-Earth Metals (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
Methods, apparatus, systems, and articles of manufacture are disclosed to fingerprint audio via normalization. An example method for fingerprinting audio data includes receiving an audio signal as frequency components, including a first audio signal frequency component within a first frequency bin and a second audio signal frequency component within a second frequency bin; determining a first characteristic of the first audio signal frequency component and a second characteristic of the second audio signal frequency component; and normalizing the audio signal to thereby generate normalized energy values, the normalizing including (1) normalizing the first audio signal frequency component using the first characteristic and (2) normalizing the second audio signal frequency component using the second characteristic. The example further includes selecting one of the normalized energy values and generating a fingerprint of the audio signal using the selected one of the normalized energy values.
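The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say which characteristic is used or how values are selected, so the sketch assumes the characteristic is each bin's mean energy and that selection keeps the largest normalized values. The names `normalize_bins`, `select_points`, and `fingerprint` are hypothetical.

```python
def normalize_bins(spectrogram):
    """Divide each frequency bin's energies by that bin's own characteristic.

    spectrogram[b][t] is the energy of frequency bin b at time frame t.
    The characteristic used here is the bin's mean energy (an assumption).
    Each bin is assumed to hold at least one frame.
    """
    normalized = []
    for energies in spectrogram:
        characteristic = sum(energies) / len(energies)
        # Guard against silent bins to avoid dividing by zero.
        scale = characteristic if characteristic > 0 else 1.0
        normalized.append([energy / scale for energy in energies])
    return normalized


def select_points(normalized, count):
    """Return the (bin, frame) positions of the `count` largest normalized values."""
    points = [
        (value, b, t)
        for b, row in enumerate(normalized)
        for t, value in enumerate(row)
    ]
    # Largest value first; ties broken by bin index, then frame index.
    points.sort(key=lambda p: (-p[0], p[1], p[2]))
    return [(b, t) for _, b, t in points[:count]]


def fingerprint(spectrogram, count=2):
    """Toy fingerprint: the sorted positions of the selected normalized values."""
    return sorted(select_points(normalize_bins(spectrogram), count))


# Bin 0 is loud and bin 1 is quiet, yet each contributes a point after normalization.
example = [[10.0, 10.0, 40.0], [1.0, 4.0, 1.0]]
print(fingerprint(example))  # prints [(0, 2), (1, 1)]
```

Without the per-bin normalization, the two largest raw energies in this example would both come from the loud bin 0. Scaling each bin by its own characteristic lets the quiet bin's local peak compete on equal terms.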
Priority Applications (12)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR1858041A FR3085785B1 (fr) | 2018-09-07 | 2018-09-07 | Methods and apparatus to fingerprint an audio signal via normalization |
US16/453,654 US20200082835A1 (en) | 2018-09-07 | 2019-06-26 | Methods and apparatus to fingerprint an audio signal via normalization |
CN201980072112.9A CN113614828B (zh) | 2018-09-07 | 2019-09-06 | Methods and apparatus for fingerprinting an audio signal via normalization |
JP2021512712A JP7346552B2 (ja) | 2018-09-07 | 2019-09-06 | Method, storage medium, and apparatus for fingerprinting an audio signal via normalization |
KR1020247021395A KR20240108548A (ko) | 2018-09-07 | 2019-09-06 | Method and apparatus for fingerprinting an audio signal via normalization |
EP24167083.5A EP4372748A3 (fr) | 2018-09-07 | 2019-09-06 | Methods and apparatus to fingerprint an audio signal via normalization |
EP19857365.1A EP3847642B1 (fr) | 2018-09-07 | 2019-09-06 | Methods and apparatus to fingerprint an audio signal via normalization |
CA3111800A CA3111800A1 (fr) | 2018-09-07 | 2019-09-06 | Methods and apparatus to fingerprint an audio signal via normalization |
KR1020217010094A KR20210082439A (ko) | 2018-09-07 | 2019-09-06 | Method and apparatus for fingerprinting an audio signal via normalization |
PCT/US2019/049953 WO2020051451A1 (fr) | 2018-09-07 | 2019-09-06 | Methods and apparatus to fingerprint an audio signal via normalization |
AU2019335404A AU2019335404B2 (en) | 2018-09-07 | 2019-09-06 | Methods and apparatus to fingerprint an audio signal via normalization |
AU2022275486A AU2022275486B2 (en) | 2018-09-07 | 2022-11-24 | Methods and apparatus to fingerprint an audio signal via normalization |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR1858041A FR3085785B1 (fr) | 2018-09-07 | 2018-09-07 | Methods and apparatus to fingerprint an audio signal via normalization |
Publications (2)
Publication Number | Publication Date |
---|---|
FR3085785A1 (fr) | 2020-03-13 |
FR3085785B1 (fr) | 2021-05-14 |
Family
ID=65861336
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
FR1858041A Active FR3085785B1 (fr) | Methods and apparatus to fingerprint an audio signal via normalization | 2018-09-07 | 2018-09-07 |
Country Status (9)
Country | Link |
---|---|
US (1) | US20200082835A1 (fr) |
EP (2) | EP4372748A3 (fr) |
JP (1) | JP7346552B2 (fr) |
KR (2) | KR20210082439A (fr) |
CN (1) | CN113614828B (fr) |
AU (2) | AU2019335404B2 (fr) |
CA (1) | CA3111800A1 (fr) |
FR (1) | FR3085785B1 (fr) |
WO (1) | WO2020051451A1 (fr) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12032628B2 (en) | 2019-11-26 | 2024-07-09 | Gracenote, Inc. | Methods and apparatus to fingerprint an audio signal via exponential normalization |
US11727953B2 (en) * | 2020-12-31 | 2023-08-15 | Gracenote, Inc. | Audio content recognition method and system |
US11798577B2 (en) | 2021-03-04 | 2023-10-24 | Gracenote, Inc. | Methods and apparatus to fingerprint an audio signal |
US11804231B2 (en) * | 2021-07-02 | 2023-10-31 | Capital One Services, Llc | Information exchange on mobile devices using audio |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5481294A (en) | 1993-10-27 | 1996-01-02 | A. C. Nielsen Company | Audience measurement system utilizing ancillary codes and passive signatures |
WO2003009277A2 (fr) * | 2001-07-20 | 2003-01-30 | Gracenote, Inc. | Automatic identification of sound recordings |
JP2006505821A (ja) | 2002-11-12 | 2006-02-16 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Multimedia content with fingerprint information |
DE102004036154B3 (de) * | 2004-07-26 | 2005-12-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for robust classification of audio signals, method for establishing and operating an audio signal database, and computer program |
US7647209B2 (en) * | 2005-02-08 | 2010-01-12 | Nippon Telegraph And Telephone Corporation | Signal separating apparatus, signal separating method, signal separating program and recording medium |
EP2259253B1 (fr) | 2008-03-03 | 2017-11-15 | LG Electronics Inc. | Method and apparatus for processing an audio signal |
US9313359B1 (en) * | 2011-04-26 | 2016-04-12 | Gracenote, Inc. | Media content identification on mobile devices |
JP5602138B2 (ja) * | 2008-08-21 | 2014-10-08 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Feature optimization and reliability prediction for audio and video signature generation and detection |
CA2716266C (fr) * | 2009-10-01 | 2016-08-16 | Crim (Centre De Recherche Informatique De Montreal) | Content-based audio copy detection |
JP5728888B2 (ja) * | 2010-10-29 | 2015-06-03 | ソニー株式会社 | Signal processing apparatus and method, and program |
EP2751804A1 (fr) * | 2011-08-29 | 2014-07-09 | Telefónica, S.A. | Method for generating audio fingerprints |
US9098576B1 (en) * | 2011-10-17 | 2015-08-04 | Google Inc. | Ensemble interest point detection for audio matching |
KR101286862B1 (ko) * | 2011-11-18 | 2013-07-17 | (주)이스트소프트 | Audio fingerprint search method using block-wise weighting |
US9202472B1 (en) * | 2012-03-29 | 2015-12-01 | Google Inc. | Magnitude ratio descriptors for pitch-resistant audio matching |
US9390719B1 (en) * | 2012-10-09 | 2016-07-12 | Google Inc. | Interest points density control for audio matching |
US9183849B2 (en) * | 2012-12-21 | 2015-11-10 | The Nielsen Company (Us), Llc | Audio matching with semantic audio recognition and report generation |
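CN104125509B (zh) | 2013-04-28 | 2015-09-30 | 腾讯科技(深圳)有限公司 | Program identification method, apparatus, and server |
CN104093079B (zh) * | 2014-05-29 | 2015-10-07 | 腾讯科技(深圳)有限公司 | Interaction method, terminal, server, and system based on a multimedia program |
CN104050259A (zh) * | 2014-06-16 | 2014-09-17 | 上海大学 | Audio fingerprint extraction method based on the SOM algorithm |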
US9837101B2 (en) * | 2014-11-25 | 2017-12-05 | Facebook, Inc. | Indexing based on time-variant transforms of an audio signal's spectrogram |
US10713296B2 (en) * | 2016-09-09 | 2020-07-14 | Gracenote, Inc. | Audio identification based on data structure |
-
2018
- 2018-09-07 FR FR1858041A patent/FR3085785B1/fr active Active
-
2019
- 2019-06-26 US US16/453,654 patent/US20200082835A1/en active Pending
- 2019-09-06 CA CA3111800A patent/CA3111800A1/fr active Pending
- 2019-09-06 WO PCT/US2019/049953 patent/WO2020051451A1/fr unknown
- 2019-09-06 EP EP24167083.5A patent/EP4372748A3/fr active Pending
- 2019-09-06 EP EP19857365.1A patent/EP3847642B1/fr active Active
- 2019-09-06 KR KR1020217010094A patent/KR20210082439A/ko not_active Application Discontinuation
- 2019-09-06 AU AU2019335404A patent/AU2019335404B2/en active Active
- 2019-09-06 CN CN201980072112.9A patent/CN113614828B/zh active Active
- 2019-09-06 KR KR1020247021395A patent/KR20240108548A/ko active Search and Examination
- 2019-09-06 JP JP2021512712A patent/JP7346552B2/ja active Active
-
2022
- 2022-11-24 AU AU2022275486A patent/AU2022275486B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
KR20240108548A (ko) | 2024-07-09 |
KR20210082439A (ko) | 2021-07-05 |
CA3111800A1 (fr) | 2020-03-12 |
EP4372748A3 (fr) | 2024-08-14 |
EP3847642A4 (fr) | 2022-07-06 |
CN113614828A (zh) | 2021-11-05 |
JP2021536596A (ja) | 2021-12-27 |
CN113614828B (zh) | 2024-09-06 |
US20200082835A1 (en) | 2020-03-12 |
EP3847642B1 (fr) | 2024-04-10 |
AU2022275486B2 (en) | 2024-10-10 |
EP3847642A1 (fr) | 2021-07-14 |
WO2020051451A1 (fr) | 2020-03-12 |
FR3085785A1 (fr) | 2020-03-13 |
AU2022275486A1 (en) | 2023-01-05 |
EP4372748A2 (fr) | 2024-05-22 |
JP7346552B2 (ja) | 2023-09-19 |
AU2019335404B2 (en) | 2022-08-25 |
AU2019335404A1 (en) | 2021-04-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
FR3085785B1 (fr) | Methods and apparatus to fingerprint an audio signal via normalization | |
Chang et al. | Music Genre Classification via Compressive Sampling. | |
US10540993B2 (en) | Audio fingerprinting based on audio energy characteristics | |
KR20180034216A (ko) | Signal removal to examine the spectrum of another signal | |
Panwar et al. | A deep learning approach for mapping music genres | |
CN103718242A (zh) | System and method for processing sound signals using a spectral motion transform | |
WO2014122287A1 (fr) | Method and apparatus for determining directions of uncorrelated sound sources in a higher-order Ambisonics representation of a sound field | |
Korshunov et al. | Cross-database evaluation of audio-based spoofing detection systems | |
Ellis et al. | Echoprint: An open music identification service | |
Kamble et al. | Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection. | |
Nguyen et al. | Acoustic scene classification with mismatched recording devices using mixture of experts layer | |
Thambi et al. | Random forest algorithm for improving the performance of speech/non-speech detection | |
Ghasemzadeh | Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenario | |
Pandey et al. | Cell-phone identification from audio recordings using PSD of speech-free regions | |
Mendes et al. | Universal patterns in sound amplitudes of songs and music genres | |
Banchhor et al. | Musical instrument recognition using spectrogram and autocorrelation | |
Guragain et al. | Speech foundation model ensembles for the controlled singing voice deepfake detection (ctrsvdd) challenge 2024 | |
Cao et al. | Infant Cry Detection With Lightweight Wavelet Scattering Networks | |
Choi et al. | Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection | |
Lojka et al. | Modification of widely used feature vectors for real-time acoustic events detection | |
CN109150320B (zh) | Sound wave signal encoding and decoding method and apparatus | |
Hrabina et al. | Implementation of developed gunshot detection algorithm on TMS320C6713 processor | |
CN112581975A (zh) | Ultrasonic voice command defense method based on signal aliasing and dual-channel correlation | |
Liu et al. | Speaker-Aware Anti-Spoofing | |
Alluri et al. | Replay spoofing countermeasures using high spectro-temporal resolution features |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PLFP | Fee payment | Year of fee payment: 2 |
| | PLSC | Publication of the preliminary search report | Effective date: 20200313 |
| | PLFP | Fee payment | Year of fee payment: 3 |
| | PLFP | Fee payment | Year of fee payment: 4 |
| | PLFP | Fee payment | Year of fee payment: 5 |
| | PLFP | Fee payment | Year of fee payment: 6 |
| | PLFP | Fee payment | Year of fee payment: 7 |