Nothing Special   »   [go: up one dir, main page]

CN107622775B - Method for splicing songs containing noise and related products - Google Patents

Method for splicing songs containing noise and related products Download PDF

Info

Publication number
CN107622775B
CN107622775B CN201710996071.5A CN201710996071A CN107622775B CN 107622775 B CN107622775 B CN 107622775B CN 201710996071 A CN201710996071 A CN 201710996071A CN 107622775 B CN107622775 B CN 107622775B
Authority
CN
China
Prior art keywords
audio files
noise
song audio
song
noise reduction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201710996071.5A
Other languages
Chinese (zh)
Other versions
CN107622775A (en
Inventor
程胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201710996071.5A priority Critical patent/CN107622775B/en
Publication of CN107622775A publication Critical patent/CN107622775A/en
Application granted granted Critical
Publication of CN107622775B publication Critical patent/CN107622775B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/101Music Composition or musical creation; Tools or processes therefor
    • G10H2210/125Medley, i.e. linking parts of different musical pieces in one single piece, e.g. sound collage, DJ mix
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/011Files or data streams containing coded musical information, e.g. for transmission
    • G10H2240/016File editing, i.e. modifying musical data files or streams as such
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2250/00Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/295Noise generation, its use, control or rejection for music processing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management Or Editing Of Information On Record Carriers (AREA)

Abstract

The embodiment of the invention discloses a method for splicing songs containing noise and a related product, wherein the method comprises the following steps: acquiring N song audio files; judging whether the N song audio files contain noise audio files or not; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The embodiment of the invention also discloses a device for splicing the songs containing the noise. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.

Description

Method for splicing songs containing noise and related products
Technical Field
The embodiment of the invention relates to the technical field of audio processing, in particular to a method for splicing songs containing noise and a related product.
Background
With the rapid development of mobile internet technology, devices (such as mobile phones, tablet computers, touch) and dedicated players are increasingly pursuing music. At present, in the prior art, the function playing of music is limited to improve the own tone quality, for example, the inferior audio file is properly processed by the processing software provided in the device or the special player, so as to improve the playing quality; or the requirements on the quality of the audio files are high, the original audio files can be played with good effects, and local processing of the audio files, such as shearing technology and extraction technology, is less researched.
In the prior art, the cutting of songs mainly depends on network software, and the software often needs manual operation and cannot accurately position the exact position of each lyric.
Disclosure of Invention
The embodiment of the invention provides a method for splicing songs containing noise and a related product, which can cut and splice audio files of the songs containing the noise.
The first aspect of the embodiments of the present invention provides a method for splicing noise-containing songs, including:
acquiring N song audio files, wherein N is an integer greater than 1;
judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value;
if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value;
extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1;
and splicing the M segments according to a preset sequence to obtain a spliced song audio file.
A second aspect of the embodiments of the present invention provides a device for splicing songs containing noise, including:
the device comprises an acquisition unit, a storage unit and a processing unit, wherein the acquisition unit is used for acquiring N song audio files, and N is an integer greater than 1;
the first judging unit is used for judging whether the N song audio files acquired by the acquiring unit contain noise audio files, and the noise audio files are noise audio files with noise larger than a noise threshold;
the first processing unit is used for performing noise reduction processing on the noise audio files in the N song audio files when the first judging unit judges that the N song audio files contain the noise audio files so as to obtain noise reduction audio files with noise smaller than a threshold value;
a first extracting unit, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
and the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence so as to obtain a spliced song audio file.
The embodiment of the invention has the following beneficial effects:
in the embodiment of the invention, N song audio files are obtained, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the embodiments and the drawings used in the description will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a schematic flowchart of a method for splicing songs with noise according to a first embodiment of the present invention;
fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention;
fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention;
fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;
fig. 5 is a schematic flowchart of a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an apparatus for splicing songs with noise according to a first embodiment of the present invention;
fig. 7 is a schematic structural diagram of an apparatus for splicing songs with noise according to a second embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention without any creative efforts shall fall within the protection scope of the embodiments of the present invention.
The terms "first," "second," "third," and "fourth," etc. in the description and claims of the invention and in the accompanying drawings are used for distinguishing between different objects and not for describing a particular order. Furthermore, the terms "include" and "have," as well as any variations thereof, are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements listed, but may alternatively include other steps or elements not listed, or inherent to such process, method, article, or apparatus.
In implementation, in the embodiment of the present invention, the apparatus may include, but is not limited to: notebook computers, cell phones, tablet computers, smart wearable devices, players, MP3, MP4, smart televisions, set-top boxes, servers, and the like. The system of the device refers to the operating system of the device, and may include but is not limited to: android system, saiban system, Windows system, IOS (mobile operating system developed by apple inc.) system, and the like. It should be noted that the Android device refers to an Android system device, the shift device refers to a shift system device, and the like. The above devices are merely examples and are not exhaustive and include, but are not limited to, the above devices.
In implementation, in the embodiment of the present invention, the songs may include, but are not limited to: chinese songs, english songs, russian songs, spanish songs, classical songs, pop music songs, rock music songs, soft music songs, rap songs, solo songs, songs in video, etc. The above songs are merely exemplary and are not exhaustive, including but not limited to the above songs.
In implementation, the format of the song may include, but is not limited to: MP3, MP4, WMV, WAV, FLV, etc. The formats of the above songs are merely examples and are not exhaustive, including but not limited to the formats of the above songs.
Referring to fig. 1, fig. 1 is a flowchart illustrating an embodiment of a method for splicing songs with noise according to an embodiment of the present invention. The method for splicing the songs containing the noises described in the embodiment comprises the following steps:
s101, acquiring N song audio files.
Specifically, N song audio files may be acquired from a device for splicing the noise-containing songs, or N song audio files may be acquired from a mobile terminal, or N song audio files may be acquired from another manner, where N is an integer greater than 1, the mobile terminal may be, for example, a mobile phone, a tablet computer, a notebook computer, a palm computer, a mobile internet device (MID, mobile internet device), a wearable device (e.g., a smart watch (such as iwatch), a smart bracelet, a pedometer, etc.), or another terminal device capable of installing and deploying an instant messaging application client, and the acquired N song audio files are stored, and the location where the N song audio files are stored may be a local device for splicing the noise-containing songs, a cloud terminal, or another storage space.
S102, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.
Specifically, it is determined whether the N song audio files contain noise, wherein the noise may be an interference signal that affects the sound quality of the audio files. In general, noise is a random signal, and whether the N song audio files contain noise can be determined by simply analyzing the audio files.
As a possible implementation, the noise level is determined by the interference level of the audio file, and the noise will generally cause interference in a certain playing time period of the audio file.
S103, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.
Specifically, if it is determined that the N song audio files contain noise audio files, noise reduction processing is performed on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value, obtain N noise reduction audio files, and store the N noise reduction audio files, where the position where the N noise reduction audio files are stored may be a device local where the noise-containing songs are spliced, or a cloud, or other storage spaces.
S104, extracting M fragments from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.
Specifically, a total of M segments are extracted from a storage space i for storing the N song audio files and a storage space j for storing the noise reduction audio files, and the extracted M segments are stored in a storage space of segments to be spliced, where M is an integer greater than 1.
And S105, splicing the M segments according to a preset sequence to obtain a spliced song audio file.
Specifically, the M segments stored in the storage space of the segments to be spliced are spliced according to a preset sequence to obtain a spliced song audio file, where the preset sequence may be a time sequence added to the storage space of the segments to be spliced, a random sequence preset by a device for splicing songs with noise, or other sequences.
In the embodiment of the invention, N song audio files are obtained, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Referring to fig. 2, fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention, including the steps of:
s201, analyzing the N song audio files to obtain the spectrum change rule of the N song audio files.
Specifically, the N song audio files may be analyzed by an audio file spectrum processing device to obtain a spectrum variation law of the N song audio files.
S202, judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
Specifically, whether the N song audio files contain the noise audio files is judged according to the frequency spectrum change rule of the N song audio files. And analyzing and comparing the frequency spectrum of each song audio file under the normal condition with the frequency spectrum of each song audio file under the noise condition, wherein if the frequency spectrum with obvious fluctuation in the song audio files is the noise frequency spectrum, the song audio files containing the noise frequency spectrum are the noisy song audio files.
In the embodiment of the invention, the N song audio files are analyzed to obtain the frequency spectrum change rule of the N song audio files; and judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files, so that whether the N song audio files contain the noise audio files can be determined.
Referring to fig. 3, fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention, including the steps of:
s301, extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files.
Specifically, the original singing part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original singing portion and an accompaniment portion, and the song audio file may be separated to extract the original singing portion of the song audio file.
S302, determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files.
Specifically, the start time and the end time of each voice segment in the original part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are determined. The original part of the speech can be separated into sentence segments, and each sentence segment has a corresponding start time and end time.
S303, cutting according to the song audio files which are not subjected to the noise reduction processing in the N song audio files and the start time and the end time of each voice fragment in the original part of the noise reduction audio files to extract the M fragments.
Specifically, the M segments are extracted by cutting the start time and the end time of each speech segment in the original part of the song audio file and the noise-reduced audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence segment separated from the voice of the original part according to the corresponding starting time and ending time of each sentence segment to extract M segments.
Extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files; and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.
Referring to fig. 4, fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to the embodiment of the present invention, including the steps of:
s401, extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files.
Specifically, the accompaniment parts of the song audio files and the noise reduction audio files which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original song portion and an accompaniment portion, and the song audio file may be separated to extract the accompaniment portion of the song audio file.
S402, determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files.
Specifically, the start time and the end time of each tune fragment in the song audio file without noise reduction processing and the accompaniment part of the noise reduction audio file in the N song audio files are determined. The voice of the accompaniment part can be separated into sentence fragments, and each sentence fragment has a corresponding starting time and ending time.
And S403, cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment part of the noise reduction audio files in the N song audio files to extract the M fragments.
Specifically, the M clips are extracted by cutting the start time and the end time of each speech clip in the accompaniment part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence fragment separated from the voice of the accompaniment part according to the corresponding starting time and ending time of each sentence fragment so as to extract M fragments.
Extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files; and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.
Referring to fig. 5, fig. 5 is a flowchart illustrating a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention, including the steps of:
s501, acquiring N song audio files.
Specifically, N song audio files may be acquired from a device for splicing the noise-containing songs, or N song audio files may be acquired from a mobile terminal, or N song audio files may be acquired from another manner, where N is an integer greater than 1, the mobile terminal may be, for example, a mobile phone, a tablet computer, a notebook computer, a palm computer, a mobile internet device (MID, mobile internet device), a wearable device (e.g., a smart watch (such as iwatch), a smart bracelet, a pedometer, etc.), or another terminal device capable of installing and deploying an instant messaging application client, and the acquired N song audio files are stored, and the location where the N song audio files are stored may be a local device for splicing the noise-containing songs, a cloud terminal, or another storage space.
S502, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.
Specifically, it is determined whether the N song audio files contain noise, wherein the noise may be an interference signal that affects the sound quality of the audio files. In general, noise is a random signal, and whether the N song audio files contain noise can be determined by simply analyzing the audio files.
As a possible implementation, the noise level is determined by the interference level of the audio file, and the noise will generally cause interference in a certain playing time period of the audio file.
S503, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.
Specifically, if it is determined that the N song audio files contain noise audio files, noise reduction processing is performed on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value, obtain N noise reduction audio files, and store the N noise reduction audio files, where the position where the N noise reduction audio files are stored may be a device local where the noise-containing songs are spliced, or a cloud, or other storage spaces.
S504, M fragments are extracted from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.
Specifically, a total of M segments are extracted from a storage space i for storing the N song audio files and a storage space j for storing the noise reduction audio files, and the extracted M segments are stored in a storage space of segments to be spliced, where M is an integer greater than 1.
And S505, splicing the M segments according to a preset sequence to obtain a spliced song audio file.
Specifically, the M segments stored in the storage space of the segments to be spliced are spliced according to a preset sequence to obtain a spliced song audio file, where the preset sequence may be a time sequence added to the storage space of the segments to be spliced, a random sequence preset by a device for splicing songs with noise, or other sequences.
And S506, locking the splicing position of the spliced song audio files.
Specifically, the splicing place of the high-quality spliced songs is locked. In order to achieve seamless splicing, therefore, the splicing trace cannot be displayed, and the splicing place needs to be locked and preprocessed.
And S507, processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
Specifically, the splicing position of the spliced song audio files is processed to obtain seamless spliced song audio files. Among them, the pretreatment may include, but is not limited to: and adjusting the melody of the spliced place of the spliced songs, inserting the melodies with similar change degrees, and smoothing the spliced place with larger change amplitude.
The embodiment of the invention obtains N song audio files, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; splicing the M segments according to a preset sequence to obtain a spliced song audio file, and locking a splicing place of the high-quality spliced song; and presetting the splicing place of the spliced songs to obtain seamless spliced songs. By adopting the embodiment of the invention, the spliced song audio file can be subjected to preset processing so as to obtain a seamless song with better tone quality.
Referring to fig. 6, fig. 6 is a schematic structural diagram of a device for splicing noise-containing songs according to a first embodiment of the present invention, where the device for splicing noise-containing songs shown in fig. 6 may include an obtaining unit 601, a first determining unit 602, a first processing unit 603, a first extracting unit 604, and a splicing unit 605, which are as follows:
an obtaining unit 601, configured to obtain N song audio files, where N is an integer greater than 1.
A first determining unit 602, configured to determine whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold.
Specifically, the first determining unit 602 includes: an analyzing unit (not shown) for analyzing the N song audio files to obtain a spectrum variation rule of the N song audio files; a second determining unit (not shown) configured to determine whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed by the analyzing unit.
A first processing unit 603, configured to perform noise reduction processing on the noise audio file in the N song audio files when the first determining unit determines that the N song audio files contain a noise audio file, so as to obtain a noise-reduced audio file with noise smaller than a threshold.
A first extracting unit 604, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1.
Specifically, the first extracting unit 604 includes: a second extracting unit (not shown) for extracting song audio files that are not subjected to the noise reduction processing and original singing portions of the noise reduction audio files from among the N song audio files; a first determining unit (not shown) configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unit; a first clipping unit (not shown) configured to clip, according to the start time and the end time of each voice segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, in the N song audio files determined by the first determining unit, so as to extract the M segments.
Specifically, the first extracting unit 604 includes: a third extracting unit (not shown) for extracting the song audio files that are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files from the N song audio files; a second determining unit (not shown) for determining a start time and an end time of each tune fragment in the accompaniment part of the song audio file and the noise-reduced song audio file that are not subjected to the noise reduction processing in the N song audio files of the third extracting unit; a second clipping unit (not shown) configured to clip the start time and the end time of each tune clip in the song audio file that is not subjected to the noise reduction processing and the accompaniment portion of the noise reduction audio file in the N song audio files determined by the second determining unit to extract the M clips.
And the splicing unit 605 is configured to splice the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file.
Optionally, after the splicing unit 605 splices the M segments according to a preset sequence, the apparatus further includes: locking the splicing position of the spliced song audio files; and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
It can be understood that the functions of the functional modules of the apparatus for splicing songs with noise in this embodiment may be specifically implemented according to the method in the method embodiment, and the specific implementation process may refer to the related description of the method embodiment, which is not described herein again.
In the embodiment of the present invention, the obtaining unit 601 obtains N song audio files, where N is an integer greater than 1; a first judging unit 602 judges whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold; when the first judging unit judges that the N song audio files contain noise audio files, the first processing unit 603 performs noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold; a first extracting unit 604 extracts M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files among the N song audio files, where M is an integer greater than 1; the splicing unit 605 splices the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file; the locking unit is used for locking the splicing position of the spliced song audio files; and the second processing unit is used for processing the splicing position of the spliced song audio file locked by the locking unit so as to obtain a seamless spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Referring to fig. 7, fig. 7 is a schematic structural diagram of a device for splicing songs with noise according to a second embodiment of the present invention. The device for splicing the songs containing the noise described in the embodiment comprises: at least one input device 1000; at least one output device 2000; at least one processor 3000, e.g., a CPU; and a memory 4000, the input device 1000, the output device 2000, the processor 3000, and the memory 4000 being connected by a bus 5000.
The input device 1000 may be a touch panel, a general PC, a liquid crystal display, a touch screen, or the like.
The memory 4000 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 4000 is used for storing a set of program codes, and the input device 1000, the output device 2000 and the processor 3000 are used for calling the program codes stored in the memory 4000 to execute the following operations:
the processor 3000 is configured to obtain N song audio files, where N is an integer greater than 1;
the processor 3000 is further configured to determine whether the N song audio files include a noise audio file, where the noise audio file is a noise audio file with noise greater than a noise threshold;
the processor 3000 is further configured to perform noise reduction processing on the noise audio file in the N song audio files when it is determined that the N song audio files contain the noise audio file, so as to obtain a noise reduction audio file with noise smaller than a threshold;
the processor 3000 is further configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
the processor 3000 is further configured to splice the M segments according to a preset sequence to obtain a spliced song audio file.
In some possible embodiments, the processor 3000 is further configured to:
analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
In some possible embodiments, the processor 3000 is further configured to:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files;
and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments.
In some possible embodiments, the processor 3000 is further configured to:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and accompaniment parts of the noise reduction audio files;
determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files;
and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments.
In some possible embodiments, after the processor 3000 splices the M segments according to a preset order to obtain a spliced song audio file, the processor 3000 is further specifically configured to:
locking the splicing position of the spliced song audio files;
and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
In a specific implementation, the input device 1000, the output device 2000, and the processor 3000 described in this embodiment of the present invention may execute the implementation manners described in the embodiments of the method for splicing songs including noise described in fig. 1 to fig. 5 provided in this embodiment of the present invention, or may execute the implementation manners of the apparatus for splicing songs including noise described in the first embodiment of the apparatus for splicing songs including noise provided in this embodiment of the present invention, which is not described herein again.
The modules or sub-modules in all the embodiments of the present invention may be implemented by a general-purpose Integrated Circuit, such as a CPU (Central Processing Unit), or an ASIC (Application Specific Integrated Circuit).
The steps in the method of the embodiment of the invention can be sequentially adjusted, combined and deleted according to actual needs.
The units in the device of the embodiment of the invention can be merged, divided and deleted according to actual needs.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.
While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.

Claims (6)

1. A method for splicing songs containing noise is characterized by comprising the following steps:
acquiring N song audio files from a cloud, wherein N is an integer greater than 1;
judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value;
if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value;
extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1;
splicing the M segments according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced; locking the splicing position of the spliced song audio files, and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;
wherein, the extracting M segments from the song audio files without the noise reduction processing and the noise reduction audio files in the N song audio files, where M is an integer greater than 1, includes:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio filesThe determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: the voice of the original singing partSeparating into one statement segment, wherein each statement segment has corresponding starting time and ending time;
and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments.
2. The method of claim 1, wherein said determining whether the N song audio files contain a noise audio file comprises:
analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
3. An apparatus for splicing noise-containing songs, comprising:
the acquisition unit is used for acquiring N song audio files from the cloud, wherein N is an integer greater than 1;
the first judging unit is used for judging whether the N song audio files acquired by the acquiring unit contain noise audio files, and the noise audio files are noise audio files with noise larger than a noise threshold;
the first processing unit is used for performing noise reduction processing on the noise audio files in the N song audio files when the first judging unit judges that the N song audio files contain the noise audio files so as to obtain noise reduction audio files with noise smaller than a threshold value;
a first extracting unit, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced;
the apparatus is further specifically configured to: locking the splicing position of the spliced song audio files; processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;
wherein the first extraction unit includes:
the second extraction unit is used for extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
a first determining unit, configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unitThe determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: separating the voice of the original singing part into sentence segments, wherein each sentence segment has corresponding starting time and ending time;
and the first cutting unit is used for cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files determined by the first determining unit so as to extract the M segments.
4. The apparatus of claim 3, wherein the first determination unit comprises:
the analysis unit is used for analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and the second judging unit is used for judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed and obtained by the analyzing unit.
5. An apparatus for splicing noise-containing songs, comprising:
a processor and a memory; wherein the processor performs the method of any of claims 1 to 2 by calling code or instructions in the memory.
6. A computer storage medium for storing a computer program, wherein the computer program causes a computer to perform the method of any one of claims 1 to 2.
CN201710996071.5A 2015-03-20 2015-03-20 Method for splicing songs containing noise and related products Expired - Fee Related CN107622775B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710996071.5A CN107622775B (en) 2015-03-20 2015-03-20 Method for splicing songs containing noise and related products

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710996071.5A CN107622775B (en) 2015-03-20 2015-03-20 Method for splicing songs containing noise and related products
CN201510125170.7A CN104778958B (en) 2015-03-20 2015-03-20 A kind of method and device of Noise song splicing

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201510125170.7A Division CN104778958B (en) 2015-03-20 2015-03-20 A kind of method and device of Noise song splicing

Publications (2)

Publication Number Publication Date
CN107622775A CN107622775A (en) 2018-01-23
CN107622775B true CN107622775B (en) 2020-12-18

Family

ID=53620385

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201710996071.5A Expired - Fee Related CN107622775B (en) 2015-03-20 2015-03-20 Method for splicing songs containing noise and related products
CN201510125170.7A Expired - Fee Related CN104778958B (en) 2015-03-20 2015-03-20 A kind of method and device of Noise song splicing

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201510125170.7A Expired - Fee Related CN104778958B (en) 2015-03-20 2015-03-20 A kind of method and device of Noise song splicing

Country Status (1)

Country Link
CN (2) CN107622775B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106970950B (en) * 2017-03-07 2021-08-24 腾讯音乐娱乐(深圳)有限公司 Similar audio data searching method and device
CN107591149B (en) * 2017-09-18 2021-09-28 腾讯音乐娱乐科技(深圳)有限公司 Audio synthesis method, device and storage medium
CN109949792B (en) * 2019-03-28 2021-08-13 优信拍(北京)信息科技有限公司 Multi-audio synthesis method and device
CN112037739B (en) * 2020-09-01 2024-02-27 腾讯音乐娱乐科技(深圳)有限公司 Data processing method and device and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102348150A (en) * 2010-07-22 2012-02-08 索尼公司 Audio signal processing apparatus, audio signal processing method, and program
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN203070769U (en) * 2013-02-26 2013-07-17 珠海市杰理科技有限公司 Automatic denoising device for audio playing and audio player
US20140039891A1 (en) * 2007-10-16 2014-02-06 Adobe Systems Incorporated Automatic separation of audio data
CN103943113A (en) * 2014-04-15 2014-07-23 福建星网视易信息系统有限公司 Method and device for removing accompaniment from song
CN104244086A (en) * 2014-09-03 2014-12-24 陈飞 Video real-time splicing device and method based on real-time conversation semantic analysis

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7948981B1 (en) * 2006-10-23 2011-05-24 Adobe Systems Incorpoated Methods and apparatus for representing audio data
CN102467939B (en) * 2010-11-04 2014-08-13 北京彩云在线技术开发有限公司 Song audio frequency cutting apparatus and method thereof
CN104112444B (en) * 2014-07-28 2018-11-06 中国科学院自动化研究所 A kind of waveform concatenation phoneme synthesizing method based on text message

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140039891A1 (en) * 2007-10-16 2014-02-06 Adobe Systems Incorporated Automatic separation of audio data
CN102348150A (en) * 2010-07-22 2012-02-08 索尼公司 Audio signal processing apparatus, audio signal processing method, and program
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN203070769U (en) * 2013-02-26 2013-07-17 珠海市杰理科技有限公司 Automatic denoising device for audio playing and audio player
CN103943113A (en) * 2014-04-15 2014-07-23 福建星网视易信息系统有限公司 Method and device for removing accompaniment from song
CN104244086A (en) * 2014-09-03 2014-12-24 陈飞 Video real-time splicing device and method based on real-time conversation semantic analysis

Also Published As

Publication number Publication date
CN107622775A (en) 2018-01-23
CN104778958B (en) 2017-11-24
CN104778958A (en) 2015-07-15

Similar Documents

Publication Publication Date Title
EP3142031B1 (en) Preset style song processing method and apparatus
US11176141B2 (en) Preserving emotion of user input
CN102568478B (en) Video play control method and system based on voice recognition
CN107622775B (en) Method for splicing songs containing noise and related products
CN105630787B (en) Animation realization method and device based on dynamic portable network graphics
CN111767740B (en) Sound effect adding method and device, storage medium and electronic equipment
US12119023B2 (en) Audio onset detection method and apparatus
CN104485115A (en) Pronunciation evaluation equipment, method and system
CN110688518A (en) Rhythm point determining method, device, equipment and storage medium
CN108509175B (en) Voice interaction method and electronic equipment
CN112908292A (en) Text voice synthesis method and device, electronic equipment and storage medium
CN104778217B (en) A kind of song stitching algorithm and device
CN104778221A (en) Music collaborate splicing method and device
CN110070885B (en) Audio starting point detection method and device
CN104778220B (en) A kind of method and device for song splicing of singing opera arias
CN104778957A (en) Song audio processing method and device
CN110286776A (en) Input method, device, electronic equipment and the storage medium of character combination information
CN110085214B (en) Audio starting point detection method and device
CN111459446B (en) Resource processing method of electronic book, computing equipment and computer storage medium
CN104780438A (en) Method and device for splicing video and song audio
CN106572140B (en) Media file playing method and terminal equipment
CN109495786B (en) Pre-configuration method and device of video processing parameter information and electronic equipment
CN112542157B (en) Speech processing method, device, electronic equipment and computer readable storage medium
CN115484503B (en) Bullet screen generation method and device, electronic equipment and storage medium
CN105469794B (en) Information processing method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20201218

CF01 Termination of patent right due to non-payment of annual fee