CN107622775B - Method for splicing songs containing noise and related products - Google Patents
Method for splicing songs containing noise and related products Download PDFInfo
- Publication number
- CN107622775B CN107622775B CN201710996071.5A CN201710996071A CN107622775B CN 107622775 B CN107622775 B CN 107622775B CN 201710996071 A CN201710996071 A CN 201710996071A CN 107622775 B CN107622775 B CN 107622775B
- Authority
- CN
- China
- Prior art keywords
- audio files
- noise
- song audio
- song
- noise reduction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 239000012634 fragment Substances 0.000 claims abstract description 28
- 238000001228 spectrum Methods 0.000 claims description 20
- 238000000605 extraction Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 3
- 238000010586 diagram Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/101—Music Composition or musical creation; Tools or processes therefor
- G10H2210/125—Medley, i.e. linking parts of different musical pieces in one single piece, e.g. sound collage, DJ mix
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/011—Files or data streams containing coded musical information, e.g. for transmission
- G10H2240/016—File editing, i.e. modifying musical data files or streams as such
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/295—Noise generation, its use, control or rejection for music processing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
Abstract
The embodiment of the invention discloses a method for splicing songs containing noise and a related product, wherein the method comprises the following steps: acquiring N song audio files; judging whether the N song audio files contain noise audio files or not; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The embodiment of the invention also discloses a device for splicing the songs containing the noise. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Description
Technical Field
The embodiment of the invention relates to the technical field of audio processing, in particular to a method for splicing songs containing noise and a related product.
Background
With the rapid development of mobile internet technology, devices (such as mobile phones, tablet computers, touch) and dedicated players are increasingly pursuing music. At present, in the prior art, the function playing of music is limited to improve the own tone quality, for example, the inferior audio file is properly processed by the processing software provided in the device or the special player, so as to improve the playing quality; or the requirements on the quality of the audio files are high, the original audio files can be played with good effects, and local processing of the audio files, such as shearing technology and extraction technology, is less researched.
In the prior art, the cutting of songs mainly depends on network software, and the software often needs manual operation and cannot accurately position the exact position of each lyric.
Disclosure of Invention
The embodiment of the invention provides a method for splicing songs containing noise and a related product, which can cut and splice audio files of the songs containing the noise.
The first aspect of the embodiments of the present invention provides a method for splicing noise-containing songs, including:
acquiring N song audio files, wherein N is an integer greater than 1;
judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value;
if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value;
extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1;
and splicing the M segments according to a preset sequence to obtain a spliced song audio file.
A second aspect of the embodiments of the present invention provides a device for splicing songs containing noise, including:
the device comprises an acquisition unit, a storage unit and a processing unit, wherein the acquisition unit is used for acquiring N song audio files, and N is an integer greater than 1;
the first judging unit is used for judging whether the N song audio files acquired by the acquiring unit contain noise audio files, and the noise audio files are noise audio files with noise larger than a noise threshold;
the first processing unit is used for performing noise reduction processing on the noise audio files in the N song audio files when the first judging unit judges that the N song audio files contain the noise audio files so as to obtain noise reduction audio files with noise smaller than a threshold value;
a first extracting unit, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
and the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence so as to obtain a spliced song audio file.
The embodiment of the invention has the following beneficial effects:
in the embodiment of the invention, N song audio files are obtained, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the embodiments and the drawings used in the description will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a schematic flowchart of a method for splicing songs with noise according to a first embodiment of the present invention;
fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention;
fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention;
fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;
fig. 5 is a schematic flowchart of a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an apparatus for splicing songs with noise according to a first embodiment of the present invention;
fig. 7 is a schematic structural diagram of an apparatus for splicing songs with noise according to a second embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention without any creative efforts shall fall within the protection scope of the embodiments of the present invention.
The terms "first," "second," "third," and "fourth," etc. in the description and claims of the invention and in the accompanying drawings are used for distinguishing between different objects and not for describing a particular order. Furthermore, the terms "include" and "have," as well as any variations thereof, are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements listed, but may alternatively include other steps or elements not listed, or inherent to such process, method, article, or apparatus.
In implementation, in the embodiment of the present invention, the apparatus may include, but is not limited to: notebook computers, cell phones, tablet computers, smart wearable devices, players, MP3, MP4, smart televisions, set-top boxes, servers, and the like. The system of the device refers to the operating system of the device, and may include but is not limited to: android system, saiban system, Windows system, IOS (mobile operating system developed by apple inc.) system, and the like. It should be noted that the Android device refers to an Android system device, the shift device refers to a shift system device, and the like. The above devices are merely examples and are not exhaustive and include, but are not limited to, the above devices.
In implementation, in the embodiment of the present invention, the songs may include, but are not limited to: chinese songs, english songs, russian songs, spanish songs, classical songs, pop music songs, rock music songs, soft music songs, rap songs, solo songs, songs in video, etc. The above songs are merely exemplary and are not exhaustive, including but not limited to the above songs.
In implementation, the format of the song may include, but is not limited to: MP3, MP4, WMV, WAV, FLV, etc. The formats of the above songs are merely examples and are not exhaustive, including but not limited to the formats of the above songs.
Referring to fig. 1, fig. 1 is a flowchart illustrating an embodiment of a method for splicing songs with noise according to an embodiment of the present invention. The method for splicing the songs containing the noises described in the embodiment comprises the following steps:
s101, acquiring N song audio files.
Specifically, N song audio files may be acquired from a device for splicing the noise-containing songs, or N song audio files may be acquired from a mobile terminal, or N song audio files may be acquired from another manner, where N is an integer greater than 1, the mobile terminal may be, for example, a mobile phone, a tablet computer, a notebook computer, a palm computer, a mobile internet device (MID, mobile internet device), a wearable device (e.g., a smart watch (such as iwatch), a smart bracelet, a pedometer, etc.), or another terminal device capable of installing and deploying an instant messaging application client, and the acquired N song audio files are stored, and the location where the N song audio files are stored may be a local device for splicing the noise-containing songs, a cloud terminal, or another storage space.
S102, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.
Specifically, it is determined whether the N song audio files contain noise, wherein the noise may be an interference signal that affects the sound quality of the audio files. In general, noise is a random signal, and whether the N song audio files contain noise can be determined by simply analyzing the audio files.
As a possible implementation, the noise level is determined by the interference level of the audio file, and the noise will generally cause interference in a certain playing time period of the audio file.
S103, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.
Specifically, if it is determined that the N song audio files contain noise audio files, noise reduction processing is performed on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value, obtain N noise reduction audio files, and store the N noise reduction audio files, where the position where the N noise reduction audio files are stored may be a device local where the noise-containing songs are spliced, or a cloud, or other storage spaces.
S104, extracting M fragments from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.
Specifically, a total of M segments are extracted from a storage space i for storing the N song audio files and a storage space j for storing the noise reduction audio files, and the extracted M segments are stored in a storage space of segments to be spliced, where M is an integer greater than 1.
And S105, splicing the M segments according to a preset sequence to obtain a spliced song audio file.
Specifically, the M segments stored in the storage space of the segments to be spliced are spliced according to a preset sequence to obtain a spliced song audio file, where the preset sequence may be a time sequence added to the storage space of the segments to be spliced, a random sequence preset by a device for splicing songs with noise, or other sequences.
In the embodiment of the invention, N song audio files are obtained, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Referring to fig. 2, fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention, including the steps of:
s201, analyzing the N song audio files to obtain the spectrum change rule of the N song audio files.
Specifically, the N song audio files may be analyzed by an audio file spectrum processing device to obtain a spectrum variation law of the N song audio files.
S202, judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
Specifically, whether the N song audio files contain the noise audio files is judged according to the frequency spectrum change rule of the N song audio files. And analyzing and comparing the frequency spectrum of each song audio file under the normal condition with the frequency spectrum of each song audio file under the noise condition, wherein if the frequency spectrum with obvious fluctuation in the song audio files is the noise frequency spectrum, the song audio files containing the noise frequency spectrum are the noisy song audio files.
In the embodiment of the invention, the N song audio files are analyzed to obtain the frequency spectrum change rule of the N song audio files; and judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files, so that whether the N song audio files contain the noise audio files can be determined.
Referring to fig. 3, fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention, including the steps of:
s301, extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files.
Specifically, the original singing part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original singing portion and an accompaniment portion, and the song audio file may be separated to extract the original singing portion of the song audio file.
S302, determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files.
Specifically, the start time and the end time of each voice segment in the original part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are determined. The original part of the speech can be separated into sentence segments, and each sentence segment has a corresponding start time and end time.
S303, cutting according to the song audio files which are not subjected to the noise reduction processing in the N song audio files and the start time and the end time of each voice fragment in the original part of the noise reduction audio files to extract the M fragments.
Specifically, the M segments are extracted by cutting the start time and the end time of each speech segment in the original part of the song audio file and the noise-reduced audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence segment separated from the voice of the original part according to the corresponding starting time and ending time of each sentence segment to extract M segments.
Extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files; and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.
Referring to fig. 4, fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to the embodiment of the present invention, including the steps of:
s401, extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files.
Specifically, the accompaniment parts of the song audio files and the noise reduction audio files which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original song portion and an accompaniment portion, and the song audio file may be separated to extract the accompaniment portion of the song audio file.
S402, determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files.
Specifically, the start time and the end time of each tune fragment in the song audio file without noise reduction processing and the accompaniment part of the noise reduction audio file in the N song audio files are determined. The voice of the accompaniment part can be separated into sentence fragments, and each sentence fragment has a corresponding starting time and ending time.
And S403, cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment part of the noise reduction audio files in the N song audio files to extract the M fragments.
Specifically, the M clips are extracted by cutting the start time and the end time of each speech clip in the accompaniment part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence fragment separated from the voice of the accompaniment part according to the corresponding starting time and ending time of each sentence fragment so as to extract M fragments.
Extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files; and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.
Referring to fig. 5, fig. 5 is a flowchart illustrating a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention, including the steps of:
s501, acquiring N song audio files.
Specifically, N song audio files may be acquired from a device for splicing the noise-containing songs, or N song audio files may be acquired from a mobile terminal, or N song audio files may be acquired from another manner, where N is an integer greater than 1, the mobile terminal may be, for example, a mobile phone, a tablet computer, a notebook computer, a palm computer, a mobile internet device (MID, mobile internet device), a wearable device (e.g., a smart watch (such as iwatch), a smart bracelet, a pedometer, etc.), or another terminal device capable of installing and deploying an instant messaging application client, and the acquired N song audio files are stored, and the location where the N song audio files are stored may be a local device for splicing the noise-containing songs, a cloud terminal, or another storage space.
S502, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.
Specifically, it is determined whether the N song audio files contain noise, wherein the noise may be an interference signal that affects the sound quality of the audio files. In general, noise is a random signal, and whether the N song audio files contain noise can be determined by simply analyzing the audio files.
As a possible implementation, the noise level is determined by the interference level of the audio file, and the noise will generally cause interference in a certain playing time period of the audio file.
S503, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.
Specifically, if it is determined that the N song audio files contain noise audio files, noise reduction processing is performed on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value, obtain N noise reduction audio files, and store the N noise reduction audio files, where the position where the N noise reduction audio files are stored may be a device local where the noise-containing songs are spliced, or a cloud, or other storage spaces.
S504, M fragments are extracted from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.
Specifically, a total of M segments are extracted from a storage space i for storing the N song audio files and a storage space j for storing the noise reduction audio files, and the extracted M segments are stored in a storage space of segments to be spliced, where M is an integer greater than 1.
And S505, splicing the M segments according to a preset sequence to obtain a spliced song audio file.
Specifically, the M segments stored in the storage space of the segments to be spliced are spliced according to a preset sequence to obtain a spliced song audio file, where the preset sequence may be a time sequence added to the storage space of the segments to be spliced, a random sequence preset by a device for splicing songs with noise, or other sequences.
And S506, locking the splicing position of the spliced song audio files.
Specifically, the splicing place of the high-quality spliced songs is locked. In order to achieve seamless splicing, therefore, the splicing trace cannot be displayed, and the splicing place needs to be locked and preprocessed.
And S507, processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
Specifically, the splicing position of the spliced song audio files is processed to obtain seamless spliced song audio files. Among them, the pretreatment may include, but is not limited to: and adjusting the melody of the spliced place of the spliced songs, inserting the melodies with similar change degrees, and smoothing the spliced place with larger change amplitude.
The embodiment of the invention obtains N song audio files, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; splicing the M segments according to a preset sequence to obtain a spliced song audio file, and locking a splicing place of the high-quality spliced song; and presetting the splicing place of the spliced songs to obtain seamless spliced songs. By adopting the embodiment of the invention, the spliced song audio file can be subjected to preset processing so as to obtain a seamless song with better tone quality.
Referring to fig. 6, fig. 6 is a schematic structural diagram of a device for splicing noise-containing songs according to a first embodiment of the present invention, where the device for splicing noise-containing songs shown in fig. 6 may include an obtaining unit 601, a first determining unit 602, a first processing unit 603, a first extracting unit 604, and a splicing unit 605, which are as follows:
an obtaining unit 601, configured to obtain N song audio files, where N is an integer greater than 1.
A first determining unit 602, configured to determine whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold.
Specifically, the first determining unit 602 includes: an analyzing unit (not shown) for analyzing the N song audio files to obtain a spectrum variation rule of the N song audio files; a second determining unit (not shown) configured to determine whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed by the analyzing unit.
A first processing unit 603, configured to perform noise reduction processing on the noise audio file in the N song audio files when the first determining unit determines that the N song audio files contain a noise audio file, so as to obtain a noise-reduced audio file with noise smaller than a threshold.
A first extracting unit 604, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1.
Specifically, the first extracting unit 604 includes: a second extracting unit (not shown) for extracting song audio files that are not subjected to the noise reduction processing and original singing portions of the noise reduction audio files from among the N song audio files; a first determining unit (not shown) configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unit; a first clipping unit (not shown) configured to clip, according to the start time and the end time of each voice segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, in the N song audio files determined by the first determining unit, so as to extract the M segments.
Specifically, the first extracting unit 604 includes: a third extracting unit (not shown) for extracting the song audio files that are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files from the N song audio files; a second determining unit (not shown) for determining a start time and an end time of each tune fragment in the accompaniment part of the song audio file and the noise-reduced song audio file that are not subjected to the noise reduction processing in the N song audio files of the third extracting unit; a second clipping unit (not shown) configured to clip the start time and the end time of each tune clip in the song audio file that is not subjected to the noise reduction processing and the accompaniment portion of the noise reduction audio file in the N song audio files determined by the second determining unit to extract the M clips.
And the splicing unit 605 is configured to splice the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file.
Optionally, after the splicing unit 605 splices the M segments according to a preset sequence, the apparatus further includes: locking the splicing position of the spliced song audio files; and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
It can be understood that the functions of the functional modules of the apparatus for splicing songs with noise in this embodiment may be specifically implemented according to the method in the method embodiment, and the specific implementation process may refer to the related description of the method embodiment, which is not described herein again.
In the embodiment of the present invention, the obtaining unit 601 obtains N song audio files, where N is an integer greater than 1; a first judging unit 602 judges whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold; when the first judging unit judges that the N song audio files contain noise audio files, the first processing unit 603 performs noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold; a first extracting unit 604 extracts M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files among the N song audio files, where M is an integer greater than 1; the splicing unit 605 splices the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file; the locking unit is used for locking the splicing position of the spliced song audio files; and the second processing unit is used for processing the splicing position of the spliced song audio file locked by the locking unit so as to obtain a seamless spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.
Referring to fig. 7, fig. 7 is a schematic structural diagram of a device for splicing songs with noise according to a second embodiment of the present invention. The device for splicing the songs containing the noise described in the embodiment comprises: at least one input device 1000; at least one output device 2000; at least one processor 3000, e.g., a CPU; and a memory 4000, the input device 1000, the output device 2000, the processor 3000, and the memory 4000 being connected by a bus 5000.
The input device 1000 may be a touch panel, a general PC, a liquid crystal display, a touch screen, or the like.
The memory 4000 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 4000 is used for storing a set of program codes, and the input device 1000, the output device 2000 and the processor 3000 are used for calling the program codes stored in the memory 4000 to execute the following operations:
the processor 3000 is configured to obtain N song audio files, where N is an integer greater than 1;
the processor 3000 is further configured to determine whether the N song audio files include a noise audio file, where the noise audio file is a noise audio file with noise greater than a noise threshold;
the processor 3000 is further configured to perform noise reduction processing on the noise audio file in the N song audio files when it is determined that the N song audio files contain the noise audio file, so as to obtain a noise reduction audio file with noise smaller than a threshold;
the processor 3000 is further configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
the processor 3000 is further configured to splice the M segments according to a preset sequence to obtain a spliced song audio file.
In some possible embodiments, the processor 3000 is further configured to:
analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
In some possible embodiments, the processor 3000 is further configured to:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files;
and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments.
In some possible embodiments, the processor 3000 is further configured to:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and accompaniment parts of the noise reduction audio files;
determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files;
and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments.
In some possible embodiments, after the processor 3000 splices the M segments according to a preset order to obtain a spliced song audio file, the processor 3000 is further specifically configured to:
locking the splicing position of the spliced song audio files;
and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.
In a specific implementation, the input device 1000, the output device 2000, and the processor 3000 described in this embodiment of the present invention may execute the implementation manners described in the embodiments of the method for splicing songs including noise described in fig. 1 to fig. 5 provided in this embodiment of the present invention, or may execute the implementation manners of the apparatus for splicing songs including noise described in the first embodiment of the apparatus for splicing songs including noise provided in this embodiment of the present invention, which is not described herein again.
The modules or sub-modules in all the embodiments of the present invention may be implemented by a general-purpose Integrated Circuit, such as a CPU (Central Processing Unit), or an ASIC (Application Specific Integrated Circuit).
The steps in the method of the embodiment of the invention can be sequentially adjusted, combined and deleted according to actual needs.
The units in the device of the embodiment of the invention can be merged, divided and deleted according to actual needs.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.
While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.
Claims (6)
1. A method for splicing songs containing noise is characterized by comprising the following steps:
acquiring N song audio files from a cloud, wherein N is an integer greater than 1;
judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value;
if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value;
extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1;
splicing the M segments according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced; locking the splicing position of the spliced song audio files, and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;
wherein, the extracting M segments from the song audio files without the noise reduction processing and the noise reduction audio files in the N song audio files, where M is an integer greater than 1, includes:
extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files,The determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: the voice of the original singing partSeparating into one statement segment, wherein each statement segment has corresponding starting time and ending time;
and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments.
2. The method of claim 1, wherein said determining whether the N song audio files contain a noise audio file comprises:
analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.
3. An apparatus for splicing noise-containing songs, comprising:
the acquisition unit is used for acquiring N song audio files from the cloud, wherein N is an integer greater than 1;
the first judging unit is used for judging whether the N song audio files acquired by the acquiring unit contain noise audio files, and the noise audio files are noise audio files with noise larger than a noise threshold;
the first processing unit is used for performing noise reduction processing on the noise audio files in the N song audio files when the first judging unit judges that the N song audio files contain the noise audio files so as to obtain noise reduction audio files with noise smaller than a threshold value;
a first extracting unit, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;
the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced;
the apparatus is further specifically configured to: locking the splicing position of the spliced song audio files; processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;
wherein the first extraction unit includes:
the second extraction unit is used for extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;
a first determining unit, configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unit,The determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: separating the voice of the original singing part into sentence segments, wherein each sentence segment has corresponding starting time and ending time;
and the first cutting unit is used for cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files determined by the first determining unit so as to extract the M segments.
4. The apparatus of claim 3, wherein the first determination unit comprises:
the analysis unit is used for analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;
and the second judging unit is used for judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed and obtained by the analyzing unit.
5. An apparatus for splicing noise-containing songs, comprising:
a processor and a memory; wherein the processor performs the method of any of claims 1 to 2 by calling code or instructions in the memory.
6. A computer storage medium for storing a computer program, wherein the computer program causes a computer to perform the method of any one of claims 1 to 2.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710996071.5A CN107622775B (en) | 2015-03-20 | 2015-03-20 | Method for splicing songs containing noise and related products |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710996071.5A CN107622775B (en) | 2015-03-20 | 2015-03-20 | Method for splicing songs containing noise and related products |
CN201510125170.7A CN104778958B (en) | 2015-03-20 | 2015-03-20 | A kind of method and device of Noise song splicing |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510125170.7A Division CN104778958B (en) | 2015-03-20 | 2015-03-20 | A kind of method and device of Noise song splicing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107622775A CN107622775A (en) | 2018-01-23 |
CN107622775B true CN107622775B (en) | 2020-12-18 |
Family
ID=53620385
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710996071.5A Expired - Fee Related CN107622775B (en) | 2015-03-20 | 2015-03-20 | Method for splicing songs containing noise and related products |
CN201510125170.7A Expired - Fee Related CN104778958B (en) | 2015-03-20 | 2015-03-20 | A kind of method and device of Noise song splicing |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510125170.7A Expired - Fee Related CN104778958B (en) | 2015-03-20 | 2015-03-20 | A kind of method and device of Noise song splicing |
Country Status (1)
Country | Link |
---|---|
CN (2) | CN107622775B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106970950B (en) * | 2017-03-07 | 2021-08-24 | 腾讯音乐娱乐(深圳)有限公司 | Similar audio data searching method and device |
CN107591149B (en) * | 2017-09-18 | 2021-09-28 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio synthesis method, device and storage medium |
CN109949792B (en) * | 2019-03-28 | 2021-08-13 | 优信拍(北京)信息科技有限公司 | Multi-audio synthesis method and device |
CN112037739B (en) * | 2020-09-01 | 2024-02-27 | 腾讯音乐娱乐科技(深圳)有限公司 | Data processing method and device and electronic equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102348150A (en) * | 2010-07-22 | 2012-02-08 | 索尼公司 | Audio signal processing apparatus, audio signal processing method, and program |
CN102568527A (en) * | 2011-12-20 | 2012-07-11 | 广东步步高电子工业有限公司 | Method and system for easily cutting audio files and applied mobile handheld device |
CN203070769U (en) * | 2013-02-26 | 2013-07-17 | 珠海市杰理科技有限公司 | Automatic denoising device for audio playing and audio player |
US20140039891A1 (en) * | 2007-10-16 | 2014-02-06 | Adobe Systems Incorporated | Automatic separation of audio data |
CN103943113A (en) * | 2014-04-15 | 2014-07-23 | 福建星网视易信息系统有限公司 | Method and device for removing accompaniment from song |
CN104244086A (en) * | 2014-09-03 | 2014-12-24 | 陈飞 | Video real-time splicing device and method based on real-time conversation semantic analysis |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7948981B1 (en) * | 2006-10-23 | 2011-05-24 | Adobe Systems Incorpoated | Methods and apparatus for representing audio data |
CN102467939B (en) * | 2010-11-04 | 2014-08-13 | 北京彩云在线技术开发有限公司 | Song audio frequency cutting apparatus and method thereof |
CN104112444B (en) * | 2014-07-28 | 2018-11-06 | 中国科学院自动化研究所 | A kind of waveform concatenation phoneme synthesizing method based on text message |
-
2015
- 2015-03-20 CN CN201710996071.5A patent/CN107622775B/en not_active Expired - Fee Related
- 2015-03-20 CN CN201510125170.7A patent/CN104778958B/en not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140039891A1 (en) * | 2007-10-16 | 2014-02-06 | Adobe Systems Incorporated | Automatic separation of audio data |
CN102348150A (en) * | 2010-07-22 | 2012-02-08 | 索尼公司 | Audio signal processing apparatus, audio signal processing method, and program |
CN102568527A (en) * | 2011-12-20 | 2012-07-11 | 广东步步高电子工业有限公司 | Method and system for easily cutting audio files and applied mobile handheld device |
CN203070769U (en) * | 2013-02-26 | 2013-07-17 | 珠海市杰理科技有限公司 | Automatic denoising device for audio playing and audio player |
CN103943113A (en) * | 2014-04-15 | 2014-07-23 | 福建星网视易信息系统有限公司 | Method and device for removing accompaniment from song |
CN104244086A (en) * | 2014-09-03 | 2014-12-24 | 陈飞 | Video real-time splicing device and method based on real-time conversation semantic analysis |
Also Published As
Publication number | Publication date |
---|---|
CN107622775A (en) | 2018-01-23 |
CN104778958B (en) | 2017-11-24 |
CN104778958A (en) | 2015-07-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3142031B1 (en) | Preset style song processing method and apparatus | |
US11176141B2 (en) | Preserving emotion of user input | |
CN102568478B (en) | Video play control method and system based on voice recognition | |
CN107622775B (en) | Method for splicing songs containing noise and related products | |
CN105630787B (en) | Animation realization method and device based on dynamic portable network graphics | |
CN111767740B (en) | Sound effect adding method and device, storage medium and electronic equipment | |
US12119023B2 (en) | Audio onset detection method and apparatus | |
CN104485115A (en) | Pronunciation evaluation equipment, method and system | |
CN110688518A (en) | Rhythm point determining method, device, equipment and storage medium | |
CN108509175B (en) | Voice interaction method and electronic equipment | |
CN112908292A (en) | Text voice synthesis method and device, electronic equipment and storage medium | |
CN104778217B (en) | A kind of song stitching algorithm and device | |
CN104778221A (en) | Music collaborate splicing method and device | |
CN110070885B (en) | Audio starting point detection method and device | |
CN104778220B (en) | A kind of method and device for song splicing of singing opera arias | |
CN104778957A (en) | Song audio processing method and device | |
CN110286776A (en) | Input method, device, electronic equipment and the storage medium of character combination information | |
CN110085214B (en) | Audio starting point detection method and device | |
CN111459446B (en) | Resource processing method of electronic book, computing equipment and computer storage medium | |
CN104780438A (en) | Method and device for splicing video and song audio | |
CN106572140B (en) | Media file playing method and terminal equipment | |
CN109495786B (en) | Pre-configuration method and device of video processing parameter information and electronic equipment | |
CN112542157B (en) | Speech processing method, device, electronic equipment and computer readable storage medium | |
CN115484503B (en) | Bullet screen generation method and device, electronic equipment and storage medium | |
CN105469794B (en) | Information processing method and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18 Applicant after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd. Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18 Applicant before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20201218 |
|
CF01 | Termination of patent right due to non-payment of annual fee |