CN107622775B

CN107622775B - Method for splicing songs containing noise and related products

Info

Publication number: CN107622775B
Application number: CN201710996071.5A
Authority: CN
Inventors: 程胜
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2015-03-20
Filing date: 2015-03-20
Publication date: 2020-12-18
Anticipated expiration: 2035-03-20
Also published as: CN107622775A; CN104778958B; CN104778958A

Abstract

The embodiment of the invention discloses a method for splicing songs containing noise and a related product, wherein the method comprises the following steps: acquiring N song audio files; judging whether the N song audio files contain noise audio files or not; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The embodiment of the invention also discloses a device for splicing the songs containing the noise. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.

Description

Method for splicing songs containing noise and related products

Technical Field

The embodiment of the invention relates to the technical field of audio processing, in particular to a method for splicing songs containing noise and a related product.

Background

With the rapid development of mobile internet technology, devices (such as mobile phones, tablet computers, touch) and dedicated players are increasingly pursuing music. At present, in the prior art, the function playing of music is limited to improve the own tone quality, for example, the inferior audio file is properly processed by the processing software provided in the device or the special player, so as to improve the playing quality; or the requirements on the quality of the audio files are high, the original audio files can be played with good effects, and local processing of the audio files, such as shearing technology and extraction technology, is less researched.

In the prior art, the cutting of songs mainly depends on network software, and the software often needs manual operation and cannot accurately position the exact position of each lyric.

Disclosure of Invention

The embodiment of the invention provides a method for splicing songs containing noise and a related product, which can cut and splice audio files of the songs containing the noise.

The first aspect of the embodiments of the present invention provides a method for splicing noise-containing songs, including:

acquiring N song audio files, wherein N is an integer greater than 1;

judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value;

if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value;

extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1;

and splicing the M segments according to a preset sequence to obtain a spliced song audio file.

A second aspect of the embodiments of the present invention provides a device for splicing songs containing noise, including:

the device comprises an acquisition unit, a storage unit and a processing unit, wherein the acquisition unit is used for acquiring N song audio files, and N is an integer greater than 1;

the first judging unit is used for judging whether the N song audio files acquired by the acquiring unit contain noise audio files, and the noise audio files are noise audio files with noise larger than a noise threshold;

the first processing unit is used for performing noise reduction processing on the noise audio files in the N song audio files when the first judging unit judges that the N song audio files contain the noise audio files so as to obtain noise reduction audio files with noise smaller than a threshold value;

a first extracting unit, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;

and the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence so as to obtain a spliced song audio file.

The embodiment of the invention has the following beneficial effects:

in the embodiment of the invention, N song audio files are obtained, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; and splicing the M segments according to a preset sequence to obtain a spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.

Drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the embodiments and the drawings used in the description will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without creative efforts.

Fig. 1 is a schematic flowchart of a method for splicing songs with noise according to a first embodiment of the present invention;

fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention;

fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention;

fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;

fig. 5 is a schematic flowchart of a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention;

fig. 6 is a schematic structural diagram of an apparatus for splicing songs with noise according to a first embodiment of the present invention;

fig. 7 is a schematic structural diagram of an apparatus for splicing songs with noise according to a second embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present invention without any creative efforts shall fall within the protection scope of the embodiments of the present invention.

The terms "first," "second," "third," and "fourth," etc. in the description and claims of the invention and in the accompanying drawings are used for distinguishing between different objects and not for describing a particular order. Furthermore, the terms "include" and "have," as well as any variations thereof, are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements listed, but may alternatively include other steps or elements not listed, or inherent to such process, method, article, or apparatus.

In implementation, in the embodiment of the present invention, the apparatus may include, but is not limited to: notebook computers, cell phones, tablet computers, smart wearable devices, players, MP3, MP4, smart televisions, set-top boxes, servers, and the like. The system of the device refers to the operating system of the device, and may include but is not limited to: android system, saiban system, Windows system, IOS (mobile operating system developed by apple inc.) system, and the like. It should be noted that the Android device refers to an Android system device, the shift device refers to a shift system device, and the like. The above devices are merely examples and are not exhaustive and include, but are not limited to, the above devices.

In implementation, in the embodiment of the present invention, the songs may include, but are not limited to: chinese songs, english songs, russian songs, spanish songs, classical songs, pop music songs, rock music songs, soft music songs, rap songs, solo songs, songs in video, etc. The above songs are merely exemplary and are not exhaustive, including but not limited to the above songs.

In implementation, the format of the song may include, but is not limited to: MP3, MP4, WMV, WAV, FLV, etc. The formats of the above songs are merely examples and are not exhaustive, including but not limited to the formats of the above songs.

Referring to fig. 1, fig. 1 is a flowchart illustrating an embodiment of a method for splicing songs with noise according to an embodiment of the present invention. The method for splicing the songs containing the noises described in the embodiment comprises the following steps:

s101, acquiring N song audio files.

Specifically, N song audio files may be acquired from a device for splicing the noise-containing songs, or N song audio files may be acquired from a mobile terminal, or N song audio files may be acquired from another manner, where N is an integer greater than 1, the mobile terminal may be, for example, a mobile phone, a tablet computer, a notebook computer, a palm computer, a mobile internet device (MID, mobile internet device), a wearable device (e.g., a smart watch (such as iwatch), a smart bracelet, a pedometer, etc.), or another terminal device capable of installing and deploying an instant messaging application client, and the acquired N song audio files are stored, and the location where the N song audio files are stored may be a local device for splicing the noise-containing songs, a cloud terminal, or another storage space.

S102, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.

Specifically, it is determined whether the N song audio files contain noise, wherein the noise may be an interference signal that affects the sound quality of the audio files. In general, noise is a random signal, and whether the N song audio files contain noise can be determined by simply analyzing the audio files.

As a possible implementation, the noise level is determined by the interference level of the audio file, and the noise will generally cause interference in a certain playing time period of the audio file.

S103, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.

Specifically, if it is determined that the N song audio files contain noise audio files, noise reduction processing is performed on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value, obtain N noise reduction audio files, and store the N noise reduction audio files, where the position where the N noise reduction audio files are stored may be a device local where the noise-containing songs are spliced, or a cloud, or other storage spaces.

S104, extracting M fragments from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.

Specifically, a total of M segments are extracted from a storage space i for storing the N song audio files and a storage space j for storing the noise reduction audio files, and the extracted M segments are stored in a storage space of segments to be spliced, where M is an integer greater than 1.

And S105, splicing the M segments according to a preset sequence to obtain a spliced song audio file.

Specifically, the M segments stored in the storage space of the segments to be spliced are spliced according to a preset sequence to obtain a spliced song audio file, where the preset sequence may be a time sequence added to the storage space of the segments to be spliced, a random sequence preset by a device for splicing songs with noise, or other sequences.

Referring to fig. 2, fig. 2 is a flowchart illustrating a method for splicing songs with noise according to a second embodiment of the present invention, including the steps of:

s201, analyzing the N song audio files to obtain the spectrum change rule of the N song audio files.

Specifically, the N song audio files may be analyzed by an audio file spectrum processing device to obtain a spectrum variation law of the N song audio files.

S202, judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.

Specifically, whether the N song audio files contain the noise audio files is judged according to the frequency spectrum change rule of the N song audio files. And analyzing and comparing the frequency spectrum of each song audio file under the normal condition with the frequency spectrum of each song audio file under the noise condition, wherein if the frequency spectrum with obvious fluctuation in the song audio files is the noise frequency spectrum, the song audio files containing the noise frequency spectrum are the noisy song audio files.

In the embodiment of the invention, the N song audio files are analyzed to obtain the frequency spectrum change rule of the N song audio files; and judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files, so that whether the N song audio files contain the noise audio files can be determined.

Referring to fig. 3, fig. 3 is a flowchart illustrating a method for splicing songs with noise according to a third embodiment of the present invention, including the steps of:

s301, extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files.

Specifically, the original singing part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original singing portion and an accompaniment portion, and the song audio file may be separated to extract the original singing portion of the song audio file.

S302, determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files.

Specifically, the start time and the end time of each voice segment in the original part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files are determined. The original part of the speech can be separated into sentence segments, and each sentence segment has a corresponding start time and end time.

S303, cutting according to the song audio files which are not subjected to the noise reduction processing in the N song audio files and the start time and the end time of each voice fragment in the original part of the noise reduction audio files to extract the M fragments.

Specifically, the M segments are extracted by cutting the start time and the end time of each speech segment in the original part of the song audio file and the noise-reduced audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence segment separated from the voice of the original part according to the corresponding starting time and ending time of each sentence segment to extract M segments.

Extracting song audio files which are not subjected to the noise reduction processing and original singing parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files; and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.

Referring to fig. 4, fig. 4 is a schematic flowchart of a fourth embodiment of a method for splicing songs with noise according to the embodiment of the present invention, including the steps of:

s401, extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files.

Specifically, the accompaniment parts of the song audio files and the noise reduction audio files which are not subjected to noise reduction processing in the N song audio files are extracted. In general, a song audio file may include an original song portion and an accompaniment portion, and the song audio file may be separated to extract the accompaniment portion of the song audio file.

S402, determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files.

Specifically, the start time and the end time of each tune fragment in the song audio file without noise reduction processing and the accompaniment part of the noise reduction audio file in the N song audio files are determined. The voice of the accompaniment part can be separated into sentence fragments, and each sentence fragment has a corresponding starting time and ending time.

And S403, cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment part of the noise reduction audio files in the N song audio files to extract the M fragments.

Specifically, the M clips are extracted by cutting the start time and the end time of each speech clip in the accompaniment part of the song audio file and the noise reduction audio file which are not subjected to noise reduction processing in the N song audio files. The method can cut one sentence fragment separated from the voice of the accompaniment part according to the corresponding starting time and ending time of each sentence fragment so as to extract M fragments.

Extracting song audio files which are not subjected to the noise reduction processing and accompaniment parts of the noise reduction audio files from the N song audio files; determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files; and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments. By adopting the embodiment of the invention, the song audio file can be cut, and the fragments contained in the song audio file can be obtained.

Referring to fig. 5, fig. 5 is a flowchart illustrating a fifth embodiment of a method for splicing songs with noise according to an embodiment of the present invention, including the steps of:

s501, acquiring N song audio files.

S502, judging whether the N song audio files contain noise audio files or not, wherein the noise audio files are noise audio files with noise larger than a noise threshold value.

S503, carrying out noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value.

S504, M fragments are extracted from the song audio files which are not subjected to the noise reduction processing in the N song audio files and the noise reduction audio files.

And S505, splicing the M segments according to a preset sequence to obtain a spliced song audio file.

And S506, locking the splicing position of the spliced song audio files.

Specifically, the splicing place of the high-quality spliced songs is locked. In order to achieve seamless splicing, therefore, the splicing trace cannot be displayed, and the splicing place needs to be locked and preprocessed.

And S507, processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.

Specifically, the splicing position of the spliced song audio files is processed to obtain seamless spliced song audio files. Among them, the pretreatment may include, but is not limited to: and adjusting the melody of the spliced place of the spliced songs, inserting the melodies with similar change degrees, and smoothing the spliced place with larger change amplitude.

The embodiment of the invention obtains N song audio files, wherein N is an integer greater than 1; judging whether the N song audio files contain noise audio files, wherein the noise audio files are noise audio files with noise larger than a noise threshold value; if the N song audio files contain noise audio files, carrying out noise reduction treatment on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold value; extracting M fragments from the song audio files which are not subjected to the noise reduction processing and the noise reduction audio files in the N song audio files, wherein M is an integer larger than 1; splicing the M segments according to a preset sequence to obtain a spliced song audio file, and locking a splicing place of the high-quality spliced song; and presetting the splicing place of the spliced songs to obtain seamless spliced songs. By adopting the embodiment of the invention, the spliced song audio file can be subjected to preset processing so as to obtain a seamless song with better tone quality.

Referring to fig. 6, fig. 6 is a schematic structural diagram of a device for splicing noise-containing songs according to a first embodiment of the present invention, where the device for splicing noise-containing songs shown in fig. 6 may include an obtaining unit 601, a first determining unit 602, a first processing unit 603, a first extracting unit 604, and a splicing unit 605, which are as follows:

an obtaining unit 601, configured to obtain N song audio files, where N is an integer greater than 1.

A first determining unit 602, configured to determine whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold.

Specifically, the first determining unit 602 includes: an analyzing unit (not shown) for analyzing the N song audio files to obtain a spectrum variation rule of the N song audio files; a second determining unit (not shown) configured to determine whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed by the analyzing unit.

A first processing unit 603, configured to perform noise reduction processing on the noise audio file in the N song audio files when the first determining unit determines that the N song audio files contain a noise audio file, so as to obtain a noise-reduced audio file with noise smaller than a threshold.

A first extracting unit 604, configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1.

Specifically, the first extracting unit 604 includes: a second extracting unit (not shown) for extracting song audio files that are not subjected to the noise reduction processing and original singing portions of the noise reduction audio files from among the N song audio files; a first determining unit (not shown) configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unit; a first clipping unit (not shown) configured to clip, according to the start time and the end time of each voice segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, in the N song audio files determined by the first determining unit, so as to extract the M segments.

Specifically, the first extracting unit 604 includes: a third extracting unit (not shown) for extracting the song audio files that are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files from the N song audio files; a second determining unit (not shown) for determining a start time and an end time of each tune fragment in the accompaniment part of the song audio file and the noise-reduced song audio file that are not subjected to the noise reduction processing in the N song audio files of the third extracting unit; a second clipping unit (not shown) configured to clip the start time and the end time of each tune clip in the song audio file that is not subjected to the noise reduction processing and the accompaniment portion of the noise reduction audio file in the N song audio files determined by the second determining unit to extract the M clips.

And the splicing unit 605 is configured to splice the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file.

Optionally, after the splicing unit 605 splices the M segments according to a preset sequence, the apparatus further includes: locking the splicing position of the spliced song audio files; and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.

It can be understood that the functions of the functional modules of the apparatus for splicing songs with noise in this embodiment may be specifically implemented according to the method in the method embodiment, and the specific implementation process may refer to the related description of the method embodiment, which is not described herein again.

In the embodiment of the present invention, the obtaining unit 601 obtains N song audio files, where N is an integer greater than 1; a first judging unit 602 judges whether the N song audio files acquired by the acquiring unit contain a noise audio file, where the noise audio file is a noise audio file whose noise is greater than a noise threshold; when the first judging unit judges that the N song audio files contain noise audio files, the first processing unit 603 performs noise reduction processing on the noise audio files in the N song audio files to obtain noise reduction audio files with noise smaller than a threshold; a first extracting unit 604 extracts M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files among the N song audio files, where M is an integer greater than 1; the splicing unit 605 splices the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file; the locking unit is used for locking the splicing position of the spliced song audio files; and the second processing unit is used for processing the splicing position of the spliced song audio file locked by the locking unit so as to obtain a seamless spliced song audio file. The invention can cut and splice the audio files containing noise, and can reduce the noise of the spliced audio files so as to improve the tone quality of the spliced audio files.

Referring to fig. 7, fig. 7 is a schematic structural diagram of a device for splicing songs with noise according to a second embodiment of the present invention. The device for splicing the songs containing the noise described in the embodiment comprises: at least one input device 1000; at least one output device 2000; at least one processor 3000, e.g., a CPU; and a memory 4000, the input device 1000, the output device 2000, the processor 3000, and the memory 4000 being connected by a bus 5000.

The input device 1000 may be a touch panel, a general PC, a liquid crystal display, a touch screen, or the like.

The memory 4000 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 4000 is used for storing a set of program codes, and the input device 1000, the output device 2000 and the processor 3000 are used for calling the program codes stored in the memory 4000 to execute the following operations:

the processor 3000 is configured to obtain N song audio files, where N is an integer greater than 1;

the processor 3000 is further configured to determine whether the N song audio files include a noise audio file, where the noise audio file is a noise audio file with noise greater than a noise threshold;

the processor 3000 is further configured to perform noise reduction processing on the noise audio file in the N song audio files when it is determined that the N song audio files contain the noise audio file, so as to obtain a noise reduction audio file with noise smaller than a threshold;

the processor 3000 is further configured to extract M segments from the song audio files that are not subjected to the noise reduction processing and the noise reduction audio files, where M is an integer greater than 1;

the processor 3000 is further configured to splice the M segments according to a preset sequence to obtain a spliced song audio file.

In some possible embodiments, the processor 3000 is further configured to:

analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;

and judging whether the N song audio files contain noise audio files according to the frequency spectrum change rule of the N song audio files.

In some possible embodiments, the processor 3000 is further configured to:

extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;

determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files;

and cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing in the N song audio files and the original part of the noise reduction audio file to extract the M segments.

In some possible embodiments, the processor 3000 is further configured to:

extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and accompaniment parts of the noise reduction audio files;

determining the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files;

and cutting according to the starting time and the ending time of each tune fragment in the song audio files which are not subjected to the noise reduction processing and the accompaniment parts of the noise reduction audio files in the N song audio files to extract the M fragments.

In some possible embodiments, after the processor 3000 splices the M segments according to a preset order to obtain a spliced song audio file, the processor 3000 is further specifically configured to:

locking the splicing position of the spliced song audio files;

and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files.

In a specific implementation, the input device 1000, the output device 2000, and the processor 3000 described in this embodiment of the present invention may execute the implementation manners described in the embodiments of the method for splicing songs including noise described in fig. 1 to fig. 5 provided in this embodiment of the present invention, or may execute the implementation manners of the apparatus for splicing songs including noise described in the first embodiment of the apparatus for splicing songs including noise provided in this embodiment of the present invention, which is not described herein again.

The modules or sub-modules in all the embodiments of the present invention may be implemented by a general-purpose Integrated Circuit, such as a CPU (Central Processing Unit), or an ASIC (Application Specific Integrated Circuit).

The steps in the method of the embodiment of the invention can be sequentially adjusted, combined and deleted according to actual needs.

The units in the device of the embodiment of the invention can be merged, divided and deleted according to actual needs.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.

While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.

Claims

1. A method for splicing songs containing noise is characterized by comprising the following steps:

acquiring N song audio files from a cloud, wherein N is an integer greater than 1;

splicing the M segments according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced; locking the splicing position of the spliced song audio files, and processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;

wherein, the extracting M segments from the song audio files without the noise reduction processing and the noise reduction audio files in the N song audio files, where M is an integer greater than 1, includes:

determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files，The determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: the voice of the original singing partSeparating into one statement segment, wherein each statement segment has corresponding starting time and ending time;

2. The method of claim 1, wherein said determining whether the N song audio files contain a noise audio file comprises:

3. An apparatus for splicing noise-containing songs, comprising:

the acquisition unit is used for acquiring N song audio files from the cloud, wherein N is an integer greater than 1;

the splicing unit is used for splicing the M segments extracted by the first extraction unit according to a preset sequence to obtain a spliced song audio file, wherein the M segments are stored in a storage space for storing the segments to be spliced, and the preset sequence is a time sequence of adding the M segments into the storage space of the segments to be spliced;

the apparatus is further specifically configured to: locking the splicing position of the spliced song audio files; processing the splicing position of the spliced song audio files to obtain seamless spliced song audio files, wherein the processing comprises the following steps: adjusting the melody of the splicing place of the spliced song audio files or inserting the melodies with similar variation degrees;

wherein the first extraction unit includes:

the second extraction unit is used for extracting song audio files which are not subjected to the noise reduction processing in the N song audio files and original singing parts of the noise reduction audio files;

a first determining unit, configured to determine a start time and an end time of each speech segment in the original portion of the song audio file and the song audio file that is not subjected to the noise reduction processing, among the N song audio files extracted by the second extracting unit，The determining the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files comprises: separating the voice of the original singing part into sentence segments, wherein each sentence segment has corresponding starting time and ending time;

and the first cutting unit is used for cutting according to the starting time and the ending time of each voice segment in the song audio file which is not subjected to the noise reduction processing and the original part of the noise reduction audio file in the N song audio files determined by the first determining unit so as to extract the M segments.

4. The apparatus of claim 3, wherein the first determination unit comprises:

the analysis unit is used for analyzing the N song audio files to obtain the spectrum change rule of the N song audio files;

and the second judging unit is used for judging whether the N song audio files contain noise audio files according to the spectrum change rule of the N song audio files analyzed and obtained by the analyzing unit.

5. An apparatus for splicing noise-containing songs, comprising:

a processor and a memory; wherein the processor performs the method of any of claims 1 to 2 by calling code or instructions in the memory.

6. A computer storage medium for storing a computer program, wherein the computer program causes a computer to perform the method of any one of claims 1 to 2.