上节我们讲了怎么写fft,这次我们来实践下fft的应用。
前天又开始唱从初二唱到大二的晴天,由于长期音不准,于是干脆借此做一个算音调的软件来校准自己的唱歌音调。
首先,是蛋疼的一个工作,读入麦克风。这个着实让我折腾很久,使用java.sound的接口,终于完成了采集。
一,采集
其实很简单(虽然搞了一天),抄了抄demo,发现只需要直接捕捉line。
- TargetDataLine line;
- AudioFormat format;
- format=new AudioFormat(AudioFormat.Encoding.PCM_SIGNED,44100,16,1,(16/8),44100,true);//定声音格式为44100hz采样,单通道
- DataLine.Info info = new DataLine.Info(TargetDataLine.class,
- format);
- if (!AudioSystem.isLineSupported(info)) //监测是否支持
- {
- System.out.println(“Line matching “ + info + ” not supported.”);
- }
- line = (TargetDataLine) AudioSystem.getLine(info);//打开line
- line.open(format, line.getBufferSize());
- ByteArrayOutputStream out = new ByteArrayOutputStream();
- int frameSizeInBytes = format.getFrameSize();
- int bufferLengthInFrames = line.getBufferSize() / 8;
- int bufferLengthInBytes = bufferLengthInFrames * frameSizeInBytes;
- byte[] data = new byte[bufferLengthInBytes];
- int numBytesRead;
- line.start();//打开轨道采集
- long st,tm;
- st=System.currentTimeMillis();
- tm=st;
- //System.out.format(“leninbd:%d\n”,bufferLengthInBytes);
- System.out.println(“Wait”);
- while(tm–st<3000)
- tm=System.currentTimeMillis();
- System.out.println(“Run”);
- st=System.currentTimeMillis();//一个准备时间
- tm=st;
- while(tm–st<2000)//读两秒的数据
- {
- if((numBytesRead = line.read(data, 0, bufferLengthInBytes)) == –1)
- {
- break;
- }
- out.write(data, 0, numBytesRead);
- tm=System.currentTimeMillis();
- }
- //System.out.println(“Done”);
- line.stop();
- line.close();
- line = null;
- out.flush();
- out.close();
- //完成采集
- int audioint[] = convert(format,out.toByteArray());
- //将byte转化为int
- PrintWriter pw=new PrintWriter(new FileWriter(new File(“dt.txt”)),true);
- for(int i=0;i<audioint.length;i++)
- pw.format(“%d “,audioint[i]);
- }
其中转化算法如下(不要问我什么原理,这种东西我一般都抄的。。。没什么深入了解的意义)
- int [] convert(AudioFormat format,byte[] audioBytes)
- {
- int[] audioData = null;
- if (format.getSampleSizeInBits() == 16)
- {
- int nlengthInSamples = audioBytes.length / 2;
- audioData = new int[nlengthInSamples];
- if (format.isBigEndian())
- {
- for (int i = 0; i < nlengthInSamples; i++)
- {
- /* First byte is MSB (high order) */
- int MSB = (int) audioBytes[2*i];
- /* Second byte is LSB (low order) */
- int LSB = (int) audioBytes[2*i+1];
- audioData[i] = MSB << 8 | (255 > LSB);
- }
- }
- else
- {
- for (int i = 0; i < nlengthInSamples; i++)
- {
- /* First byte is LSB (low order) */
- int LSB = (int) audioBytes[2*i];
- /* Second byte is MSB (high order) */
- int MSB = (int) audioBytes[2*i+1];
- audioData[i] = MSB << 8 | (255 > LSB);
- }
- }
- }
- else if (format.getSampleSizeInBits() == 8)
- {
- int nlengthInSamples = audioBytes.length;
- audioData = new int[nlengthInSamples];
- if (format.getEncoding().toString().startsWith(“PCM_SIGN”))
- {
- for (int i = 0; i < audioBytes.length; i++)
- {
- audioData[i] = audioBytes[i];
- }
- }
- else
- {
- for (int i = 0; i < audioBytes.length; i++)
- {
- audioData[i] = audioBytes[i] – 128;
- }
- }
- }
- return audioData;
- }
于是就完成了声音的基础采集,拿去放到mathematica测试,用吉他弹几个音嘛
- (*Mathematica Code*)
- Init[] :=
- (dt =
- Import[“/Users/none/Develop/mph/build/dt.txt”, “Table”][[1]];
- ListLinePlot[dt, PlotRange -> All])
- Anal := (
- (*dt=SignalMk[1,263];*)
- ts = Length[dt]/44100;
- dr = 1000;
- ListLinePlot[
- Abs[
- Fourier[dt[[100 ;; Length[dt] ]] ][[1 ;;
- Round[dr*ts] ]]],
- PlotRange -> All
- , DataRange -> {1, dr*ts}*1/ts]
- 1
- )
分析下得到两个图
(其实是晴天的第一个小结啦)
频谱几乎正确,虽然不知道200hz哪来的音啊。。。。。