Vs 1003
Vs 1003
mic
audio
VS1003 Mono Stereo Stereo Ear−
audio
L
MIC AMP MUX R
ADC DAC phone Driver
line
audio output
GPIO 4
GPIO
X ROM
DREQ
SO
SI Serial X RAM
Data/
SCLK Control 4
XCS
Interface VSDSP
XDCS Y ROM
RX
TX UART
Y RAM
Clock
multiplier
Instruction Instruction
RAM ROM
Contents
VS1003 1
Table of Contents 2
List of Figures 5
1 Licenses 6
2 Disclaimer 6
3 Definitions 6
7 SPI Buses 16
7.1 General . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
7.2 SPI Bus Pin Descriptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
7.2.1 VS10xx Native Modes (New Mode) . . . . . . . . . . . . . . . . . . . 16
7.2.2 VS1001 Compatibility Mode . . . . . . . . . . . . . . . . . . . . . . . 16
7.3 Data Request Pin DREQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
7.4 Serial Protocol for Serial Data Interface (SDI) . . . . . . . . . . . . . . . . . . . 17
7.4.1 General . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
7.4.2 SDI in VS10xx Native Modes (New Mode, recommended) . . . . . . 17
7.4.3 SDI in VS1001 Compatibility Mode . . . . . . . . . . . . . . . . . . . 18
7.4.4 Passive SDI Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
7.5 Serial Protocol for Serial Command Interface (SCI) . . . . . . . . . . . . . . . . 18
7.5.1 General . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
7.5.2 SCI Read . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
7.5.3 SCI Write . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
7.6 SPI Timing Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
7.7 SPI Examples with SM_SDINEW and SM_SDISHARED set . . . . . . . . . . . 21
7.7.1 Two SCI Writes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
8 Functional Description 23
8.1 Main Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
8.2 Supported Audio Codecs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
8.2.1 Supported MP3 (MPEG layer III) Formats . . . . . . . . . . . . . . . 23
8.2.2 Supported WMA Formats . . . . . . . . . . . . . . . . . . . . . . . . 24
8.2.3 Supported RIFF WAV Formats . . . . . . . . . . . . . . . . . . . . . . 25
8.2.4 Supported MIDI Formats . . . . . . . . . . . . . . . . . . . . . . . . . 26
8.3 Data Flow of VS1003 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
8.4 Serial Data Interface (SDI) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
8.5 Serial Control Interface (SCI) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
8.6 SCI Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
8.6.1 SCI_MODE (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
8.6.2 SCI_STATUS (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
8.6.3 SCI_BASS (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
8.6.4 SCI_CLOCKF (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
8.6.5 SCI_DECODE_TIME (RW) . . . . . . . . . . . . . . . . . . . . . . . 33
8.6.6 SCI_AUDATA (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
8.6.7 SCI_WRAM (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
8.6.8 SCI_WRAMADDR (W) . . . . . . . . . . . . . . . . . . . . . . . . . . 33
8.6.9 SCI_HDAT0 and SCI_HDAT1 (R) . . . . . . . . . . . . . . . . . . . . 34
8.6.10 SCI_AIADDR (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
8.6.11 SCI_VOL (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
8.6.12 SCI_AICTRL[x] (RW) . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
9 Operation 37
9.1 Clocking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
9.2 Hardware Reset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
9.3 Software Reset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
9.4 ADPCM Recording . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
9.4.1 Activating ADPCM mode . . . . . . . . . . . . . . . . . . . . . . . . . 38
9.4.2 Reading IMA ADPCM Data . . . . . . . . . . . . . . . . . . . . . . . 38
9.4.3 Adding a RIFF Header . . . . . . . . . . . . . . . . . . . . . . . . . . 39
9.4.4 Playing ADPCM Data . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
9.4.5 Sample Rate Considerations . . . . . . . . . . . . . . . . . . . . . . . 40
9.4.6 Example Code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
9.5 SPI Boot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
9.6 Play/Decode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
9.7 Feeding PCM data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
9.8 SDI Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
9.8.1 Sine Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
9.8.2 Pin Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
9.8.3 Memory Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
9.8.4 SCI Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
10 VS1003 Registers 45
10.1 Who Needs to Read This Chapter . . . . . . . . . . . . . . . . . . . . . . . . . . 45
10.2 The Processor Core . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
10.3 VS1003 Memory Map . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
12 Contact Information 61
List of Figures
1 Measured ADC performance of the LINEIN pin. . . . . . . . . . . . . . . . . . . . 10
2 Measured ADC performance of the MIC pins (differential). . . . . . . . . . . . . . 10
3 Measured performance of RIGHT (or LEFT) output. . . . . . . . . . . . . . . . . 11
4 Typical spectrum of RIGHT (or LEFT) output. . . . . . . . . . . . . . . . . . . . . 11
5 Pin Configuration, LQFP-48. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
6 Pin Configuration, BGA-49. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
7 Typical Connection Diagram Using LQFP-48. . . . . . . . . . . . . . . . . . . . . 15
8 BSYNC Signal - one byte transfer. . . . . . . . . . . . . . . . . . . . . . . . . . . 18
9 BSYNC Signal - two byte transfer. . . . . . . . . . . . . . . . . . . . . . . . . . . 18
10 SCI Word Read . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
11 SCI Word Write . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
12 SPI Timing Diagram. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
13 Two SCI Operations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
14 Two SDI Bytes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
15 Two SDI Bytes Separated By an SCI Operation. . . . . . . . . . . . . . . . . . . . 22
16 Data Flow of VS1003. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
17 ADPCM Frequency Responses with 8kHz sample rate. . . . . . . . . . . . . . . 30
18 User’s Memory Map. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
19 RS232 Serial Interface Protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
1 Licenses
2 Disclaimer
3 Definitions
ABR Average BitRate. Bitrate of stream may vary locally, but will stay close to a given number
when averaged over a longer time.
B Byte, 8 bits.
b Bit.
CBR Constant BitRate. Bitrate of stream will be the same for each compression block.
SDI Serial Data Interface, an SPI bus for VS1003 bitstream data.
VBR Variable BitRate. Bitrate will vary depending on the complexity of the source material.
W Word. In VS_DSP, instruction words are 32 bits and data words are 16 bits wide.
1 Must be connected together as close the device as possible for latch-up immunity.
2 The maximum sample rate that can be played with correct speed is XTALI/256.
Thus, XTALI must be at least 12.288 MHz to be able to play 48 kHz at correct speed.
3 Reset value is 1.0×. Recommended SC_MULT=3.0×, SC_ADD=1.0× (SCI_CLOCKF=0x9000).
4 52.0 MHz is the maximum clock for the full CVDD range.
Typical values are measured of about 5000 devices of Lot 4234011, Week Code 0452.
1 3.0 volts can be achieved with +-to-+ wiring for mono difference sound.
2 AOLR may be much lower, but below Typical distortion performance may be compromised.
3 Above typical amplitude the Harmonic Distortion increases.
4 Unweighted, A-weighted is about 3 dB better.
5 Limit low due to noise level of production tester.
Tested with an MPEG 1.0 Layer-3 128 kbit/s sample and generated sine. Output at full volume.
XTALI 12.288 MHz. Internal clock multiplier 3.0×. CVDD = 2.5 V, AVDD = 2.8 V.
90
80
70
dB
60
50
40
SNR
30 SNRa
THD
20
0.001 0.01 0.1 1
input voltage (rms)
Measured ADC performance of the LINEIN pin. X-axis is rms amplitude of 1 kHz sine input.
Curves are unweighted signal-to-noise ratio (blue), A-weighted signal-to-noise ratio (green),
and unweighted signal-to-distortion ratio (red). Sampling rate of ADC is 48 kHz (master clock
12.288 MHz), noise calculated from 0 to 20 kHz.
90
80
70
dB
60
50
40
SNR
30 SNRa
THD
20
0.001 0.01 0.1
input voltage (rms)
Measured ADC performance of the MIC pins (differential). Other settings same as in Fig. 1.
100
80
60
dB
40
Measured performance of RIGHT (or LEFT) output with 1 kHz generated sine. Sampling rate
of DAC is 48 kHz (master clock 12.288 MHz), noise calculated from 0 to 20 kHz.
-20
-40
amplitude dB
-60
-80
-100
-120
0 5000 10000 15000 20000
frequency Hz
Typical spectrum of RIGHT (or LEFT) output with maximum level and 30 Ohm load. Setup is
the same is in Fig. 3.
Both LPQFP-48 and BGA-49 are lead (Pb) free and also RoHS compliant packages. RoHS
is a short name of Directive 2002/95/EC on the restriction of the use of certain hazardous
substances in electrical and electronic equipment.
5.1.1 LQFP-48
48
1
5.1.2 BGA-49
C
4.80
7.00
0.80 TYP
1.10 REF
0.80 TYP
1.10 REF
4.80
7.00
TOP VIEW
Pin types:
In BGA-49, no-connect balls are A7, B7, D4, E5, F1, G1.
In LQFP-48, no-connect pins are 11, 12, 25, 35, 36.
7 SPI Buses
7.1 General
The SPI Bus - that was originally used in some Motorola devices - has been used for both
VS1003’s Serial Data Interface SDI (Chapters 7.4 and 8.4) and Serial Control Interface SCI
(Chapters 7.5 and 8.5).
These modes are active on VS1003 when SM_SDINEW is set to 1 (default at startup). DCLK
and SDATA are not used for data transfer and they can be used as general-purpose I/O pins
(GPIO2 and GPIO3). BSYNC function changes to data interface chip select (XDCS).
This mode is active when SM_SDINEW is set to 0. In this mode, DCLK, SDATA and BSYNC
are active.
The DREQ pin/signal is used to signal if VS1003’s FIFO is capable of receiving data. If DREQ
is high, VS1003 can take at least 32 bytes of SDI data or one SCI command. When these
criteria are not met, DREQ is turned low, and the sender should stop transferring new data.
Because of the 32-byte safety area, the sender may send up to 32 bytes of SDI data at a
time without checking the status of DREQ, making controlling VS1003 easier for low-speed
microcontrollers.
Note: DREQ may turn low or high at any time, even during a byte transmission. Thus, DREQ
should only be used to decide whether to send more bytes. It should not abort a transmission
that has already started.
Note: In VS10XX products up to VS1002, DREQ was only used for SDI. In VS1003 DREQ is
also used to tell the status of SCI.
There are cases when you still want to send SCI commands when DREQ is low. Because
DREQ is shared between SDI and SCI, you can not determine if a SCI command has been
executed if SDI is not ready to receive. In this case you need a long enough delay after every
SCI command to make certain none of them is missed. The SCI Registers table in section 8.6
gives the worst-case handling time for each SCI register write.
7.4.1 General
The serial data interface operates in slave mode so DCLK signal must be generated by an
external circuit.
Data (SDATA signal) can be clocked in at either the rising or falling edge of DCLK (Chapter 8.6).
VS1003 assumes its data input to be byte-sychronized. SDI bytes may be transmitted either
MSb or LSb first, depending of contents of SCI_MODE (Chapter 8.6.1).
The firmware is able to accept the maximum bitrate the SDI supports.
If SM_SDISHARE is 1, the XDCS signal is internally generated by inverting the XCS input.
BSYNC
SDATA D7 D6 D5 D4 D3 D2 D1 D0
DCLK
When VS1003 is running in VS1001 compatibility mode, a BSYNC signal must be generated
to ensure correct bit-alignment of the input bitstream. The first DCLK sampling edge (rising or
falling, depending on selected polarity), during which the BSYNC is high, marks the first bit of
a byte (LSB, if LSB-first order is used, MSB, if MSB-first order is used). If BSYNC is ’1’ when
the last bit is received, the receiver stays active and next 8 bits are also received.
BSYNC
SDATA D7 D6 D5 D4 D3 D2 D1 D0 D7 D6 D5 D4 D3 D2 D1 D0
DCLK
7.5.1 General
The serial bus protocol for the Serial Command Interface SCI (Chapter 8.5) consists of an
instruction byte, address byte and one 16-bit data word. Each read or write operation can read
or write a single register. Data bits are read at the rising edge, so the user should update data
at the falling edge. Bytes are always send MSb first. XCS should be low for the full duration of
the operation, but you can have pauses between bits if needed.
The operation is specified by an 8-bit instruction opcode. The supported instructions are read
and write. See table below.
Instruction
Name Opcode Operation
READ 0b0000 0011 Read data
WRITE 0b0000 0010 Write data
Note: VS1003 sets DREQ low after each SCI operation. The duration depends on the opera-
tion. It is not allowed to start a new SCI/SDI operation before DREQ is high again.
XCS
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 30 31
SCK
3 2 1 0
SI 0 0 0 0 0 0 1 1 0 0 0 0 don’t care don’t care
15 14 1 0
SO 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 X
execution
DREQ
VS1003 registers are read from using the following sequence, as shown in Figure 10. First,
XCS line is pulled low to select the device. Then the READ opcode (0x3) is transmitted via
the SI line followed by an 8-bit word address. After the address has been read in, any further
data on SI is ignored by the chip. The 16-bit data corresponding to the received address will be
shifted out onto the SO line.
XCS should be driven high after data has been shifted out.
DREQ is driven low for a short while when in a read operation by the chip. This is a very short
time and doesn’t require special user attention.
XCS
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 30 31
SCK
3 2 1 0 15 14 1 0
SI 0 0 0 0 0 0 1 0 0 0 0 0 X
execution
DREQ
VS1003 registers are written from using the following sequence, as shown in Figure 11. First,
XCS line is pulled low to select the device. Then the WRITE opcode (0x2) is transmitted via the
SI line followed by an 8-bit word address.
After the word has been shifted in and the last clock has been sent, XCS should be pulled high
to end the WRITE sequence.
After the last bit has been sent, DREQ is driven low for the duration of the register update,
marked “execution” in the figure. The time varies depending on the register and its contents
(see table in Chapter 8.6 for details). If the maximum time is longer than what it takes from
the microcontroller to feed the next SCI command or SDI byte, it is not allowed to finish a new
SCI/SDI operation before DREQ has risen up again.
XCS
tXCS
0 1 14 15 16 30 31
SCK
SI
tH
tSU
SO
tZ
tV tDIS
1 25ns is when pin loaded with 100pF capacitance. The time is shorter with lower capacitance.
Note: As tWL and tWH, as well as tH require at least 2 clock cycles, the maximum speed for
the SPI bus that can easily be used with asynchronous clocks is 1/7 of VS1003’s internal clock
speed CLKI.
Note: Although the timing is derived from the internal clock CLKI, the system always starts up
in 1.0× mode, thus CLKI=XTALI.
XCS
0 1 2 3 30 31 32 33 61 62 63
SCK
1 0 2 1 0
SI 0 0 0 0 X 0 0 X
DREQ
Figure 13 shows two consecutive SCI operations. Note that xCS must be raised to inactive
state between the writes. Also DREQ must be respected as shown in the figure.
SDI Byte 1
SDI Byte 2
XCS
0 1 2 3 6 7 8 9 13 14 15
SCK
7 6 5 4 3 1 0 7 6 5 2 1 0
SI X
DREQ
SDI data is synchronized with a raising edge of xCS as shown in Figure 14. However, every
byte doesn’t need separate synchronization.
XCS
0 1 7 8 9 39 40 41 46 47
SCK
7 6 5 1 0 7 6 5 1 0
SI 0 0 X
DREQ
Figure 15 shows how an SCI operation is embedded in between SDI operations. xCS edges
are used to synchronize both SDI and SCI. Remember to respect DREQ as shown in the figure.
8 Functional Description
VS1003 is based on a proprietary digital signal processor, VS_DSP. It contains all the code
and data memory needed for MP3, WMA and WAV PCM + ADPCM audio decoding, MIDI
synthesizer, together with serial interfaces, a multirate stereo audio DAC and analog output
amplifiers and filters. Also ADPCM audio encoding is supported using a microphone amplifier
and A/D converter. A UART is provided for debugging purposes.
Conventions
Mark Description
+ Format is supported
- Format exists but is not supported
Format doesn’t exist
MPEG 1.01 :
Samplerate / Hz Bitrate / kbit/s
32 40 48 56 64 80 96 112 128 160 192 224 256 320
48000 + + + + + + + + + + + + + +
44100 + + + + + + + + + + + + + +
32000 + + + + + + + + + + + + + +
MPEG 2.01 :
Samplerate / Hz Bitrate / kbit/s
8 16 24 32 40 48 56 64 80 96 112 128 144 160
24000 + + + + + + + + + + + + + +
22050 + + + + + + + + + + + + + +
16000 + + + + + + + + + + + + + +
MPEG 2.51 2 :
Samplerate / Hz Bitrate / kbit/s
8 16 24 32 40 48 56 64 80 96 112 128 144 160
12000 + + + + + + + + + + + + + +
11025 + + + + + + + + + + + + + +
8000 + + + + + + + + + + + + + +
Windows Media Audio codec versions 2, 7, 8, and 9 are supported. All WMA profiles (L1, L2,
and L3) are supported. Previously streams were separated into Classes 1, 2a, 2b, and 3. WMA
9 Professional and WMA 9 Lossless are not supported. The decoder has passed Microsoft’s
conformance testing program.
WMA 7:
Samplerate Bitrate / kbit/s
/ Hz 5 6 8 10 12 16 20 22 32 40 48 64 80 96 128 160 192
8000 + + + +
11025 + +
16000 + + + +
22050 + + + +
32000 + + + +
44100 + + + + + + + +
48000 + +
WMA 8:
Samplerate Bitrate / kbit/s
/ Hz 5 6 8 10 12 16 20 22 32 40 48 64 80 96 128 160 192
8000 + + + +
11025 + +
16000 + + + +
22050 + + + +
32000 + + + +
44100 + + + + + + + +
48000 + + +
WMA 9:
Samplerate Bitrate / kbit/s
/ Hz 5 6 8 10 12 16 20 22 32 40 48 64 80 96 128 160 192 256 320
8000 + + + +
11025 + +
16000 + + + +
22050 + + + +
32000 + + + +
44100 + + + + + + + + + + +
48000 + + + + +
In addition to these expected WMA decoding profiles, all other bitrate and samplerate com-
binations are supported, including variable bitrate WMA streams. Note that WMA does not
consume the bitstream as evenly as MP3, so you need a higher peak transfer capability for
clean playback at the same bitrate.
General MIDI and SP-MIDI format 0 files are played. Format 1 and 2 files must be converted to
format 0 by the user. The maximum simultaneous polyphony is 40. Actual polyphony depends
on the internal clock rate (which is user-selectable), the instruments used, and the possible
postprocessing effects enabled, such as bass and treble enhancers. The polyphony restriction
algorithm makes use of the SP-MIDI MIP table, if present.
36.86 MHz (3.0× input clock) achieves 16-26 simultaneous sustained notes. The instantaneous
amount of notes can be larger. 36 MHz is a fair compromise between power consumption and
quality, but higher clocks can be used to increase polyphony.
VS1003b implements 36 distinct instruments. Each melodic, effect, and percussion instrument
is mapped into one of these instruments.
VS1003b
Melodic Effect Percussion
piano reverse cymbal bass drum
vibraphone guitar fret noise snare
organ breath closed hihat
guitar seashore open hihat
distortion guitar bird tweet high tom
bass telephone low tom
violin helicopter crash cymbal 2
strings applause ride cymbal
trumpet gunshot tambourine
sax high conga
flute low conga
lead maracas
pad claves
steeldrum
SDI MP3/PlusV/
Bitstream WAV/ADPCM/
FIFO WMA decode/
MIDI decode
SM_ADPCM=0
L
User Bass Treble Volume Audio S.rate.conv.
Application enhancer enhancer control FIFO and DAC R
First, depending on the audio data, and provided ADPCM encoding mode is not set, MP3,
WMA, PCM WAV, IMA ADPCM WAV, or MIDI data is received and decoded from the SDI bus.
After decoding, if SCI_AIADDR is non-zero, application code is executed from the address
pointed to by that register. For more details, see Application Notes for VS10XX.
Then data may be sent to the Bass and Treble Enhancer depending on the SCI_BASS register.
After that the signal is fed to the volume control unit, which also copies the data to the Audio
FIFO.
The Audio FIFO holds the data, which is read by the Audio interrupt (Chapter 10.13.1) and fed
to the sample rate converter and DACs. The size of the audio FIFO is 2048 stereo (2×16-bit)
samples, or 8 KiB.
The sample rate converter converts all different sample rates to XTALI/2, or 128 times the
highest usable sample rate. This removes the need for complex PLL-based clocking schemes
and allows almost unlimited sample rate accuracy with one fixed input clock frequency. With
a 12.288 MHz clock, the DA converter operates at 128 × 48 kHz, i.e. 6.144 MHz, and creates
a stereo in-phase analog signal. The oversampled output is low-pass filtered by an on-chip
analog filter. This signal is then forwarded to the earphone amplifier.
The serial data interface is meant for transferring compressed MP3 or WMA data, WAV PCM
and ADPCM data as well as MIDI data.
If the input of the decoder is invalid or it is not received fast enough, analog outputs are auto-
matically muted.
Also several different tests may be activated through SDI as described in Chapter 9.
The serial control interface is compatible with the SPI bus specification. Data transfers are
always 16 bits. VS1003 is controlled by writing and reading the registers of the interface.
1This is the worst-case time that DREQ stays low after writing to this register. The user may
choose to skip the DREQ check for those register writes that take less than 100 clock cycles to
execute.
2 In addition, the cycles spent in the user application routine must be counted.
3Firmware changes the value of this register immediately to 0x38, and in less than 100 ms to
0x30.
4 When mode register write specifies a software reset the worst-case time is 16600 XTALI
cycles.
5 Writing to this register may force internal clock to run at 1.0 × XTALI for a while. Thus it is not
a good idea to send SCI or SDI bits while this register update is in progress.
Note that if DREQ is low when an SCI write is done, DREQ also stays low after SCI write
processing.
SCI_MODE is used to control the operation of VS1003 and defaults to 0x0800 (SM_SDINEW
set).
When SM_DIFF is set, the player inverts the left channel output. For a stereo input this creates
virtual surround, and for a mono input this creates a differential left/right signal.
If you want to stop decoding a WAV, WMA, or MIDI file in the middle, set SM_OUTOFWAV, and
send data honouring DREQ until SM_OUTOFWAV is cleared. SCI_HDAT1 will also be cleared.
For WMA and MIDI it is safest to continue sending the stream, send zeroes for WAV.
Bit SM_PDOWN sets VS1003 into software powerdown mode. Note that software powerdown
is not nearly as power efficient as hardware powerdown activated with the XRESET pin.
If SM_TESTS is set, SDI tests are allowed. For more details on SDI tests, look at Chapter 9.8.
SM_STREAM activates VS1003’s stream mode. In this mode, data should be sent with as
even intervals as possible (and preferable with data blocks of less than 512 bytes), and VS1003
makes every attempt to keep its input buffer half full by changing its playback speed up to 5%.
For best quality sound, the average speed error should be within 0.5%, the bitrate should not
exceed 160 kbit/s and VBR should not be used. For details, see Application Notes for VS10XX.
This mode does not work with WMA files.
SM_DACT defines the active edge of data clock for SDI. When ’0’, data is read at the rising
edge, when ’1’, data is read at the falling edge.
When SM_SDIORD is clear, bytes on SDI are sent as a default MSb first. By setting SM_SDIORD,
the user may reverse the bit order for SDI, i.e. bit 0 is received first and bit 7 last. Bytes are,
however, still sent in the default order. This register bit has no effect on the SCI bus.
Setting SM_SDISHARE makes SCI and SDI share the same chip select, as explained in Chap-
ter 7.2, if also SM_SDINEW is set.
Setting SM_SDINEW will activate VS10xx native serial modes as described in Chapters 7.2.1 and 7.4.2.
Note, that this bit is set as a default when VS1003 is started up.
By activating SM_ADPCM and SM_RESET at the same time, the user will activate IMA ADPCM
recording mode. More information is available in the Application Notes for VS10XX.
If SM_ADPCM_HP is set at the same time as SM_ADPCM and SM_RESET, ADPCM mode
will start with a high-pass filter. This may help intelligibility of speech when there is lots of
background noise. The difference created to the ADPCM encoder frequency response is as
shown in Figure 17.
VS1003 AD Converter with and Without HP Filter
5
No High−Pass
High−Pass
−5
Amplitude / dB
−10
−15
−20
0 500 1000 1500 2000 2500 3000 3500 4000
Frequency / Hz
SM_LINE_IN is used to select the input for ADPCM recording. If ’0’, microphone input pins
MICP and MICN are used; if ’1’, LINEIN is used.
SCI_STATUS contains information on the current status of VS1003 and lets the user shutdown
the chip without audio glitches.
SS_VER is 0 for VS1001, 1 for VS1011, 2 for VS1002 and 3 for VS1003.
SS_APDOWN2 controls analog driver powerdown. Normally this bit is controlled by the sys-
tem firmware. However, if the user wants to powerdown VS1003 with a minimum power-off
transient, turn this bit to 1, then wait for at least a few milliseconds before activating reset.
SS_APDOWN1 controls internal analog powerdown. This bit is meant to be used by the system
firmware only.
SS_AVOL is the analog volume control: 0 = -0 dB, 1 = -6 dB, 3 = -12 dB. This register is meant
to be used automatically by the system firmware only.
The Bass Enhancer VSBE is a powerful bass boosting DSP algorithm, which tries to take the
most out of the users earphones without causing clipping.
Note: Because VSBE tries to avoid clipping, it gives the best bass boost with dynamical music
material, or when the playback volume is not set to maximum. It also does not create bass: the
source material must have some bass to begin with.
Treble Control VSTC is activated when ST_AMPLITUDE is non-zero. For example setting
SCI_BASS to 0x7a00 will have 10.5 dB treble enhancement at and above 10 kHz.
Bass Enhancer uses about 3.0 MIPS and Treble Control 1.2 MIPS at 44100 Hz sample rate.
Both can be on simultaneously.
The operation of SCI_CLOCKF is different in VS1003 than in VS1001, VS1011, and VS1002.
SCI_CLOCKF bits
Name Bits Description
SC_MULT 15:13 Clock multiplier
SC_ADD 12:11 Allowed multiplier addition
SC_FREQ 10: 0 Clock frequency
SC_MULT activates the built-in clock multiplier. This will multiply XTALI to create a higher CLKI.
The values are as follows:
SC_ADD tells, how much the decoder firmware is allowed to add to the multiplier specified by
SC_MULT if more cycles are temporarily needed to decode a WMA stream. The values are:
SC_FREQ is used to tell if the input clock XTALI is running at something else than 12.288 MHz.
XTALI is set in 4 kHz steps. The formula for calculating the correct value for this register is
XT ALI−8000000
4000 (XTALI is in Hz).
XT ALI
Note: because maximum sample rate is 256 , all sample rates are not available if XTALI
< 12.288 MHz.
Note: Automatic clock change can only happen when decoding WMA files. Automatic clock
change is done one 0.5× at a time. This does not cause a drop to 1.0× clock and you can
use the same SCI and SDI clock throughout the WMA file. When decoding ends the default
multiplier is restored and can cause 1.0× clock to be used momentarily.
When decoding correct data, current decoded time is shown in this register in full seconds.
The user may change the value of this register. In that case the new value should be written
twice.
SCI_DECODE_TIME is reset at every software reset and also when WAV (PCM or IMA AD-
PCM), WMA, or MIDI decoding starts or ends.
When decoding correct data, the current sample rate and number of channels can be found in
bits 15:1 and 0 of SCI_AUDATA, respectively. Bits 15:1 contain the sample rate divided by two,
and bit 0 is 0 for mono data and 1 for stereo. Writing to SCI_AUDATA will change the sample
rate directly.
Note: due to a bug, an odd sample rate reverses the operation of the stereo bit in VS1003b.
SCI_WRAM is used to upload application programs and data to instruction and data RAMs.
The start address must be initialized by writing to SCI_WRAMADDR prior to the first write/read
of SCI_WRAM. As 16 bits of data can be transferred with one SCI_WRAM write/read, and the
instruction word is 32 bits long, two consecutive writes/reads are needed for each instruction
word. The byte order is big-endian (i.e. most significant words first). After each full-word
write/read, the internal pointer is autoincremented.
SCI_WRAMADDR is used to set the program address for following SCI_WRAM writes/reads.
Address offset of 0 is used for X, 0x4000 for Y, and 0x8000 for instruction memory. Peripheral
registers can also be accessed.
Only user areas in X, Y, and instruction memory are listed above. Other areas can be accessed,
but should not be written to unless otherwise specified.
For WAV files, SCI_HDAT0 and SCI_HDAT1 read as 0x7761, and 0x7665, respectively.
For WMA files, SCI_HDAT1 contains 0x574D and SCI_HDAT0 contains the data speed mea-
sured in bytes per second. To get the bit-rate of the file, multiply the value of SCI_HDAT0 by
8.
for MIDI files, SCI_HDAT1 contains 0x4D54 and SCI_HDAT0 contains values according to the
following table:
When read, SCI_HDAT0 and SCI_HDAT1 contain header information that is extracted from
MP3 stream currently being decoded. After reset both registers are cleared, indicating no data
has been found yet.
The “sample rate” field in SCI_HDAT0 is interpreted according to the following table:
SCI_AIADDR indicates the start address of the application code written earlier with SCI_WRAMADDR
and SCI_WRAM registers. If no application code is used, this register should not be initialized,
or it should be initialized to zero. For more details, see Application Notes for VS10XX.
SCI_VOL is a volume control for the player hardware. For each channel, a value in the range of
0..254 may be defined to set its attenuation from the maximum volume level (in 0.5 dB steps).
The left channel value is then multiplied by 256 and the values are added. Thus, maximum
volume is 0 and total silence is 0xFEFE.
Example: for a volume of -2.0 dB for the left channel and -3.5 dB for the right channel: (4*256)
+ 7 = 0x407. Note, that at startup volume is set to full volume. Resetting the software does not
reset the volume setting.
SCI_AICTRL[x] registers ( x=[0 .. 3] ) can be used to access the user’s application program.
9 Operation
9.1 Clocking
VS1003 operates on a single, nominally 12.288 MHz fundamental frequency master clock. This
clock can be generated by external circuitry (connected to pin XTALI) or by the internal clock
crystal interface (pins XTALI and XTALO).
When the XRESET -signal is driven low, VS1003 is reset and all the control registers and
internal states are set to the initial values. XRESET-signal is asynchronous to any external
clock. The reset mode doubles as a full-powerdown mode, where both digital and analog parts
of VS1003 are in minimum power consumption stage, and where clocks are stopped. Also
XTALO is grounded.
After a hardware reset (or at power-up) DREQ will stay down for at least 16600 clock cycles,
which means an approximate 1.35 ms delay if VS1003 is run at 12.288 MHz. After this the
user should set such basic software registers as SCI_MODE, SCI_BASS, SCI_CLOCKF, and
SCI_VOL before starting decoding. See section 8.6 for details.
Internal clock can be multiplied with a PLL. Supported multipliers through the SCI_CLOCKF
register are 1.0 × . . . 4.5× the input clock. Reset value for Internal Clock Multiplier is 1.0×. If
typical values are wanted, the Internal Clock Multiplier needs to be set to 3.0× after reset. Wait
until DREQ rises, then write value 0x9800 to SCI_CLOCKF (register 3). See section 8.6.4 for
details.
In some cases the decoder software has to be reset. This is done by activating bit 2 in
SCI_MODE register (Chapter 8.6.1). Then wait for at least 2 µs, then look at DREQ. DREQ
will stay down for at least 16600 clock cycles, which means an approximate 1.35 ms delay if
VS1003 is run at 12.288 MHz. After DREQ is up, you may continue playback as usual.
If you want to make sure VS1003 doesn’t cut the ending of low-bitrate data streams and you
want to do a software reset, it is recommended to feed 2048 zeros (honoring DREQ) to the SDI
bus after the file and before the reset. This is especially important for MIDI files, although you
can also use SCI_HDAT1 polling.
If you want to interrupt the playing of a WAV, WMA, or MIDI file in the middle, set SM_OUTOFWAV
in the mode register, and wait until SCI_HDAT1 is cleared (with a two-second timeout) before
continuing with a software reset. MP3 does not currently implement the SM_OUTOFWAV be-
cause it is a stream format, thus the timeout requirement.
This chapter explains how to create RIFF/WAV file with IMA ADPCM format. This is a widely
supported ADPCM format and many PC audio playback programs can play it. IMA ADPCM
recording gives roughly a compression ratio of 4:1 compared to linear, 16-bit audio. This makes
it possible to record 8 kHz audio at 32.44 kbit/s.
IMA ADPCM recording mode is activated by setting bits SM_RESET and SM_ADPCM in
SCI_MODE. Optionally a high-pass-filter can be enabled for 8 kHz sample rate by also set-
ting SM_ADPCM_HP at the same time. Line input is used instead of mic if SM_LINE_IN is set.
Before activating ADPCM recording, user must write a clock divider value to SCI_AICTRL0
and gain to SCI_AICTRL1.
The differences of using SM_ADPCM_HP are presented in figure 17 (page 30). As a general
rule, audio will be fuller and closer to original if SM_ADPCM_HP is not used. However, speech
may be more intelligible with the high-pass filter active. Use the filter only with 8 kHz sample
rate.
Before activating ADPCM recording, user should write a clock divider value to SCI_AICTRL0.
Fc
The sampling frequency is calculated from the following formula: fs = 256×d , where Fc is the
internal clock (CLKI) and d is the divider value in SCI_AICTRL0. The lowest valid value for d is
4. If SCI_AICTRL0 contains 0, the default divider value 12 is used.
Examples:
Fc = 2.0 × 12.288 MHz, d = 12. Now fs = 2.0×12288000
256×12 = 8000 Hz.
2.5×14745000
Fc = 2.5 × 14.745 MHz, d = 18. Now fs = 256×18 = 8000 Hz.
Fc = 2.5 × 13 MHz, d = 16. Now fs = 2.5×13000000
256×16 = 7935 Hz.
Also, before activating ADPCM mode, the user has to set linear recording gain control to register
SCI_AICTRL1. 1024 is equal to digital gain 1, 512 is equal to digital gain 0.5 and so on. If the
user wants to use automatic gain control (AGC), SCI_AICTRL1 should be set to 0. Typical
speech applications usually are better off using AGC, as this takes care of relatively uniform
speech loudness in recordings.
Since VS1033c SCI_AICTRL2 controls the maximum AGC gain. If SCI_AICTRL2 is zero, the
maximum gain is 65535 (64×), i.e. whole range is used. This is compatible with previous
operation.
After IMA ADPCM recording has been activated, registers SCI_HDAT0 and SCI_HDAT1 have
new functions.
The IMA ADPCM sample buffer is 1024 16-bit words. The fill status of the buffer can be read
from SCI_HDAT1. If SCI_HDAT1 is greater than 0, you can read as many 16-bit words from
SCI_HDAT0. If the data is not read fast enough, the buffer overflows and returns to empty state.
Note: if SCI_HDAT1 ≥ 896, it may be better to wait for the buffer to overflow and clear before
reading samples. That way you may avoid buffer aliasing.
Each IMA ADPCM block is 128 words, i.e. 256 bytes. If you wish to interrupt reading data
and possibly continue later, please stop at a 128-word boundary. This way whole blocks are
skipped and the encoded stream stays valid.
To make your IMA ADPCM file a RIFF / WAV file, you have to add a header before the actual
data. Note that 2- and 4-byte values are little-endian (lowest byte first) in this format:
If you know beforehand how much you are going to record, you may fill in the complete header
before any actual data. However, if you don’t know how much you are going to record, you have
to fill in the header size datas F , S and D after finishing recording.
The 128 words (256 bytes) of an ADPCM block are read from SCI_HDAT0 and written into file
as follows. The high 8 bits of SCI_HDAT0 should be written as the first byte to a file, then the
low 8 bits. Note that this is contrary to the default operation of some 16-bit microcontrollers,
and you may have to take extra care to do this right.
A way to see if you have written the file in the right way is to check bytes 2 and 3 (the first byte
counts as byte 0) of each 256-byte block. Byte 3 should always be zero.
In order to play back your IMA ADPCM recordings, you have to have a file with a header as
described in Chapter 9.4.3. If this is the case, all you need to do is to provide the ADPCM file
through SDI as you would with any audio file.
VS10xx chips that support IMA ADPCM playback are capable of playing back ADPCM files with
any sample rate. However, some other programs may expect IMA ADPCM files to have some
exact sample rates, like 8000 or 11025 Hz. Also, some programs or systems do not support
sample rates below 8000 Hz.
However, if you don’t have an appropriate clock, you may not be able to get an exact 8 kHz
sample rate. If you have a 12 MHz clock, the closest sample rate you can get with 2.0 × 12 MHz
and d = 12 is fs = 7812.5Hz. Because the frequency error is only 2.4%, it may be best to set
fs = 8000Hz to the header if the same file is also to be played back with an PC. This causes
the sample to be played back a little faster (one minute is played in 59 seconds).
Note, however, that unless absolutely necessary, sample rates should not be tweaked in the
way described here.
If you want better quality with the expense of increased data rate, you can use higher sample
rates, for example 16 kHz.
The following code initializes IMA ADPCM encoding on VS1003b/VS1023 and shows how to
read the data.
/* Record loop */
while (recording_on) {
do {
w = ReadMp3SpiReg(SCI_HDAT1);
} while (w < 256 || w >= 896); /* wait until 512 bytes available */
If GPIO0 is set with a pull-up resistor to 1 at boot time, VS1003 tries to boot from external SPI
memory.
The memory has to be an SPI Bus Serial EEPROM with 16-bit addresses (i.e. at least 1 KiB).
The serial speed used by VS1003 is 245 kHz with the nominal 12.288 MHz clock. The first
three bytes in the memory have to be 0x50, 0x26, 0x48. The exact record format is explained
in the Application Notes for VS10XX.
9.6 Play/Decode
This is the normal operation mode of VS1003. SDI data is decoded. Decoded samples are
converted to analog domain by the internal DAC. If no decodable data is found, SCI_HDAT0
and SCI_HDAT1 are set to 0 and analog outputs are muted.
When there is no input for decoding, VS1003 goes into idle mode (lower power consumption
than during decoding) and actively monitors the serial data input for valid data.
All different formats can be played back-to-back without software reset in-between. Send at
least 4 zeros after each stream. However, using software reset between streams may still be a
good idea, as it guards against broken files. In this case you shouldt wait for the completion of
the decoding (SCI_HDAT0 and SCI_HDAT1 become zero) before issuing software reset.
VS1003 can be used as a PCM decoder by sending to it a WAV file header. If the length
sent in the WAV file is 0 or 0xFFFFFFF, VS1003 will stay in PCM mode indefinitely (or until
SM_OUTOFWAV has been set). 8-bit linear and 16-bit linear audio is supported in mono or
stereo.
There are several test modes in VS1003, which allow the user to perform memory tests, SCI
bus tests, and several different sine wave tests.
All tests are started in a similar way: VS1003 is hardware reset, SM_TESTS is set, and then a
test command is sent to the SDI bus. Each test is started by sending a 4-byte special command
sequence, followed by 4 zeros. The sequences are described below.
Sine test is initialized with the 8-byte sequence 0x53 0xEF 0x6E n 0 0 0 0, where n defines the
sine test to use. n is defined as follows:
n bits
Name Bits Description
F s Idx 7:5 Sample rate index
S 4:0 Sine skip speed
F s Idx Fs
0 44100 Hz
1 48000 Hz
2 32000 Hz
3 22050 Hz
4 24000 Hz
5 16000 Hz
6 11025 Hz
7 12000 Hz
S
The frequency of the sine to be output can now be calculated from F = F s × 128 .
Example: Sine test is activated with value 126, which is 0b01111110. Breaking n to its compo-
nents, F s Idx = 0b011 = 3 and thus F s = 22050Hz. S = 0b11110 = 30, and thus the final sine
30
frequency F = 22050Hz × 128 ≈ 5168Hz.
To exit the sine test, send the sequence 0x45 0x78 0x69 0x74 0 0 0 0.
Note: Sine test signals go through the digital volume control, so it is possible to test channels
separately.
Pin test is activated with the 8-byte sequence 0x50 0xED 0x6E 0x54 0 0 0 0. This test is meant
for chip production testing only.
Memory test mode is initialized with the 8-byte sequence 0x4D 0xEA 0x6D 0x54 0 0 0 0. After
this sequence, wait for 500000 clock cycles. The result can be read from the SCI register
SCI_HDAT0, and ’one’ bits are interpreted as follows:
Sci test is initialized with the 8-byte sequence 0x53 0x70 0xEE n 0 0 0 0, where n is the
register number to test. The content of the given register is read and copied to SCI_HDAT0. If
the register to be tested is HDAT0, the result is copied to SCI_HDAT1.
10 VS1003 Registers
User software is required when a user wishes to add some own functionality like DSP effects
to VS1003.
However, most users of VS1003 don’t need to worry about writing their own code, or about this
chapter, including those who only download software plug-ins from VLSI Solution’s Web site.
VS_DSP is a 16/32-bit DSP processor core that also had extensive all-purpose processor fea-
tures. VLSI Solution’s free VSKIT Software Package contains all the tools and documentation
needed to write, simulate and debug Assembly Language or Extended ANSI C programs for the
VS_DSP processor core. VLSI Solution also offers a full Integrated Development Environment
VSIDE for full debug capabilities.
SCI registers described in Chapter 8.6 can be found here between 0xC000..0xC00F. In addition
to these registers, there is one in address 0xC010, called SCI_CHANGE.
SCI_CHANGE bits
Name Bits Description
SCI_CH_WRITE 4 1 if last access was a write cycle.
SCI_CH_ADDR 3:0 SPI address of last access.
0000 0000
0030 System Vectors 0030
User
Instruction
0500 RAM 0500
X DATA Y DATA
RAM RAM
1800 1800
User User
1880 Space Space 1880
Stack Stack
1940 1940
1C00 1C00
1E00 1E00
4000 4000
8000 8000
C000 C000
Hardware
Register
C100 Space C100
Every fourth clock cycle, an internal 26-bit counter is added to by (DAC_FCTLH & 15) × 65536
+ DAC_FCTLL. Whenever this counter overflows, values from DAC_LEFT and DAC_RIGHT
are read and a DAC interrupt is generated.
GPIO_DIR is used to set the direction of the GPIO pins. 1 means output. GPIO_ODATA
remembers its values even if a GPIO_DIR bit is set to input.
Note that in VS1003 the VSDSP registers can be read and written through the SCI_WRAMADDR
and SCI_WRAM registers. You can thus use the GPIO pins quite conveniently.
INT_ENABLE bits
Name Bits Description
INT_EN_TIM1 7 Enable Timer 1 interrupt.
INT_EN_TIM0 6 Enable Timer 0 interrupt.
INT_EN_RX 5 Enable UART RX interrupt.
INT_EN_TX 4 Enable UART TX interrupt.
INT_EN_MODU 3 Enable AD modulator interrupt.
INT_EN_SDI 2 Enable Data interrupt.
INT_EN_SCI 1 Enable SCI interrupt.
INT_EN_DAC 0 Enable DAC interrupt.
Note: It may take up to 6 clock cycles before changing INT_ENABLE has any effect.
Writing any value to INT_GLOB_DIS adds one to the interrupt counter INT_COUNTER and
effectively disables all interrupts. It may take up to 6 clock cycles before writing to this register
has any effect.
Writing any value to INT_GLOB_ENA subtracts one from the interrupt counter (unless INT_COUNTER
already was 0). If the interrupt counter becomes zero, interrupts selected with INT_ENABLE
are restored. An interrupt routine should always write to this register as the last thing it does,
because interrupts automatically add one to the interrupt counter, but subtracting it back to its
initial value is the responsibility of the user. It may take up to 6 clock cycles before writing this
register has any effect.
By reading INT_COUNTER the user may check if the interrupt counter is correct or not. If the
register is not 0, interrupts are disabled.
AD_DIV controls the AD converter’s sampling frequency. To gather one sample, 128 × n clock
cycles are used (n is value of AD_DIV). The lowest usable value is 4, which gives a 48 kHz
sample rate when CLKI is 24.576 MHz. When AD_DIV is 0, the A/D converter is turned off.
10.10 Watchdog
The watchdog consist of a watchdog counter and some logic. After reset, the watchdog is
inactive. The counter reload value can be set by writing to WDOG_CONFIG. The watchdog is
activated by writing 0x4ea9 to register WDOG_RESET. Every time this is done, the watchdog
counter is reset. Every 65536’th clock cycle the counter is decremented by one. If the counter
underflows, it will activate vsdsp’s internal reset sequence.
Thus, after the first 0x4ea9 write to WDOG_RESET, subsequent writes to the same register
with the same value must be made no less than every 65536×WDOG_CONFIG clock cycles.
Once started, the watchdog cannot be turned off. Also, a write to WDOG_CONFIG doesn’t
change the counter reload value.
After watchdog has been activated, any read/write operation from/to WDOG_CONFIG or WDOG_DUMMY
will invalidate the next write operation to WDOG_RESET. This will prevent runaway loops from
resetting the counter, even if they do happen to write the correct number. Writing a wrong value
to WDOG_RESET will also invalidate the next write to WDOG_RESET.
10.10.1 Registers
The RS232 UART implements a serial interface using RS232 standard 8N1 (8 data bits, no
parity, 1 stop bit).
Start Stop
bit D0 D1 D2 D3 D4 D5 D6 D7 bit
When the line is idling, it stays in logic high state. When a byte is transmitted, the transmission
begins with a start bit (logic zero) and continues with data bits (LSB first) and ends up with a
stop bit (logic high). 10 bits are sent for each 8-bit byte frame.
10.11.1 Registers
A read from the status register returns the transmitter and receiver states.
UARTx_STATUS Bits
Name Bits Description
UART_ST_RXORUN 3 Receiver overrun
UART_ST_RXFULL 2 Receiver data register full
UART_ST_TXFULL 1 Transmitter data register full
UART_ST_TXRUNNING 0 Transmitter running
UART_ST_RXORUN is set if a received byte overwrites unread data when it is transferred from
the receiver shift register to the data register, otherwise it is cleared.
UART_ST_TXFULL is set if a write to the data register is not allowed (data register full).
A read from UARTx_DATA returns the received byte in bits 7:0, bits 15:8 are returned as ’0’. If
there is no more data to be read, the receiver data register full indicator will be cleared.
A receive interrupt will be generated when a byte is moved from the receiver shift register to
the receiver data register.
A write to UARTx_DATA sets a byte for transmission. The data is taken from bits 7:0, other
bits in the written value are ignored. If the transmitter is idle, the byte is immediately moved
to the transmitter shift register, a transmit interrupt request is generated, and transmission is
started. If the transmitter is busy, the UART_ST_TXFULL will be set and the byte remains in the
transmitter data register until the previous byte has been sent and transmission can proceed.
UARTx_DIV Bits
Name Bits Description
UART_DIV_D1 15:8 Divider 1 (0..255)
UART_DIV_D2 7:0 Divider 2 (6..255)
The divider is set to 0x0000 in reset. The ROM boot code must initialize it correctly depending
on the master clock frequency to get the correct bit speed. The second divider (D2 ) must be
from 6 to 255.
fm
The communication speed f = (D1 +1)×(D2 ) , where fm is the master clock frequency, and f is
the TX/RX speed in bps.
Transmitter operates as follows: After an 8-bit word is written to the transmit data register it will
be transmitted instantly if the transmitter is not busy transmitting the previous byte. When the
transmission begins a TX_INTR interrupt will be sent. Status bit [1] informs the transmitter data
register empty (or full state) and bit [0] informs the transmitter (shift register) empty state. A
new word must not be written to transmitter data register if it is not empty (bit [1] = ’0’). The
transmitter data register will be empty as soon as it is shifted to transmitter and the transmission
is begun. It is safe to write a new word to transmitter data register every time a transmit interrupt
is generated.
Receiver operates as follows: It samples the RX signal line and if it detects a high to low
transition, a start bit is found. After this it samples each 8 bit at the middle of the bit time (using
a constant timer), and fills the receiver (shift register) LSB first. Finally if a stop bit (logic high)
is detected the data in the receiver is moved to the reveive data register and the RX_INTR
interrupt is sent and a status bit[2] (receive data register full) is set, and status bit[2] old state is
copied to bit[3] (receive data overrun). After that the receiver returns to idle state to wait for a
new start bit. Status bit[2] is zeroed when the receiver data register is read.
RS232 communication speed is set using two clock dividers. The base clock is the processor
master clock. Bits 15-8 in these registers are for first divider and bits 7-0 for second divider. RX
sample frequency is the clock frequency that is input for the second divider.
10.12 Timers
There are two 32-bit timers that can be initialized and enabled independently of each other. If
enabled, a timer initializes to its start value, written by a processor, and starts decrementing
every clock cycle. When the value goes past zero, an interrupt is sent, and the timer initializes
to the value in its start value register, and continues downcounting. A timer stays in that loop
as long as it is enabled.
A timer has a 32-bit timer register for down counting and a 32-bit TIMER1_LH register for
holding the timer start value written by the processor. Timers have also a 2-bit TIMER_ENA
register. Each timer is enabled (1) or disabled (0) by a corresponding bit of the enable register.
10.12.1 Registers
TIMER_CONFIG Bits
Name Bits Description
TIMER_CF_CLKDIV 7:0 Master clock divider
TIMER_CF_CLKDIV is the master clock divider for all timer clocks. The generated internal
fm
clock frequency fi = c+1 , where fm is the master clock frequency and c is TIMER_CF_CLKDIV.
Example: With a 12 MHz master clock, TIMER_CF_DIV=3 divides the master clock by 4, and
the output/sampling clock would thus be fi = 12M Hz
3+1 = 3M Hz.
TIMER_ENABLE Bits
Name Bits Description
TIMER_EN_T1 1 Enable timer 1
TIMER_EN_T0 0 Enable timer 0
The 32-bit start value TIMER_Tx[L/H] sets the initial counter value when the timer is reset. The
fi
timer interrupt frequency ft = c+1 where fi is the master clock obtained with the clock divider
(see Chapter 10.12.2 and c is TIMER_Tx[L/H].
Example: With a 12 MHz master clock and with TIMER_CF_CLKDIV=3, the master clock fi =
3M Hz. If TIMER_TH=0, TIMER_TL=99, then the timer interrupt frequency ft = 3M Hz
99+1 =
30kHz.
TIMER_TxCNT[L/H] contains the current counter values. By reading this register pair, the user
may get knowledge of how long it will take before the next timer interrupt. Also, by writing to
this register, a one-shot different length timer interrupt delay may be realized.
10.12.6 Interrupts
Each timer has its own interrupt, which is asserted when the timer counter underflows.
The System Vector Tags are tags that may be replaced by the user to take control over several
decoder functions.
jmpi DAC_INT_ADDRESS,(i6)+1
The user may, at will, replace the first instruction with a jmpi command to gain control over the
audio interrupt.
jmpi SCI_INT_ADDRESS,(i6)+1
The user may, at will, replace the instruction with a jmpi command to gain control over the SCI
interrupt.
jmpi SDI_INT_ADDRESS,(i6)+1
The user may, at will, replace the instruction with a jmpi command to gain control over the SDI
interrupt.
jmpi MODU_INT_ADDRESS,(i6)+1
The user may, at will, replace the instruction with a jmpi command to gain control over the AD
Modulator interrupt.
jmpi EMPTY_INT_ADDRESS,(i6)+1
The user may, at will, replace the instruction with a jmpi command to gain control over the
UART TX interrupt.
jmpi RX_INT_ADDRESS,(i6)+1
The user may, at will, replace the first instruction with a jmpi command to gain control over the
UART RX interrupt.
jmpi EMPTY_INT_ADDRESS,(i6)+1
The user may, at will, replace the first instruction with a jmpi command to gain control over the
Timer 0 interrupt.
jmpi EMPTY_INT_ADDRESS,(i6)+1
The user may, at will, replace the first instruction with a jmpi command to gain control over the
Timer 1 interrupt.
jr
nop
If the user wants to take control away from the standard decoder, the first instruction should be
replaced with an appropriate j command to user’s own code.
Unless the user is feeding MP3 or WMA data at the same time, the system activates the user
program in less than 1 ms. After this, the user should steal interrupt vectors from the system,
and insert user programs.
The System Vector Functions are pointers to some functions that the user may call to help
implementing his own applications.
VS_DSP C prototype:
void WriteIRam(register __i0 u_int16 *addr, register __a1 u_int16 msW, register __a0 u_int16
lsW);
VS_DSP C prototype:
This is the preferred way to read from the User Instruction RAM.
VS_DSP C prototype:
u_int16 DataBytes(void);
If the user has taken over the normal operation of the system by switching the pointer in User-
Codec to point to his own code, he may read data from the Data Interface through this and the
following two functions.
This function returns the number of data bytes that can be read.
VS_DSP C prototype:
u_int16 GetDataByte(void);
Reads and returns one data byte from the Data Interface. This function will wait until there is
enough data in the input buffer.
VS_DSP C prototype:
void GetDataWords(register __i0 __y u_int16 *d, register __a0 u_int16 n);
Read n data byte pairs and copy them in big-endian format (first byte to MSBs) to d. This
function will wait until there is enough data in the input buffer.
VS_DSP C prototype:
void Reboot(void);
Causes a software reboot, i.e. jump to the standard firmware without reinitializing the IRAM
vectors.
This is NOT the same as the software reset function, which causes complete initialization.
This chapter describes the latest and most important changes to this document.
• Added mention of 8N1 format to Chapter 10.11, UART (Universal Asynchronous Re-
ceiver/Transmitter).
• Added chip image to last page.
• Removed MP2 and MP3 license descriptions from Chapter 1, Licenses, and other rele-
vant places, as all their patents have expired.
• Updated Chapter 3, Definitions.
• Other, minor changes.
• VS1003B-LK has been qualified for the same more relaxed CVDD limits as VS1003B-L
and VS1003B-B. Because of this, the more strict CVDD limits for VS1003B-LK removed
from Chapter 4.1, Absolute Maximum Ratings, and Chapter 4.2, Recommended Operat-
ing Conditions.
• Typical characteristics added to section 4.7, some values changed in section 4.3.
12 Contact Information
VLSI Solution Oy
Entrance G, 2nd floor
Hermiankatu 8
FI-33720 Tampere
FINLAND
URL: http://www.vlsi.fi/
Phone: +358-50-462-3200
Commercial e-mail: sales@vlsi.fi