An Approach to Programmable Signal Processor Assemblers and

1275

IEEE Tk4NSACTIONS ON COMMUNICATIONS, VOL. COM-34, NO. 12, DECEMBER 1986 T. A. C. M. Claasenand W. F. G. Mecklenbrauker, “Comparison of the convergence of two algorithms for adaptive FIR digital filters,” IEEE Trans. Circuits Syst., vol. CAS-28, pp. 510-518, June 1981. [7] F.R. Mageeand J. G. Proakis,“Adaptivemaximumlikelihood sequence estimation for digital signaling in the presenceof intersymbol interference,” IEEE Trans. Inform. Theory, vol.IT-19,pp. 120124, Jan. 1973.

11. GENERAL STRUCTURE AND METHODOLOGY

[6]

An Approach to Programmable Signal Processor Assemblers and Simulators TERESA H.-Y. MENG AND DAVID G. MESSERSCHMITT

Abstract-A method of programming an assembler and simulator for a programmable digital signal processor (PDSP) is proposed. This method is general and simple to execute, and makes use of the UNIX” utilities awk, sed, and grep to translate a source program, consisting of assembly instructions and C statements, toeither machine code forthe assembler or a C program for the simulator.The approach also provides the advantages that little programming effort is required to generate a new assembler and simulatorforanother processor instruction set. It also allowsaPDSPto be simulated in the contextofa larger system containing other PDSP’s, other hardware, and an external environment. A disadvantage is that the approach is applicable onlyto the UNIX operating system.

I. INTRODUCTION There are now available several single-chip programmable digital signal processors (PDSP’s) [ 11-[6]. The programming of a new assembler and simulator for a new PDSP instruction setisgenerallyconsideredtobeamajorundertaking. We describe here an approach that avoids extensive programming effort, and allows an assembler and simulator to be generated

in typically less than a day’s effort. In addition, the approach allows the PDSP to be simulatedin the context of a simulation of the systemin which it is embedded, including the simulation ofmultiple PDSP’s, associatedhardware,andtheexternal environment. We avoid most of the programming by combining existing UNIX@operating system [7] utilities. UNIX utilities translate a source program for the targetPDSP to machine codefor the C programforthesimulator. assembler and generatea Simulation is accomplished by compiling and executing the simulator C program that was generated. Themain advantage with which new of the approachis its universality and the ease assemblers and simulators for different PDSP’s can be generatedfrom an existingassembler and simulator.Also, debugging C statements (not executed by the PDSP’s) can be of added to theassemblysourcefile,andthedebugging assembly code is aidedby the symbolic debuggingfacilities of UNIX . The TI TMS32010 PDSP was used as an example. Starting with theTMS32010assembler and simulator,it took only about 5 h to generate a simulator for the Fujitsu MB8764 and 10 h for the TMS32020. In Section I1 we describe the basic approach, and Section 111gives conclusions. A basic familiarity with UNIX utilities and the C language is assumed. Paperapproved by theEditorforTransmissionSystems of theIEEE Communications Society. Manuscript received September 30, 1985; revised April15,1986.Thisworkwassupported by theSemiconductorResearch Corporation. TheauthorsarewiththeDepartment of ElectricalEngineeringand Computer Sciences, University of California, Berkeley, CA 94720. IEEE Log Number 861 1280.

Our approach is to consider the source fileas text and avoid extenske programming by using UNIX-provided text editors to modi‘& the contents. The commands borrowed from UNIX are: grep

sed awk

A family of commands that search the input files for lines matching a pattern and copy those lines to the standard output. A noninteractivestreamtexteditor. A patternscanning and processinglanguage which searches a setof files for patterns performsspecified actions upon lines or fields of lines which contain instances of those patterns.

A . Implementation of the Assembler An assemblergeneratesmachinecodeofatarget PDSP correspondingtoagivenassemblylanguageprogram.Our assembler is amnemonictranslator withno facilities for macros, link loading, etc.In our case, the source program a, as whole, including assembly instructions and optional added C statements (to be included in the simulation), is regarded as lines of character strings. The grep command extracts every line that is recognized as a particular assemblymnemonic (this is repeated for each mnemonic), sed translates the mnemonic awk generatesthe to an appropriate machine code,and hexadecimal representation to be storedin the program ROM. Indirect addressing, labels, constant definitions, expressions, and dynamic memory allocationsare simple ways to represent addresses of the data RAM in an instruction, as provided by assemblydirectives, and are realized by some special awk commands.Imbedded C statements(forthe simulation) are ignored by the assembler. The three major steps in assembly are described in more detail in the following three subsections. I ) Separation of AssembIy Instructions from C Statements: The assembler first removes imbedded C statements. Our convention is to addan identification “I**/” at the startof eachlinecontaining a C statement, and theselines are removed by the sed command “ / \ / \ *.* \ * \ / / d ” . 2) Label Translations: Branch instructionsrequire that labels be translated into exact program locations. This can be accomplished by parsing the source program twice. In the first pass, each assembly instructionis assigned a program address (by an awk program) which is placed at the beginning of each line. This same awk program generates a sed command file, which contains statements like s/label/address/. On the second pass this command file is used to replace the branch destination in the source file with the corresponding address. Subroutine callsare treated in the sameway to obtain theaddressesofsubroutines.Morecomplicatedbranching expressions such asargument passing insubroutines or destination addresses relative to the instruction location are easily implemented by an awk command. 3) Machine Code Generation: Inthe final step,each assemblyinstructiongoesthrougha pipelined sequence of commands such as egrep ‘instrl I instr2 I ..‘ input- file I sed - f sed. instr I awk - f awk. instr where instrl, instr2, etc., are the assembly mnemonics that have the same instructionfield format. Because the number of different instruction formats is usually quite small (seven for the TMS32010), the assembling process needs only a few lines of command sequences, as described above. The egrep command passes only the lines in the input file

0090-6778/86/1200-1275$01.00 0 1986 IEEE

1276

COMMUNICATIONS, TRANSACTIONS ON IEEE

which have one of the character strings specified in the first argument. Labels, specified by lower case characters, cannot ' be confused with mnemonics. Thesed command file sed.instr translates eachinstr to asequence of bits. Finally, the function awk.instr combines the opcode bits and operands to generate hexadecimal machine code. These commandssplit the original source input file into pieces, each containing similar groupings of instructions. Since each assembly line has been given an addressatthebeginning of theline,theinstructionsare reordered by concatenating these files, togetherand using the UNIX command sort.

VOL. NO. COM-34,

12, DECEMBER 1986

single C main program is generated to represent the entire assembly program for simulation. The sequence of assembly instructions is preserved in the simulator C program to ease assembly language the task of debugging.Therefore,an subroutine (lines of mnemonics expressing the functions of this subroutine) is represented not as a C subprogram, but rather as linesof C statementsin the single main program.We simulateadirect branch orsubroutinecall with a goto statement to a corresponding label placed in the C program. Flow-of-control statements where the destination address is know only at runtime represent a special problem. Examples of such statements arethe instructions RET and CALA in the B. Implementation of the Simulator TMS320 and indirect branches.RET, which causes a return at The purpose of a simulator is to simulate and debug the the end of a subroutine, is usually realized with a stack that storesthereturnaddressatitstop.Sincealltheregisters, assembly program by reproducingits functionality onthe including the program counter and the stack pointer, have been VAX. The simulator consists of two phases. 1) Translation phase: Translate assembly mnemonics to C declared as integer variables, the problem of simulating this functions line by line. instruction can be treated by software in the same way-as by must beindicated, 2) Simulation phase: Compile the resulting C programwhile hardware.However,thereturnaddress there is no explicit address associated with each .linking to a collectionof such functions definedin supporting since instruction in the simulator. CALA uses the current accumulafiles, and then run the resulting object program. Awk tor value as the subroutine address, and this is known only at Translation is performed similarly to the assembler. programsare used toconvert assembly mnemonics into C runtime. Indirect branches on register values have no specified destination in the instruction, so programmers need to specify functions and, at the same time, generate auxiliary code and files whenever necessary. the allowablebranchdestinations by puttinglabels in the source file. (This also helps in debugging the program at run Forthesimulationphase,all the opcodefunctionsare time). provided in a header file instr.h and a file of C functions The way we handle these specialflow of control instructions supp0rt.c. These two files are unique to a particular processor and instruction set, and are the primary files which must be is best illustrated by the following fragment of an automatirewritten to generate a simulator for anew PDSP. These files cally generated C simulation program: are straightforward to generate: given the instruction description, we simulatethefunctionality of that instruction by acc-branch: switch(DEST1N) { definingaCmacrostatement.Ourexperience is that for case 0: present generation PDSP's, generating a new assembler and goto reset; simulator takes less than one day, although it is difficult to break; extrapolate this experience new to generations which generally case 2: have much more complicated instruction sets. goto returnl; Thegeneralapproachtothedesign of thesimulationC break; program which is code generated from theassembly program case 45: is as follows. goto nopfunc; 1) Each register and data memory location is assigned an break; integer variable to provide a simple way to trace assembly default: instructions. return(l01); 2) Eachinstruction is executed as a single function call. break; Groups of similar instructions share a single function call with 1 arguments generated through the use of the C macro preproreset: cessor. /* Load accumulator with constant 45. */ 3) The simulator has been interfaced with BLOSIM [8] to LDA(LACK(45)); . achieve simulationsof multiprocessor systems and simulations /* Call the subroutine whose address */ of a PDSP chip in combination with other hardware. /* is the accumulator content. */ Withthesefeatures in mind,thefollowingsubsections CALA( ); goto acc-branch; describe the phases of the code generationand the structure of returnl: the resulting C program. I ) Translation Phase: Thetranslationphase is different from the assembler translation Qnly in that the output is a C program instead of machinecode.Thesamesequence of commandssuffices:separateCstatementsfrom assembly nopfunc: LDA(LACK(0)); code, define constants, label each instruction, assign function goto acc-branch; RET( ); names to instructions,and finally reordertheinstructions using the labels. These steps are described in the following This program fragment may be confusing at first, but the subsections. idea is simple.Sincebranchdestinations in theTMS320 a) Separation of Assembly and C Statements: First, C program are represented by numerical values (for example, statementsareseparatedfrom assembly instructions;the the contents of the accumulator), the branching is performed former are simply inserted in the final simulation program. by the switchstatementadded by the awk programwhich Numerical labels are added to each line of the source file so recordstheprogramaddress of each assembly instruction. that, after C statements are extracted from the source file, they Control is transferredbased on the valueof numerical variable can later be reinserted in the processed assembly program to DESTIN, theprogramaddress of the next instruction. A simple awk programandthe becomethesimulator. DESTIN is assigned a value in the function associated with command sort are used. that particular instruction. The number of cases in the switch b) Generation of the Flow-of-Control Program: A statement is the sum of twice the number of subroutines (one

1277

IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. COM-34, NO. 12, DECEMBER 1986

In file “instr.h”: for CALA andone for RET) andthenumber of labels specified by the user for indirect branching. The first #define.ik ADD(a, s) acc,(ram[a + dp*128] < < S) instruction (at address 0x000, label reset) loads the number acc, - l*(ram[a + dp*l28]

An Approach to Programmable Signal Processor Assemblers and

An Approach to Programmable Signal Processor Assemblers and

Suggest Documents

Programmable Digital Signal Processor (PDSP): A Survey - CiteSeerX

Digital signal processor against field programmable gate array ...

An End-to-End Approach to Globally Scalable Programmable ...

Using Digital Signal Processor Singularities to ...

Real-time full-programmable fuzzy processor on an Intel ... - EUSFLAT

An Estimation Approach for Programmable Logic - CiteSeerX

An Approach to Using Non Safety-Assured Programmable ...

Agents, assemblers, and ANTS: scheduling

A PROGRAMMABLE BASEBAND PROCESSOR ... - Semantic Scholar

An Approach to Using Non Safety-Assured Programmable ...

Generalized measurements via programmable quantum processor

An Integrated Approach to Locality Conscious Processor Allocation ...

Digital Signal Processor (DSP) Architecture

The Lucasfilm audio signal processor

Digital signal processor fundamentals and system design

digital signal processor architectures and programming

A PROGRAMMABLE APPROACH TO RESOURCE ... - Semantic Scholar

A Coalgebraic Approach to the Y86 Processor

An Survey of Low Power FFT Processor for Signal ...

A Low Area and Low Power Programmable Baseband Processor

A VLSI field-programmable mixed- signal array to ... - Sim Bamford

Designing a programmable analog signal ... - Semantic Scholar

an efficient signal-matching approach to melody ... - Google Sites

An analysis of signal transformation approach to triangular waveform ...