Java, Java, Java: Programming Languages

3. Computers, Objects, and Java

3.6. Programming Languages

Most computer programs today are written in a high-level language, such as Java, C, C++, or FORTRAN. A programming language is considered high level if its statements resemble English-language statements. For example, all of the languages just mentioned have some form of an “if” statement, which says, “if some condition holds, then take some action.”

Computer scientists have invented hundreds of high-level program- ming languages, although relatively few of these have been put to prac- tical use. Some of the widely used languages have special features that make them suitable for one type of programming application or another. COBOL (COmmon Business-Oriented Language), for example, is still widely used in commercial applications. FORTRAN (FORmula TRANsla- tor) is still preferred by some engineers and scientists. C and C++ are still the primary languages used by operating system programmers.

In addition to having features that make them suitable for certain types of applications, high-level languages use symbols and notation that make them easily readable by humans. For example, arithmetic operations in Java make use of familiar operators such as “+” and “ ” and “/”, so that arithmetic expressions look more or less the way they do in algebra. So, to take the average of two numbers, you might use the expression

The problem is that computers cannot directly understand such expres- sions. In order for a computer to run a program, the program must first be translated into the computer’s machine language, which is the language understood by its CPU or microprocessor. Each type of microprocessor has its own particular machine language. That’s why when you buy soft- ware it runs either on a Macintosh, which uses the Power-PC chip, or on a

Windows machine, which uses the Pentium chip, but not on both. When aPlatform independence

program can run on just one type of chip, it is known as platform dependent. In general, machine languages are based on the binary code, a two- valued system that is well suited for electronic devices. In a binary repre- sentation scheme, everything is represented as a sequence of 1’s and 0’s, which corresponds closely to the computer’s electronic “on” and “off” states. For example, in binary code, the number 13 would be repre-

sented as 1101. Similarly, a particular address in the computer’s memory might be represented as 01100011, and an instruction in the computer’s instruction set might be represented as 001100.

The instructions that make up a computer’s machine language are very simple and basic. For example, a typical machine language might in- clude instructions for ADD, SUBTRACT, DIVIDE, and MULTIPLY, but it wouldn’t contain an instruction for AVERAGE. In most cases, a single in- struction, called an opcode, carries out a single machine operation on one or more pieces of data, called its operands. Therefore, the process of av- eraging two numbers would have to be broken down into two or more steps. A machine language instruction itself might have something sim- ilar to the following format, in which an opcode is followed by several operands, which refer to the locations in the computer’s primary memory where the data are stored. The following instruction says ADD the num- ber in LOCATION1 to the number in LOCATION2 and store the result in LOCATION3:

Opcode	Operand 1	Operand 2	Operand 3
011110	110110	111100	111101
(ADD)	(LOCATION 1)	(LOCATION 2)	(LOCATION 3)

Given the primitive nature of machine language, an expression like (a + b)/2 would have to be translated into a sequence of several machine language instructions that, in binary code, might look as follows:

In the early days of computing, before high-level languages were de- veloped, computers had to be programmed directly in their machine languages, an extremely tedious and error-prone process. Imagine how difficult it would be to detect an error that consisted of putting a 0 in the preceding program where a 1 should occur!

Fortunately, we no longer have to worry about machine languages, be- cause special programs can be used to translate a high-level or source code program into machine language code or object code, which is the only code that can be executed or run by the computer. In general, a pro- gram that translates source code to object code is known as a translator (Fig. 3). Thus, with suitable translation software for Java or C++ we can write programs as if the computer could understand Java or C++ directly. Source code translators come in two varieties. An interpreter trans- lates a single line of source code directly into machine language and ex- ecutes the code before going on to the next line of source code. A com- piler translates the entire source code program into executable object code, which means that the object code can then be run directly without further

translation.

There are advantages and disadvantages to both approaches. Inter- preted programs generally run less efficiently than compiled programs,

SECTION 0.6 • Why Java?9

Figure 3:Translator software

High-level

language

Machine

language

translates high-level source code to machine language object code.

Source code

Object code

because they must translate and execute each line of the program before proceeding to the next line. If a line of code is repeated, an interpreter would have to translate the line each time it is encountered. By contrast, once compiled, an object program is just executed without any need for further translation. It is also much easier to refine compiled code to make it run more efficiently. But interpreters are generally quicker and easier to develop and provide somewhat better error messages when things go wrong. Some languages that you may have heard of, such as BASIC, LISP, and Perl, are mostly used in interpreted form, although compilers are also available for these languages. Programs written in COBOL, FORTRAN, C, C++, and Pascal are compiled. As we will see in the next section, Java programs use both compilation and interpretation in their translation process.