Search

12.4 — Early binding and late binding

In this lesson and the next, we are going to take a closer look at how virtual functions are implemented. While this information is not strictly necessary to effectively use virtual functions, it is interesting. Nevertheless, you can consider both sections optional reading.

When a C++ program is executed, it executes sequentially, beginning at the top of main(). When a function call is encountered, the point of execution jumps to the beginning of the function being called. How does the CPU know to do this?

When a program is compiled, the compiler converts each statement in your C++ program into one or more lines of machine language. Each line of machine language is given its own unique sequential address. This is no different for functions -- when a function is encountered, it is converted into machine language and given the next available address. Thus, each function ends up with a unique address.

Binding refers to the process that is used to convert identifiers (such as variable and function names) into addresses. Although binding is used for both variables and functions, in this lesson we’re going to focus on function binding.

Early binding

Most of the function calls the compiler encounters will be direct function calls. A direct function call is a statement that directly calls a function. For example:

Direct function calls can be resolved using a process known as early binding. Early binding (also called static binding) means the compiler (or linker) is able to directly associate the identifier name (such as a function or variable name) with a machine address. Remember that all functions have a unique address. So when the compiler (or linker) encounters a function call, it replaces the function call with a machine language instruction that tells the CPU to jump to the address of the function.

Let’s take a look at a simple calculator program that uses early binding:

Because add(), subtract(), and multiply() are all direct function calls, the compiler will use early binding to resolve the add(), subtract(), and multiply() function calls. The compiler will replace the add() function call with an instruction that tells the CPU to jump to the address of the add() function. The same holds true for subtract() and multiply().

Late Binding

In some programs, it is not possible to know which function will be called until runtime (when the program is run). This is known as late binding (or dynamic binding). In C++, one way to get late binding is to use function pointers. To review function pointers briefly, a function pointer is a type of pointer that points to a function instead of a variable. The function that a function pointer points to can be called by using the function call operator (()) on the pointer.

For example, the following code calls the add() function:

Calling a function via a function pointer is also known as an indirect function call. The following calculator program is functionally identical to the calculator example above, except it uses a function pointer instead of a direct function call:

In this example, instead of calling the add(), subtract(), or multiply() function directly, we’ve instead set pFcn to point at the function we wish to call. Then we call the function through the pointer. The compiler is unable to use early binding to resolve the function call pFcn(x, y) because it can not tell which function pFcn will be pointing to at compile time!

Late binding is slightly less efficient since it involves an extra level of indirection. With early binding, the CPU can jump directly to the function’s address. With late binding, the program has to read the address held in the pointer and then jump to that address. This involves one extra step, making it slightly slower. However, the advantage of late binding is that it is more flexible than early binding, because decisions about what function to call do not need to be made until run time.

In the next lesson, we’ll take a look at how late binding is used to implement virtual functions.

12.5 -- The virtual table
Index
12.3 -- Virtual destructors, virtual assignment, and overriding virtualization

62 comments to 12.4 — Early binding and late binding

  • Shashank More

    "the compiler converts each statement in your C++ program into one or more lines of machine language.Each line of machine language is given its own unique sequential address" (in Early binding).

    But at the compile time how can the compiler set the address of the functions because addresses are resolved or given at the run time not at compile time(This is what we learned in Pointers)????

    • Alex

      Statements, functions, and static variables are all known at compile time, so the compiler can determine what address they will occupy.

      Dynamic variables have their addresses set at runtime (because the OS allocates the memory).

      Functions aren’t dynamic variables, so their addresses aren’t subject to this rule.

      • Shashank More

        what about local variable?? Is their address is also known at the compile time?? I don’t think so!!

        • Alex

          Yes.

          You’re misunderstanding the issue. The compiler doesn’t need to know the actual memory address the variable will end up with. That’s governed by a lot of factors, including where in memory the OS decides to load the program in. Simplifying a bit, the compiler only needs to know where the identifier lives relative to the start of the program. In the case of a local variable int x, the compiler only needs to know where x lives relative to the start of the stack frame.

Leave a Comment

Put C++ code inside [code][/code] tags to use the syntax highlighter