The Full Wiki

OLE Automation: Wikis


Note: Many of our articles have direct quotes from sources you can cite, within the Wikipedia article! This article doesn't yet, but we're working on it! See more info or our list of citable articles.


From Wikipedia, the free encyclopedia

In Microsoft Windows applications programming, OLE Automation (later renamed by Microsoft to just Automation,[1][2] although the old term remained in widespread use), is an inter-process communication mechanism based on Component Object Model (COM) that was intended for use by scripting languages – originally Visual Basic – but now are used by languages run on Windows.[3] It provides an infrastructure whereby applications called automation controllers can access and manipulate (i.e. set properties of or call methods on) shared automation objects that are exported by other applications. It supersedes Dynamic Data Exchange (DDE), an older mechanism for applications to control one another.[4] As with DDE, in OLE Automation the automation controller is the "client" and the application exporting the automation objects is the "server".



Automation was designed with the ease of scripting in mind, so controllers often provide languages such as Visual Basic for Applications to end users, allowing them to control automation objects via scripts. Automation objects can be written in conventional languages such as C++,[5] but the C++ syntax for COM (and therefore Automation) is complex and not very programmer-friendly. In contrast, languages such as Visual Basic and Borland Delphi provide a convenient syntax for Automation which hides the complexity of the underlying implementation.



An Automation object is a COM object implementing the IDispatch interface (for details on interfaces and calling conventions, see Component Object Model). Automation objects are referred to as ActiveX objects, while an application that manipulates an ActiveX object is referred to as an ActiveX Client.[6] This interface exposes four methods, the most important of which is Invoke. This method allows calling methods of a class by name, with an arbitrary number of parameters. Neither the name of the method nor the number (and exact types) of parameters need to be known at compile time, as it is the case for COM objects not supporting Automation; moreover, in scripting languages there is no "compile time" at all. This technique is called late binding.

Most existing COM components are Automation-compliant and furthermore allow both late binding and traditional, compile-time early binding. This is achieved by implementing so-called dual interfaces, which are interfaces derived from IDispatch. Generally, both late and early binding expose the same functionality for Automation clients; languages such as Visual Basic and Delphi, as well as some C++ libraries, which provide a higher level of abstraction for COM, make sure that all Automation components created in these languages correctly duplicate their interfaces with late and early binding.

Generally, implementing early binding is more troublesome. Late binding is slower, but more reliable, as it does not require binary compatibility between versions of the same component. For late binding, a client only needs to know the name (or CLSID) of the desired object and the names and parameters of methods it actually uses, while for early binding, the client needs to know the complete definition and identifier (IID) for every interface requested, as well as the exact position of each method in the interface vtable. This is, however, more a problem of COM in general rather than Automation, as early binding relies on traditional COM calling conventions.

Automation servers may be implemented as single-use or multi-use.[7] For the former, each client connects to an individual instance of the server, with an individual instance of its class factory. For the latter, many clients connect to a single server, sharing that server's class factory across all clients.

The servers for the automation objects may be either out-of-process executables or in-process DLLs.[8]

Type libraries

In order to automate an application, the developer of an automation controller must know the object model that is employed by the target application exporting activation objects.[9] This requires that the developer of the target application publicly document its object model. Development of automation controllers without knowledge of the target application's object model is "difficult to impossible".[10] Because of these complications, Automation components are usually provided with type libraries, which contain metadata about classes, interfaces and other features exposed by an object library. Interfaces are described in Microsoft Interface Definition Language. Type libraries can be viewed using various tools, such as the Microsoft OLE/COM Object Viewer (oleview.exe, part of the Microsoft Platform SDK) or the Object Browser in Visual Basic (up to version 6) and Visual Studio .NET.

Also, type libraries are used to generate Proxy pattern/stub code for interoperating between COM and other platforms, such as Microsoft .NET and Java. For instance, the .NET Framework SDK includes tools that can generate a proxy .NET DLL to access Automation objects using both early binding (with information about interfaces extracted from a type library) and late binding (via IDispatch, mapped to the .NET Reflection API), with the built-in .NET-to-COM bridge called COM Interop.[11] While Java lacks built-in COM support, toolsets like JACOB[12] and jSegue[13] can generate proxy source code (consisting of two parts, a set of Java classes and a C++ source for a Java Native Interface DLL) from type libraries. These solutions only work on Windows.

Microsoft has publicly documented the object model of all of the applications in Microsoft Office,[14] and some other software developers have also documented the object models of their applications. Object models are presented to automation controllers as type libraries, with their interfaces described in ODL.

Language support

Automation is available for a variety of languages, including, but not limited to:


The following Perl code launches Microsoft Excel, loads an existing document, gets the values of four cells, exits the application, and then displays (from within Perl) the four values it got.

# This program illustrates controlling Excel from within Perl
# adapted from:
use strict;
use warnings;
use OLE;  # module
my $xlfile = "c:\\workingdir\\Perl\\test.xls";  # supply a test xls or csv
# Make OLE connection: this creates an instance of an Excel server
my $xl = CreateObject OLE 'Excel.Application' || die $!;
# Set application visibility: 1 = visible; 0 = not visible
$xl->{'Visible'} = 1;
# Open Excel file
my $book = $xl->Workbooks->Open($xlfile);
# Setup active worksheet
my $sheet = $book->Worksheets(1);
# Retrieve values from worksheet
my $a1 = $sheet->Range("A1")->{'Value'};
my $b1 = $sheet->Range("B1")->{'Value'};
my $a2 = $sheet->Range("A2")->{'Value'};
my $b2 = $sheet->Range("B2")->{'Value'};
# Close it up
print $a1, " ", $b1, "\n";  #1st row
print $a2, " ", $b2, "\n";  #2nd row

The following Visual Basic 6 (or VBA) code launches Microsoft Excel, creates a new document, selects some cells, types "Hello World!" into cells A1:C6, and then shows the application window. It operates using late binding, but early binding can be enabled simply by changing the object type from Object (an alias for IDispatch) to Excel.Application, provided that the Excel type library is referenced in the project.

Dim excelApp as Object
Set excelApp = CreateObject("Excel.Application")
excelApp.ActiveCell.Formula = "Hello World!"
excelApp.Visible = True

Here is a sample code for C++, which just makes the window visible, using the pure COM API for late binding without other libraries or wrappers:

CLSID clsid;
CLSIDFromProgID(L"Excel.Application", &clsid);
IDispatch *excelApp;
// Start Excel as a COM server in a separate process
hres = CoCreateInstance(clsid, 0, CLSCTX_LOCAL_SERVER, IID_IDispatch, reinterpret_cast<void **>(&excelApp));
    // error handling
DISPID dispidVisible;
OLECHAR *propertyName = "Visible";
hres = excelApp->GetIDsOfNames(IID_NULL, &propertyName, 1, LOCALE_SYSTEM_DEFAULT, &dispidVisible);
    // error handling
unsigned returnArg;
VARIANT varTrue;
DISPPARAMS params = { &varTrue, 1, 0, 0 };
// set the variant to a boolean true
varTrue.vt = VT_BOOL;
varTrue.boolVal = 0xFFFF;
// make the window visible: excelApp.Visible = True
hres = excelApp->Invoke(dispidVisible, IID_NULL, LOCALE_SYSTEM_DEFAULT, DISPATCH_PROPERTYPUT, params, 0, 0, &returnArg);
    // error handling
// ... use the object
// free the object

Here is an example C++ code (doing the same as the Visual Basic code) using VOLE, a compiler-independent COM/Automation driver library:

using vole::object;
using vole::collection;
object  excelApp    =   object::create("Excel.Application", CLSCTX_LOCAL_SERVER);
object  workBooks   =   excelApp.get_property<object>(L"Workbooks");
object  range       =   excelApp.get_property<object>(L"Range", "A1:C6");
object  activeCell  =   excelApp.get_property<object>(L"ActiveCell");
activeCell.put_property(L"Formula", "Hello World!");
excelApp.put_property(L"Visible", true);

Here is an example C++ code (doing the same as the Visual Basic code) using MFC-provided wrappers for Excel 97:

#include "excel8.h"
// ...
OleVariant covTrue((short)TRUE), covFalse((short)FALSE), covOptional((long)DISP_E_PARAMNOTFOUND, VT_ERROR);
_Application excelApp;
    // error handling
excelApp.GetActiveCell().SetFormula("Hello World!");

Finally, here is an example for PHP5:

$excelApp = new COM("Excel.Application") or die("Cannot create an Excel object");
$excelApp->ActiveCell->Formula = "Hello World!";
$excelApp->Visible = 1;

In C# (and VB.NET with strict type checking) late binding is always explicit, producing code almost as complex as the pure C++ example. In contrast, early binding in .NET offers cleaner-looking code, like the other three examples (although the given examples implicitly involve late binding while .NET does not).

Name confusion

Automation objects do not necessarily use Microsoft OLE, which stands for Object Linking and Embedding — currently only a subset of COM — although some of Automation objects (which are a special type of COM objects[1]) can be used in OLE and/or ActiveX environments. The confusion has its roots in Microsoft's earlier (rather vague) definition of OLE, which was previously more or less a synonym of COM — to the point that the acronym "OLE" frequently appears in legacy COM code, like parts of the MFC library.


  1. ^ a b Microsoft Corporation. "Automation (MFC)". MSDN.  
  2. ^ Kruglinski, David J.; Wingo, Scott; Shepherd, George (1998). "Chapter 25: Automation". Programming Microsoft Visual C++ 6.0 (5th ed.). Redmond, WA: Microsoft Press. ISBN 1-57231-857-0.  
  3. ^ Richard Potter (1999-04-18). "OLE Automation". Interprocess Communication for End-User Programming.  
  4. ^ Gordon McComb (1997). "Using OLE Automation to Control WordPerfect".   — McComb describes how to use OLE Automation instead of DDE to control WordPerfect
  5. ^ a b Chris Oakley. "OLE Automation for C++ programmers".  
  6. ^ Microsoft Corporation. "Overview of Automation". MSDN.  
  7. ^ "OLE automation routines in BASIC and C++". DB2: Application Development Guide: Programming Server Applications. IBM.  
  8. ^ a b Jan Dubois (Summer 1998). "Win32::OLE". The Perl Journal 3 (2).  
  9. ^ a b Yehuda Shiran and Tomer Shiran. "OLE Automation in JavaScript". WebReference.   — despite the title, the article discusses JScript rather than JavaScript
  10. ^ Bruce Armstrong (2006-01-16). "OLE — Extending the Capabilities of PowerBuilder (Part 2)". PowerBuilder Developers' Journal 12 (11).  
  11. ^ a b Appleman, Dan (2001). "Chapter 15: COM Interop and Accessing the Win32 API". Moving to VB.NET: Strategies, Concepts, and Code. Apress. ISBN 1-893115-97-6.  
  12. ^ "The JACOB Project". 2004-10-17.  
  13. ^ "jSegue". 2005-12-14.  
  14. ^ "How to find and use Office object model documentation". KnowledgeBase. Microsoft Corporation.;en-us;222101.  
  15. ^ "OLE Automation using Delphi".  
  16. ^ "Open Object Rexx".  
  17. ^ The PHP Group (2006-07-25). "PHP: COM and .Net (Windows)".  
  18. ^ Mark Hammond. "Python for Windows Extensions".  
  19. ^ Tcl programmers. "OLE from a Tcl perspective".  


OLE Automation in general

  • Microsoft Corporation (December 1993). OLE 2 Programmer's Reference: Creating Programmable Applications with OLE Automation v. 2. Programmer's Reference Library. Microsoft Press. ISBN 1-55615-629-4.  
  • Kraig Brockschmidt (1995). Inside OLE. Microsoft Press.  
  • Microsoft Corporation (1996). OLE Automation Programmer's Reference. Programmer's Reference Library. Microsoft Press. ISBN 1-55615-851-3.  

Object models for specific applications

  • Microsoft Corporation (1999). Microsoft Office 2000 Visual Basic for Applications Language Reference. Microsoft Press. ISBN 1-57231-955-0.   — full printed documentation of the object model of Microsoft Office

See also

External links

  • OLE Automation General paper on the introduction and problems implementing OLE.
  • "VOLE - A Neat C++ COM/Automation Driver" — an open-source, compiler-independent C++ COM Automation driver library, for use when having to drive IDispatch directly. VOLE is highly robust, fully encapsulates all "low-level" aspects of IDispatch, and is very flexible, taking and returning normal C++ types.

This article was originally based on material from the Free On-line Dictionary of Computing, which is licensed under the GFDL.


Got something to say? Make a comment.
Your name
Your email address