COBOLworx FactSheet
COBOLworx is a center of excellence for Open Source COBOL compilers and related technologies. We develop, maintain, and provide professional commercial support for GCC COBOL and GnuCOBOL.
History
COBOLworx is a division of Symas Corporation. The division was founded in 2015 in response to requests from industry partners for Open Source Software alternatives for current mainframe users.
COBOLworx’s first project was a prototype for a replacement for the IMS/DB (HIDAM) database subsystem. This had been specifically requested by the partner. This work led to development of the GnuCOBOL Debugging Extensions for GDB, the first instance of a source-level debugging capability for COBOL programs. This work was presented at an IBM Users Group meeting in 2018.
At the time, COBOLworx depended on GnuCOBOL as its compiler. Members of the team became committers and assisted the GnuCOBOL project to pull together GnuCOBOL 3.2, released in 2020(?).
In 2021, COBOLworx committed to developing a new COBOL compiler for inclusion in the GNU Compiler Collection (GCC). The work is based on the COBOL Language International Standard (ISO-IEC 1989:2023). In the fall of 2024 COBOLworx began the process of submitting GCC COBOL for inclusion in GCC Release 15. Formal release is expected by mid-May 2025.
Technical Rationale for Developing GCC COBOL
GnuCOBOL has been under continuous development since at least 2000. The compiler processes the COBOL program translating it into C language. Then the GCC C compiler is used to create the executable module (program). The C language intermediate step works well and produces small and efficient programs across the range of GCC’s portability.
Two things convinced COBOLworx that producing a true GCC-based COBOL compiler was important:
- GCC is a large and very active project. COBOLworx recognizes that it can be difficult to introduce Open Source Software packages into larger firms, institutions, and agencies. We felt that being part of GCC would significantly reduce that concern, and
- GCC produced outputs from COBOL programs that contained all the structural informaton (line numbers, etc.) from the C language intermediary source code. Early experience with source-level debugging with gdb was frustrating. We knew if we started clean and eliminated that step, debugging would be simpler and easier to maintain.
There is a third factor. COBOLworx hopes that GCC COBOL, as a member of the collection, will attract outside contributors. While adding contributors always means project complexity, the long term health of GCC COBOL would be best if the team grew.
Business Rationale for Developing GCC COBOL
COBOLworx learned from Symas. Symas has run a successful and profitable business by providing very high quality commercial technical support to users of otherwise free and open source software. Most smaller users will happily adopt and use a package for free and depend on Internet advice sources. But larger end-users are not willing to go without a dependable and capable technical support provider.
GnuCOBOL drove a few enterprises to COBOLworx for technical support. And we expect more. But we expect most COBOL shops will be more interested now that an official COBOL is available from GCC.
GCC COBOL Design Overview
The GNU Compiler Collection supports many source languages: C, C++, Go, FORTRAN, ADA, and now COBOL. Each language has a “Front End” that processes the source code and interacts with GCC’s internals (called “gimpel”) so GCC can construct the Abstract Syntax Tree (AST) from which it will build the machine-readable programs.
The AST is a machine independent view of the program’s logic and data. Most modern compilers build an AST from which they construct the machine specific code. GCC has long maintained many methods by which programs are produced from the same AST, each creating programs for specific chip architectures and operating systems.
GCC COBOL works with gimpel to generate the AST during early source program phases. The “Back End” then, optionally, further processes the AST for various optimizations (speed, program size, etc.). These are standard GCC capabilities. Finally, the AST is reduced to machine instructions used by the target chip architecture and builds an executable program using the operating system’s native interfaces.
As well as the logic and data descriptions in the AST, the GCC COBOL front end can provide debugging information. GCC takes that information and packages it into the appropriate form for the GNU Project Debugger (gdb). The user can either request debugging information in the resulting program or not. The machine instructions (logic) of the program is identical whether or not debugging data is included.
If debugging information is included, the module is larger but gdb can be used while it is running to sort out any issues in real time if needed.
GCC COBOL is written in C++. Approximately 130,000 lines of C++ code was added to GCC 15. It is isolated in the “gcobol” branch with some changes made throughout the GCC project code to accomodate this new language.
The COBOL language specifies a Compiler Directing Facility which is processed before the actual compilation of the source program. The rest of the compiler has roughly two major parts: the Parser processes the source code and deconstructs it and the AST-Builder processes that and feeds it to GCC’s AST processing.
In addition to the compiler itself, there are many intrinsic functions that perform date-time, financial, mathematical, and such computations specified in the standard that are included.