Why doesn’t this valid C program compile?

I wanted to use some C code I found, and when I tried to compile the source, it kept complaining because of a syntax error.

... 
void printMD() 
{ 
      printf("ABC   "); 
      int i; 
      i = 0; 
      printf("I is %i\n",i); 
} 
...

The original code was several thousand lines long. The (first) error message was

ERROR CCN3275 ./perfutic.c:4     Unexpected text 'int' encountered.

There were two challanges

Create the smallest program to isolate
Find out why the code failed to comile

Isolate the problem

I put #ifdef temp…. #endif around blocks of code to remove irrelevant code. This worked, but I quickly got into a mess where I had many #ifdef…#endif and matching them up.

I saved a copy of the program, then used #ifdef..#endif to ignore blocks of code. If this made no change to the compile, then delete the block, and add more #ifdef…#endif. If it caused other error messages, remove the #ifdef..#endif statements, and try a smaller block of code.

Eventually I got down to the code above.

Why did the code fail to compile?

I spent half a day scratching my head, and when I came back next morning, I had a flash of inspiration.

Later versions of C are more flexible in some areas.

In early days of the C compiler you had to define all variables before you did any work. With later versions of the C compiler this restriction was relaxed, so you could call a function, the define a variable.

See intermingled declarations and code: variable declaration is no longer restricted to file scope or the start of a compound statement (block) in the ISO C99 specification

The makefile I was using was

cparms ="-Wc,SO,LIST(lst31),XREF,ILP32,DLL,SSCOM,SHOWINC,DEFINE(MVS=1)"  
cc  -c -o $@ -I"//'MQCD94.SCSQC370'" -I'/usr/include' -I. $(cparms)  $<

You control the level of C using the LANGLVL option.

The cc command produced (in the listing)

Language level. . . . . . . . : *COMMONC:NOTEXTAFTERENDIF

With the c99 command, the code compiled, and the listing had

 Language level. . . . . . . . : *STDC99:NOTEXTAFTERENDIF

With the original compile, I was using the options which did not allow variables to be defined after executable code.

Instead of using cc, I could have used cc with langlvl(EXTENDED).

Why doesn’t this valid C program compile?

The short answer is : I was using the wrong compiler options.

An easy question: how do you print a long long nicely as hex?

I wanted to print a long long nicely, consistent with other lines of output. It took me a while to get working properly.

The code

printf("...  %20.16llx\n",ll);

The x means print as hexadecimal
ll means treat the value as a long long
20.16. The 20 is the the minimum number of characters printed. The second 16 specifies the number of characters to be output. In the past, I’ve used formatting to print numbers so they line up in a column.

Below is the formatting string, and the output

.%16.llx.    .              68.
.%16.16llx.  .0000000000000068.
.%.llx.      .68.
.%llx.       .68.

My final formatting string is

printf("Serial Number in hex:%llx\n",ll);

Serial Number in hex:68

It is another of the “it is always easy when you know the answer”.

My original problem was “how do you nicely print a variable length string”.

I solved it

			
char longChar[8];
char * pData;      // points to the data
int lData = ...  ; // length of the data
memset(&longChar[0],0,sizeof(longChar); // clear it to 0
memcpy(&longChar[8-lData],
       pdata, 
       lData
      ); 
long long ll;        
memcpy(&ll,&longChar,8);       
printf("Serial number Hex %llx\n",ll);

		

There may be better ways of doing it (please suggest a better way of doing it), but it works.

Why is the size of my enum larger than yours?

I’ve been doing some coding with GSK, system SSL, and had problems getting the data to match.

There are some definitions and my code

			
typedef enum { 
    gskdb_rectype_unknown       = 0, 
    gskdb_rectype_keyPair       = 1, /* Request key pair */ 
    gskdb_rectype_certificate   = 2, /* Certificate */ 
    gskdb_rectype_certKey       = 3, /* Certificate with key */ 
    gskdb_rectype_maximum       = 32767 
} gskdb_record_type; 
typedef struct _gskdb_record { 
    gskdb_record_type           recordType; 
    ....
} gskdb_record; 
...
gskdb_record ccp;
    
printf("Length %i\n",sizeof(ccp .recordType));

		

It prints out a length of 4.

If I display the data in the control block, the value of ccp.recordType is 0x0003C000.

This had me scratching my head. It needed a trip to the shops, and lunch before I spotted the problem

In the options section of the compiler list it had

 *ENUMSIZE(INT)

This says treat all enums as integer. The default is enums(SMALL).

The smallest size of gskdb_record_type is 2 bytes, but I had specified use 4 bytes. The true value of the field is 0x003 (Certificate with key) rather than an undefined 0003c000!

I removed the -qenum=int from my compiler switches.

When I recompiled, and reran the program – it all worked, giving the result 3 -> certificate with key!

Using Re-entrant assembler macros in C ASM()

I needed to use some assembler macros inside a C program, because there was no native C interface. This was to prompt the operator for a password, but not to display the value which was entered.

This “simple program” took a few hours to get working.

I’ve written Re-entrant assembler macros in z/OS explaining how to use assembler macros in re-entrant code.

I needed to do this from with a C program using the ASM() definition to write assembler code.

Set up the C code

 /*Include standard libraries */ 
  #include <stdio.h> 
  #include <stdlib.h> 
  #include <string.h> 

int main( int argc, char *argv[])  
{
  struct{
    short ll;  // length of the message
    char text[120]; // the message itself
  } outputMsg; // the displayed message 
  int tempWto[100] ; // plenty of space, on a full word boundary
  int ECB = 0;  // wait on this 
  // Define the reply area
  int lReply = 100;
  // use +1 to define a trailing null
  char reply[lReply+1];
  memset(&reply[0],0,lReply+1); // set to nulls

  int rc = 0;

  char * outMsg = "Please give the password for userid ABC";
  strncpy(&outputMsg.text[0],outMsg,sizeof(outputMsg.text));
  outputMsg.ll = strlen(outMsg);

The WTOR macros

The macros I needed to use were the Write To Operator with Reply(WTOR).

The code needs to

define the static data for the WTOR
copy the WTOR static structure data to thread read-write storage
execute the WTOR passing the parameters, and the WTOR structure in read-write storage. The parameters are
- the message to display
- the address of the reply buffer
- the length of the reply buffer
- the ECB to wait on

Define the static data

asm(  "WTORL   WTOR TEXT=(,,,),MF=L,ROUTCDE=(9) \n"
      "OVERWTO  DS 0H \n"

The ROUTCDE=(9) says supress what was typed in. This WTOR is being used to prompt for a password, using ROUTCDE=(9) achieves this.

Execute the WTOR

  asm(...
      "    LA  2,%[out] \n"
      "    WTOR TEXT=((2),(%[pReply]),(%[rLen]),%[ECB]),MF=(E,%[pData]) \n"
      "    ST  15,%[rc] \n"
      "    LTR  15,15  \n"
      "    JNZ   ERROR \n"
      "    WAIT  1,ECB=%[ECB]  \n" 
      "ERROR  DS 0H \n" 
     
     
     : [rc] "+m"(rc), //* output
       [rLen] "+r"(lReply)  
     : [out]  "m"(outputMsg),
       [pReply] "r"(&reply),
       [ECB]   "m"(ECB),
       [pData]  "m"(tempWto[0]) 
     :"r0", "r1" , "r15", "r2" );

The LA 2,%[out] code uses the definition [out] further down. The “m” says use memory, and substitute the address of outputMsg. In the C program this is at address 168 off register 13.

The LA 2,%[out] becomes

 LA    2,168(13)

The WTOR TEXT=((2),(%[pReply]),(%[rLen]),%[ECB]),MF=(E,%[pData]) statement becomes

WTOR  TEXT=((2),(5),(4),696(13)),MF=(E,296(13))

[pReply] was defined as a temporary register “r”. It is surrounded by () to show that it is a register
[rLen] was defined as a temporary register “r”. It is surrounded by () to show that it is a register
[ECB] is defined as a memory location, and it’s address 696 off register 13 is used
the read write storage to use is at offset 296 off register 13

It was hard to know which fields had to be passed in registers, and which could be passed as memory addresses. I solved it by trial and error.

The hard part

The statically defined structure needs to be copied to thread read write storage. This proved a challenge.

In assembler you can use an instruction like MVC TO(24),FROM and it copies 24 bytes from FROM to TO. Using the assembler from C means you cannot use this.

There are no base+using registers, so you cannot reference a field by a label
You cannot specify a length field when using %[name].

I used the MVCL which allows registers to be used to address the data, and specify the length. You need two registers to identify the “from” area, and two registers to identify the “to” area.

  asm("    BRAS 14,OVERWTO  \n"  // point R14 to the constant area
      "WTORL   WTOR TEXT=(,,,),MF=L,ROUTCDE=(9) \n"
      "OVERWTO  DS 0H \n"
      "    LA    15,OVERWTO-WTORL \n"  
      "    LR    1,15 \n"  // make the lengths the same
      "    LA    0,%[pData] \n" // where the data is stored
      "    MVCL  0,14 \n" // move from the static to the dynamic

Where

BRAS 14,OVERWTO sets register 14 to the address of the data following, and jumps to the label OVERWTO
LA 15,OVERWTO-WTORL gets the length of the statically defined data
LR 1,15 copies the length of the data into register 1
LA 0,%[pData] points register 0 to the address of the thread read-write storage
MVCL 0,14 This copies the data from what register 14 points to (the static data) with a length of the content of register 15, into the storage pointed to by register 0, of length in the contents of register 1

Wait for the reply

The code was

"    WTOR TEXT=((2),(%[pReply]),(%[rLen]),%[ECB]),MF=(E,%[pData]) \n"
"    ST  15,%[rc] \n"
"    LTR  15,15  \n"
"    JNZ   ERROR \n"
"    WAIT  1,ECB=%[ECB]  \n" 
"ERROR  DS 0H \n"

The code

checks the return code from the WTOR was zero
If not, skip the ECB wait
Wait for one ECB posted, with the specified ECB

The after code

printf("Return code %i\n",rc);
printf("Data %i %s\n",strlen(reply),reply);

return rc;
}

This prints what the user entered. Because the reply buffer was primed with hex 00, you can use STRLEN to get the length of the returned string.
The the program ran, it returned data in upper case. I had to reply to the WTOR on the console using R nn,’lower case’.

The whole program

 /*Include standard libraries */ 
  #include <stdio.h> 
  #include <stdlib.h> 
  #include <string.h> 

int main( int argc, char *argv[])  
{
  // the output message 
  struct{
    short ll;
    char text[120];
  } outputMsg;
  int tempWto[100] ; // plenty of space, on a full word boundary
  int ECB = 0;
  int lReply = 100;
  // use +1 to define a trailing null
  char reply[lReply+1];
  memset(&reply[0],0,lReply+1); // set to nulls
  int rc = 0;
  char * outMsg = "Please give the password for userid ABC";
  strncpy(&outputMsg.text[0],outMsg,sizeof(outputMsg.text));
  outputMsg.ll = strlen(outMsg);

  
  asm("    BRAS 14,OVERWTO  \n"  // point R14 to the constant area
      "WTORL   WTOR TEXT=(,,,),MF=L,ROUTCDE=(9) \n"
      "OVERWTO  DS 0H \n"
      "    LA    15,OVERWTO-WTORL \n"  
      "    LR    1,15 \n"  // make the lengths the same
      "    LA    0,%[pData] \n"
      "    MVCL  0,14 \n" // move from the static to the dynamic
      "    LA  2,%[out] \n"
      "    WTOR TEXT=((2),(%[pReply]),(%[rLen]),%[ECB]),MF=(E,%[pData]) \n"
      "    ST  15,%[rc] \n"
      "    LTR  15,15  \n"
      "    JNZ   ERROR \n"
      "    WAIT  1,ECB=%[ECB]  \n" 
      "ERROR  DS 0H \n" 
     
     
     : [rc] "+m"(rc) // + means modified output
        
     : [out]      "m"(outputMsg),
       [pReply] "r"(&reply),
       [rLen]   "r"(lReply), 
       [ECB]    "m"(ECB),
       [pData]  "m"(tempWto[0]) 
     :"r0", "r1" , "r15", "r2"
     );  
 
  printf("Return code %i\n",rc);
  printf("Data %i %s\n",strlen(reply),reply);

  return rc;
}

How do I create a load module in a PDS from Unix?

This is another of the little problems which are easy once you know the anwser.

I used the shell program to compile my program.

name=extract 
                                                                                                                    
export _C89_CCMODE=1 

p1="-Wc,arch(8),target(zOSV2R3),list,source,ilp32,gonum,asm,float(ieee)" 
p7="-Wc,ASM,ASMLIB(//'SYS1.MACLIB')                   " 
p8="-Wc,LIST(c.lst),SOURCE,NOWARN64,XREF,SHOWINC -Wa,LIST(133),RENT" 

# compile it                                                                                                                    
xlc  $p1 $p7  $p8   -c $name.c -o $name.o 

l1="-Wl,LIST,MAP,XREF,AC=1 " 
# create an executable in the file system
/bin/xlc $name.o  -o $name   -V   $l1    1>a 
extattr +a $name 

# create a load module in a PDS
/bin/xlc $name.o  -o "//'COLIN.LOAD(EXTRACT)'"  -V $l1    1>a

Create an executable in the file system

The first bind xlc step creates an object with name “extract” in the file system.

Specify the load module

The second bind step specified a load module in a PDS. The load module is stored in COLIN.LOAD. If you copy and paste the line, make sure you have the correct quotes ( double quote, //, single quote, dataset(member),single quote,double quote). Sometimes my pasting lost a quote.

Process assembler code

My program has some assembler code…

 asm( ASM_PREFIX 
         " STORAGE RELEASE,...
         :"r0", "r1" , "r15" );

It needs the options “-Wc,ASM,ASMLIB(//’SYS1.MACLIB’) ” to compile it, and specify the location of the assembler macros.

Binder parameters

The line parameters in -Wl,LIST,MAP,XREF,AC=1 are passed to the binder.

Message – wrong suffix on the source file

Without the export _C89_CCMODE=1 I got the message

FSUM3008 Specify a file with the correct suffix (.c, .i, .s, .o, .x, .p, .I, or .a), or a corresponding data set name, instead of -o ./extract.

Putting assembler code inside a C program

I was using a RACF service, and the documentation casually says “the application must free the returned storage”.

C does not provide facilities to use the STORAGE RELEASE z/OS service, so I had to write some assembler code to do this. It wasn’t difficult, I just fell over many little problems.

My code
- With the SYSSTATE ARCHLVL=2
- Without the SYSSTATE ARCHLVL=2
Using registers
- Using registers with my program
Using variables
Using the different data types.
64 bit programming
Compiling the code

The IBM extension to the C language ASM is defined in the documentation.

I think of asm() as a special macro.

You pass in strings of assembler code. asm() will format it, and split the line if it would be longer than 72 characters.
If you want multiple lines you should end each line with \n (standard printf in C).
Lines starting with “*” are treated as comments, and produce no code
You can start your line with just one blank, before the instruction and it will be formatted as properly aligned assembler code, wrapping onto new lines if needed.
You do not use the C variable names in the assembler source.
- You specified a mapping like [length] “m”(lDeletestorage), it takes the C variable lDeletestorage, determines its storage address such as it is offset 40 from register 5 (40(5)) and assigns this value to “length”
- In the assembler code you specify ” L 2,%[length] \n” and it substitutes the value for %[length]. This instruction becomes ” L 2,40(5) “
- At first glance it you would think it should be able to substitute the value directly, but you need to specify meta information about the field (is it used read only or read/write, is it a storage location, or a literal string). This is why there is this indirection
You need to specify which registers are modified so the C code can either save/restore their values, or just use different registers.

My code

if ( pDeletestorage != 0 )
{
  int freerc = 0;  
  #define ASM_PREFIX " SYSSTATE ARCHLVL=2 \n" 
  asm( 
     ASM_PREFIX 
     " L     2,%[length] \n"
     " L     4,%[a] \n"
     " STORAGE RELEASE,LENGTH=(2),ADDR=(4),COND=YES,RTCD=%[rc] "
     : [rc] "+m"(freerc) //* output
     : [length] "m"(lDeletestorage), 
       [a] "m"(pDeletestorage)
     :"r0", "r1" , "r2" , "r4" , "r15" ); 
  printf("Storage release rc %i\n",rc);
}

if ( pDeletestorage != 0 ) If there is a block of storage to release
{
int freerc = 0; define and preset the return code
#define ASM_PREFIX ” SYSSTATE ARCHLVL=2 \n” see below. The \n makes a new line
asm( generate the code
ASM_PREFIX insert the SYSTATE code
” L 2,%[length] \n” Load register 2 with the value of the variable length defined below
” L 4,%[a] \n” Load register 4 with the value of the variable a(ddress) defined below
” STORAGE RELEASE,LENGTH=(2),ADDR=(4),COND=YES,RTCD=%[rc] “ issue the storage macro
: After the first : is a list of output variables
- [rc] “+m”(freerc) //* output [rc] matches the name in the RTCD=%[rc]
  - “+” means Indicates that the operand is both read and written by the instruction
  - “m” is use a memory object – that is, a variable. Other options are literal values or registers.
  - (freerc) is the name of the C variable
: After the second : is a list of input variables
- [length] “m”(lDeletestorage),
- [a] “m”(pDeletestorage)
: After the third : is a list of register that are changed (clobbered)
- “r0”, “r1” , “r2” , “r4” , “r15”
);
printf(“Storage release rc %i”,rc);
}

You have to pass the data to the macro using registers. You can either select registers yourself, or let the compiler pick them for you. See Using registers with my program as a more correct way of coding the program.

Note: If you pick the registers yourself, they may be unavailable if you compile it as a 64 bit program or as XPLINK, so using Using registers with my program is better.g

With the SYSSTATE ARCHLVL=2

This statement sets the minimum architecture to z/Architecture (which includes Jump instructions).

The generated code was

* SYSSTATE ARCHLVL=2                                              
* THE VALUE OF SYSSTATE IS NOW SET TO ASCENV=P AMODE64=NO ARCHLVX01-SYSS 
*        L=2 OSREL=00000000 RMODE64=NO 
  L     2,1368(13)                                                 
  L     4,1364(13)  
                                               
  STORAGE RELEASE,LENGTH=(2),ADDR=(4),COND=YES,RTCD=1372(13)       
  LR     0,2                          .STORAGE LENGTH            
  LR     1,4                          .ADDRESS OF STORAGE        
  LHI    15,X'0001'                   .Add in parameters     
  L      14,16(0,0)                   .CVT ADDRESS                
  L      14,772(14,0)                 .ADDR SYST LINKAGE TABLE    
  L      14,204(14,0)                 .OBTAIN LX/EX FOR RELEASE  
  PC     0(14)                        .PC TO STORAGE RTN         
  ST     15,1372(13)                  .SAVE RETURN CODE

Where

  L     2,1368(13)       
  L     4,1364(13)

is loading the registers with the variable data from the C code,

and

  ST     15,1372(13)                  .SAVE RETURN CODE

saves the return code into the C variable.

Without the SYSSTATE ARCHLVL=2

The code failed to compile because of addressability issues.

         L     2,1368(13)                                              
         L     4,1364(13)                                              
         STORAGE RELEASE,LENGTH=(2),ADDR=(4),COND=YES,RTCD=1372(13)    
         CNOP   0,4                                                    
         B      IHB0001B                     .BRANCH AROUND DATA 
*** ASMA307E No active USING for operand IHB0001B       
IHB0001F DC     BL1'00000000'                                          
         DC     AL1(0*16)                    .KEY                      
         DC     AL1(0)                       .SUBPOOL                  
         DC     BL1'00000001'                .FLAGS                    
IHB0001B DS     0F                                                     
         LR     0,2                          .STORAGE LENGTH           
         LR     1,4                          .ADDRESS OF STORAGE       
         L      15,IHB0001F                  .CONTROL INFORMATION 
*** ASMA307E No active USING for operand IHB0001F 
         L      14,16(0,0)                   .CVT ADDRESS              
         L      14,772(14,0)                 .ADDR SYST LINKAGE TABLE  
         L      14,204(14,0)                 .OBTAIN LX/EX FOR RELEASE 
         PC     0(14)                        .PC TO STORAGE RTN        
         ST     15,1372(13)                  .SAVE RETURN CODE

In days of old you had base registers to locate data and instructions. Data reference was relative to (USING) a base register. Branching within a program used one or more base register. A re-entrant program would get dynamic storage, and this would be addressed by its own base register.

Modern instructions include the Jump instruction. This says jump this many half words to the instruction. These instructions do not need a base register.

With SYSSTATE ARCHLVL=2, the generated code for the C part of the program used the jump instructions, and did not need a base register. Assembler macros that generate code the old way, need a base register.

Using old instruction, to load a value into a register, it was loaded from storage – which needed a base register to locate the storage. For example

IHB0001F DC     BL1'00000000'                                          
         DC     AL1(0*16)                    .KEY                      
         DC     AL1(0)                       .SUBPOOL                  
         DC     BL1'00000001'                .FLAGS   
...
         L      15,IHB0001F                  .CONTROL INFORMATION

Modern instructions have the “constant” value as part of the instruction

 LHI    15,X'0001'

This effectively clears register 15 and loads the value 0x0001 into the bottom. (To be accurate it moves 0x0001 to the bottom half, then propagates the sign bit to the upper half word. The sign bit is 0.)

Using registers

The z/OS uses registers 14,15,0,1 for linkage, and these are likely to be used when using macros. Register 3 was used by the C code to locate storage, so I used registers 2 and 4.

Using variables

Reading the documentation, it seems I should be able to say

 " STORAGE RELEASE,LENGTH=%[length],ADDR=(4),COND=YES,RTCD=%[rc] "
                : ...
                : [length] "m"(lDeletestorage), 
                  [a] "m"(pDeletestorage)
                :...

passing in a variable. This does not work, because C does not pass in “lDeletestorage”, but the offset and register.

For example the storage for variable lDeleteme is 1368 off register 13.

It can be used in a Load instruction

  L     2,1368(13)

But not every where. In the macro it get substituted.

STORAGE RELEASE,LENGTH=1368(13),ADDR=(4),COND=YES,RTCD=1372(13)
 L      0,=A(1368(13))               .STORAGE LENGTH  
*** ASMA035S Invalid delimiter - 1368(13)

Passing back the return code using RTCD=%[rc]. The macro checks to see if the value of RTCD is a register, if not then use the value in a store instruction.

Using registers

You can specify that you want to use a register, and the ASM() will pick registers for you.

 int res = 0x12345678;        
 int newRes = 55;
 __asm(" SR %[rx],%[rx]  clear \n"
       " SR %[rx],%[ry]  them  \n"  
       "  LR %[rx],%[ry] COLINS\n" 
              
       :  [rx]"=r"(res)   // output
       :  [ry]"r"(newRes) // input
      );

This generates code

     L        r4,newRes(,r13,1380) 
*     SR    2,2                     clear         
*     SR    2,4                     them          
*     LR    2,4                     COLINS        
      SR       r2,r2 
      SR       r2,r4 
      LR       r2,r4 
 
      LR       r0,r2 
      ST       r0,res(,r13,1376)

It has selected to use registers 2 and 4.

Where

L r4,newRes(,r13,1380) loads the input value into a register of its choice
*… these are comments of the coded instructions
SR.. LR.. are the generated instructions
LR r0,r2 , ST r0,res(,r13,1376) saves the variable defined as output back into C storage.

Using registers with my program

The compiler decides which registers to user ( r2 and r4). When I compiled this code as 64 bit (using option lp64) it used registers 6 and 7.

asm( ASM_PREFIX 
     " STORAGE RELEASE,LENGTH=(%[length]),ADDR=(%[a]),COND=YES,RTCD=%[rc] "
    : [rc] "+m"(freerc) //* output
    : [length]  "r"(lDeleteme), 
      [a] "r"(pDeleteme)
    :"r0", "r1" , "r15" );

This generates

 *          asm( 
            L        r2,lDeleteme(,r13,1368) 
            L        r4,pDeleteme(,r13,1364) 
         SYSSTATE ARCHLVL=2 

         STORAGE RELEASE,LENGTH=(2),ADDR=(4),COND=YES,RTCD=1372(13)     
         LR     0,2                          .STORAGE LENGTH            
         LR     1,4                          .ADDRESS OF STORAGE        
         LHI    15,X'0001'                   .Add in parameters    @PCA 
         L      14,16(0,0)                   .CVT ADDRESS               
         L      14,772(14,0)                 .ADDR SYST LINKAGE TABLE   
         L      14,204(14,0)                 .OBTAIN LX/EX FOR RELEASE  
         PC     0(14)                        .PC TO STORAGE RTN         
         ST     15,1372(13)                  .SAVE RETURN CODE

Using the different data types.

This is not explained very well in the documentation.

Using literal integer constants

The code

asm( "LABEL LA     4,%[i] \n" 
     : 
     : [i] "i"(99)
     : "r4" );

Generates

 * LABEL    LA    4,99                    Colins             
             LA       r4,99

The label I specified on the instruction is not added to the created instruction.

Using literal string constants

Instead of using this capability, you could use C constants, and the “m” definition.

The code

asm( "LABEL LA     4,%[i] Colins \n" 
     : 
     : [i] "i"("ABCD")
     : "r4" );

Gives a compile error

10 LABEL    LA    4,ABCD                  Colins    
 *** ASMA044E Undefined symbol - ABCD

The code

asm("LABEL LA     4,%[i] Colins \n" 
                : 
                : [i] "i"("=C'ABCD'")
                : "r4" );

Generates

* LABEL    LA    4,=C'ABCD'              Colins          
           LA    r4,0(,r3)
... 
Start of ASM Literals 
000360  C1C2C3C4     =C'ABCD'

Using a memory operand that is offsetable (o)

The code

asm("LABEL LA     4,%[i] Colins \n" 
                    "LABEL2 LA    4,%[j] Colins2\n"
                : 
                : [i] "o"(res),
                 [j]  "m"(res)  
                : "r4" );

produced

* LABEL    LA    4,1376(13)              Colins      
* LABEL2   LA    4,1376(13)              Colins2     
               LA       r4,1376(r13,) 
               LA       r4,1376(r13,)

So it looks like the “o” operand is the same as specifying “m”.

64 bit programming

When you compile in 64 bit code (option lp64). The asm() works just as well It is better to use Using registers with my program so you do not have to guess which registers you can use.

My code

asm( ASM_PREFIX 
     " STORAGE RELEASE,LENGTH=(%[l]),ADDR=(%[a]),COND=YES,RTCD=%[rc] "
     : [rc] "+m"(freerc) //* output
     : [l]  "r"(lDeleteme), 
       [a] "r"(pDeleteme)
     :"r0", "r1" , "r15"
    );

Generated

    LLGF     r6,lDeleteme(,r4,3504) 
    LG       r7,pDeleteme(,r4,3496) 
    SYSSTATE ARCHLVL=2
...

Where the LLGF loads the 32 bit length variable into register 6, and LG loads the 64 bit address variable into register 7.

Compiling the code

I used the shell script

name=irrseq
export _C89_CCMODE=1 
p1="-Wc,arch(8),target(zOSV2R3),list,source,ilp32,gonum,asm,float(ieee)" 
p5="                           -I.                            " 


p7="-Wc,ASM,ASMLIB(//'SYS1.MACLIB') -Wa,LIST,RENT" 

p8="-Wc,LIST(c.lst),SOURCE,NOWARN64,XREF,SHOWINC " 
                                                                                                                   
xlc  $p1 $p5 $p7  $p8   -c $name.c -o $name.o 
                                                                                                                   
l1="-Wl,LIST,MAP,XREF       " 
/bin/xlc $name.o  -o irrseq  -V   $l1    1>a

The important line, p7, contains

-Wc,ASM,ASMLIB(//’SYS1.MACLIB’) C compile options
- ASM Enables inlined assembly code inside C/C++ programs.
- ASMLIB Specifies assembler macro libraries to be used when assembling the assembler source code.
-Wa,LIST,RENT” Assembler options
- LIST Instructs the assembler to produce a listing
- RENT Specifies that the assembler checks for possible coding violations of program reenterability.

That’s strange – the compile worked.

I was setting up a script to compile some C code in Unix Services, and it worked – when I expected the bind to fail because I had not specified where to find a stub file.

How to compile the source

I used a shell script to compile and bind the source. I was surprised to see that it worked, because it needed some Linkedit stubs from CSSLIB. I thought I needed

export _C89_LSYSLIB=”CEE.SCEELKEX:CEE.SCEELKED:CBC.SCCNOBJ:SYS1.CSSLIB”

but it worked without it.

The script

name=irrseq 
                                                                                  
export _C89_CCMODE=1 
# export _C89_LSYSLIB="CEE.SCEELKEX:CEE.SCEELKED:CBC.SCCNOBJ:SYS1.CSSLIB" 
p1="-Wc,arch(8),target(zOSV2R3),list,source,ilp32,gonum,asm,float(ieee)" 
p5=" -I. " 
p8="-Wc,LIST(c.lst),SOURCE,NOWARN64,XREF,SHOWINC " 
                                                                                    
xlc  $p1 $p5  $p8   -c $name.c -o $name.o 
# now bind it                                                                                    
l1="-Wl,LIST,MAP,XREF      " 
/bin/xlc $name.o  -o irrseq  -V   $l1    1>binder.out

The binder output had

XL_CONFIG=/bin/../usr/lpp/cbclib/xlc/etc/xlc.cfg:xlc 
-v -Wl,LIST,MAP,XREF irrseq.o -o./irrseq 
STEPLIB=NONE 
_C89_ACCEPTABLE_RC=4 
_C89_PVERSION=0x42040000 
_C89_PSYSIX= 
_C89_PSYSLIB=CEE.SCEEOBJ:CEE.SCEECPP 
_C89_LSYSLIB=CEE.SCEELKEX:CEE.SCEELKED:CBC.SCCNOBJ:SYS1.CSSLIB

Where did these come from? – I was interested in SYS1.CSSLIB. It came from xlc config file below.

xlc config file

By default the compile command uses a configuration file /usr/lpp/cbclib/xlc/etc/xlc.cfg .

The key parts of this file are

* FUNCTION: z/OS V2.4 XL C/C++ Compiler Configuration file
*
* Licensed Materials - Property of IBM
* 5650-ZOS Copyright IBM Corp. 2004, 2018.
* US Government Users Restricted Rights - Use, duplication or
* disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
*

* C compiler, extended mode
xlc:          use               = DEFLT
...* common definitions
DEFLT: cppcomp           = /usr/lpp/cbclib/xlc/exe/ccndrvr 
       ccomp             = /usr/lpp/cbclib/xlc/exe/ccndrvr 
       ipacomp           = /usr/lpp/cbclib/xlc/exe/ccndrvr 
       ipa               = /bin/c89 
       as                = /bin/c89 
       ld_c              = /bin/c89 
       ld_cpp            = /bin/cxx 
       xlC               = /usr/lpp/cbclib/xlc/bin/xlc 
       xlCcopt           = -D_XOPEN_SOURCE 
       sysobj            = cee.sceeobj:cee.sceecpp 
       syslib            = cee.sceelkex:cee.sceelked:cbc.sccnobj:sys1.csslib 
       syslib_x          = cee.sceebnd2:cbc.sccnobj:sys1.csslib 
       exportlist_c      = NONE 
       exportlist_cpp    = cee.sceelib(c128n):cbc.sclbsid(iostream,complex) 
       exportlist_c_x    = cee.sceelib(celhs003,celhs001) 
       exportlist_cpp_x  = ... 
       exportlist_c_64   = cee.sceelib(celqs003) 
       exportlist_cpp_64 = ...
       steplib           = NONE

Where the _x entries are for xplink.

It is easy to find the answer when you know the solution.

Note:

Without the export _C89_CCMODE=1

I got

IEW2763S DE07 FILE ASSOCIATED WITH DDNAME /0000002 CANNOT BE OPENED
BECAUSE THE FILE DOES NOT EXIST OR CANNOT BE CREATED.
IEW2302E 1031 THE DATA SET SPECIFIED BY DDNAME /0000002 COULD NOT BE
FOUND, AND THUS HAS NOT BEEN INCLUDED.
FSUM3065 The LINKEDIT step ended with return code 8.

Compiling in 64 bit

It was simple to change the script to compile it in 64 bit mode, but overall this didn’t work.

p1="-Wc,arch(8),target(zOSV2R3),list,source,lp64,gonum,asm,float(ieee)" 
...
l1="-Wl,LIST,MAP,XREF -q64  "

When I compiled in 64 bit mode, and tried to bind in 31/32 bit mode (omitting the -q64 option) I got messages like

 IEW2469E 9907 THE ATTRIBUTES OF A REFERENCE TO isprint FROM SECTION irrseq#C DO
          NOT MATCH THE ATTRIBUTES OF THE TARGET SYMBOL. REASON  2              
 ...           
 IEW2469E 9907 THE ATTRIBUTES OF A REFERENCE TO IRRSEQ00 FROM SECTION irrseq#C  
          DO NOT MATCH THE ATTRIBUTES OF THE TARGET SYMBOL. REASON  3   
        
 IEW2456E 9207 SYMBOL CELQSG03 UNRESOLVED.  MEMBER COULD NOT BE INCLUDED FROM   
          THE DESIGNATED CALL LIBRARY. 
 ...                                         
 IEW2470E 9511 ORDERED SECTION CEESTART NOT FOUND IN MODULE.                    
 IEW2648E 5111 ENTRY CEESTART IS NOT A CSECT OR AN EXTERNAL NAME IN THE MODULE.

IEW2469E THE ATTRIBUTES OF A REFERENCE TO … FROM SECTION … DO
NOT MATCH THE ATTRIBUTES OF THE TARGET SYMBOL. REASON x

Reason 2 The xplink attributes of the reference and target do not match.
Reason 3 Either the reference or the target is in amode 64 and the amodes do not match. The IRRSEQ00 stub is only available in 31 bit mode, my program was 64 bit amode.

IEW2456E SYMBOL CELQINPL UNRESOLVED. MEMBER COULD NOT BE INCLUDED
FROM THE DESIGNATED CALL LIBRARY.

The compile in 64 bit mode generates an “include…” of the 64 bit stuff needed by C. Because the binder was in 31 bit, it used the 31 bit libraries – which did not have the specified include file. When you compile in 64 bit mode you need to bind with the 64 bit libraries. The compile command sorts all this out depending on the options.
The libraries used when binding in 64 bit mode are syslib_x = cee.sceebnd2:cbc.sccnobj:sys1.csslib. See the xlc config file above.

IEW2470E 9511 ORDERED SECTION CEESTART NOT FOUND IN MODULE.
IEW2648E 5111 ENTRY CEESTART IS NOT A CSECT OR AN EXTERNAL NAME IN THE MODULE.

Compiling in 64 bit mode, generates an entry point of CELQSTRT instead of CEESTART, so the binder instructions for 31 bit programs specifying the entry point of CEESTART will fail.

Overall

Because IRRSEQ00 only comes in a 31 bit flavour, and not a 64 bit flavour, I could not call it directly from a 64 bit program, and I had to use a 32 bit compile and bind.

Python calling C functions

You can have Python programs which are pure Python.
You can call C programs that act like Python programs, using Python constructs within the C program
You can call a C program from Python, and it processes parameters like a normal C program.
- You can pass simple data types such as char, integers and strings.
- You can pass structures. See Python calling C functions – passing structures

This blog post is about the third, calling a C program from Python, passing simple data types such as char, integers and strings.

I have based a lot of this on the well written pyzfile package by @daveyc.

The glue that makes it work is the ctypes package a “foreign function library” package.

Before you start

The blog post is called “Python calling C functions”. I tried using a z/OS stub code directly. This is not written in C, and I got.

CEE3595S DLL ... does not contain a CELQSTRT CSECT.

Which shows you must supply a C program.

The C program that I wrote, calls z/OS services. These must be defined like (or default to)

#pragma linkage(...,OS64_NOSTACK)

Getting started

My C program has several functions including

int query() {  
return 0;
}

The compile instructions said exportall – so all functions are visible from outside of the load module.

You access this from Python using code like

lib_file = pathlib.Path(__file__).parent / "pySMFRealTime.so"
self.lib = ctypes.CDLL(str(lib_file))
...
result = self.lib.query()

Where

lib_file = pathlib.Path(__file__).parent / “pySMFRealTime.so” says get the file path of the .so module in the same directory as the current Python file.
self.lib = ctypes.CDLL(str(lib_file)) load the file and extract information.
result = self.lib.query() execute the query function, passing no parameters, and store any return code in the variable result

Passing simple parameters

A more realistic program, passing parameters in, and getting data back in the parameters is

int conn(const char* resource_name,  // input:  what we want to connect to
         char * pOut,                // output: where we return the handle
         int * rc,                   // output: return code  
         int * rs,                   // output: reason code 
         int * debug)                // input:  pass in debug information 
{
    int lName = strlen(resource_name); 
    if  (*debug >= 1) 
    { 
      printf("===resource_namen"); 
      printHex(stdout,pFn,20); 
    } 
    ...
    return 0;
}

The Python code has

lib_file = pathlib.Path(__file__).parent / "pySMFRealTime.so"
self.lib = ctypes.CDLL(str(lib_file))
self.lib.conn.argtypes = [c_char_p,  # the name of stream
                          c_char_p,  # the returned buffer
                          ctypes.POINTER(ctypes.c_int), # rc
                          ctypes.POINTER(ctypes.c_int), # rs 
                          ctypes.POINTER(ctypes.c_int), # debug
                          ] 
self.lib.conn.restype = c_int

The code to do the connection is

def conn(self,name: str,):
    token =  ctypes.create_string_buffer(16) # 16 byte handle
    rc = ctypes.c_int(0)
    rs = ctypes.c_int(0)
    debug = ctypes.c_int(self.debug)
    self.token = None
    retcode  = self.lib.conn(name.encode("cp500"),
                                token,
                                rc,
                                rs,
                                debug)
    if retcode != 0:
        print("returned rc",rc, "reason",rs)
        print(">>>>>>>>>>>>>>>>> connect error ")
        return None
    print("returned rc",rc, "reason",rs)
    self.token = token
    return rc

The code does

def conn(self,name: str,): define the conn function and pass in the variable name which is a string
token = ctypes.create_string_buffer(16) # 16 byte handle create a 16 byte buffer and wrap it in ctypes stuff.
rc = ctypes.c_int(0), rs = ctypes.c_int(0), debug = ctypes.c_int(self.debug) create 3 integer variables.
self.token = None preset this
retcode = self.lib.conn( invoke the conn function
- name.encode(“cp500”), convert the name from ASCII (all Python printable strings are in ASCII) to code page 500.
- token, the 16 byte token defined above
- rc, rs, debug) the three integer variables
if retcode != 0: print out error messages
print(“returned rc”,rc, “reason”,rs) print the return and reason code
self.token = token save the token for the next operation
return rc return to caller, with the return code.

Once I got my head round the ctypes… it was easy.

The C program

There are some things you need to be aware of.

Python is compiled with the -qascii compiler option, so all strings etc are in ASCII. The code name.encode(“cp500”), converts it from ASCII to EBCDIC. The called C program sees the data as a valid EBCDIC string (null terminated).
If a character string is returned, with printable text. Either your program coverts it to ASCII, or your Python calling code needs to convert it.
Your C program can be compiled with -qascii – or as EBCDIC(no -qascii)
- Because Python is compiled in ASCII, the printf routines are configured to print ASCII. If your program is compiled as ASCII, printf(“ABCD”) will print as ABCD. If your program is compiled as EBCDIC printf(“ABCD”) will print garbage – because the hex values for EBCDIC ABCD are not printable as ASCII characters.
- If your program is compiled as ASCII you can define EBCDIC constants.
  - #pragma convert(“IBM-1047”)
  - char * pEyeCatcher = “EYEC”; // EBCDIC eye catcher for control block
  - #pragma convert(pop)

Python calling C functions – passing structures

I’ve written how you can pass simple data from Python to a C function, see Python calling C functions.

This article explains how you can pass structures and point to buffers in the Python program. it extends Python calling C functions. It allows you to move logic from the C program to a Python program.

Using complex arguments

The examples in Python calling C functions were for using simple elements, such as Integers or strings.

I have a C structure I need to pass to a C function. The example below passes in an eye catcher, some lengths, and a buffer for the C function to use.

The C structure

typedef struct querycb {                                                         
      char        Eyecatcher[4];  /* Eye catcher   offset    0    */                   
      uint16_t    Length;         /* Length of the block     4    */                   
      char        Rsvd1[1];       /* Reserved                6    */                   
      uint8_t     Version;        /* Version number          7   */                    
      char        Flags[2];       /* Flags                   8    */                   
      uint16_t    Reserved8;      //   10                                              
      uint32_t    Count;          // number returned  12                                       
      uint32_t    lBuffer;        // length of buffer 16                                      
      uint32_t    Reservedx ;     //              20                                    
      void        *pBuffer;       //              24                                 
    } querycb;

The Python code

# create the variables
eyec = "EYEC".encode("cp500")  # char[4] eye catcher
l = 32                         # uint16_t
res1 = 0                       # char[1] 
version = 1                    # uint8_t -same as a char
flags = 0                      # char[2]
res2 = 0                       # uint16_t
count = 0                      # uint32_t  
lBuffer = 4000                 # uint32_t 
res3 = 0                       # uint32_t 
# pBuffer                      # void *  
# allocate a buffer for the C program to use and put some data
# into it
pBuffer = ctypes.create_string_buffer(b'abcdefg',size=lBuffer)
# cast the pBuffer so it is a void * 
pB =  ctypes.cast(pBuffer, ctypes.c_void_p)
# use the struct.pack function.  See @4shbbhhiiiP below
# @4 is 4 bytes, the eye catcher
# h half word
# bb two char fields res1, and version
# hh two half word s flags and res2
# iii three integer fields.  count lBuffer and res3
# P void * pointer 
# Note pB is a ctype, we need the value of it, so pB.value
p = pack("@4shbbhhiiiP", eyec,l,res1,version,flags,
         res2,count,lBuffer,res3,pB.value)

#create first parm
p1 = ctypes.c_int(3)  # pass in the integer 3 as an example
# create second parm
p2 = ctypes.cast(p, ctypes.c_void_p)

# invoke the function 

retcode  = lib.conn(p1,p2)

The C program

int conn(int * p1, char * p2) 
// int conn(int  max,...)
{ 
    typedef struct querycb {                                                         
      char        Eyecatcher[4];  /* Eye catcher             0    */                   
      uint16_t    Length;         /* Length of the block     4    */                   
      char        Rsvd1[1];       /* Reserved                6    */                   
      uint8_t     Version;        /* Version number          7   */                    
      char        Flags[2];       /* Flags                   8    */                   
      uint16_t    Reserved8;      //   10                                              
      uint32_t    Count;  // number returned  12                                       
      uint32_t    lBuffer; // length of buffer 16                                      
      uint32_t    Reservedx ;    //              20                                    
      void        *pBuffer;      //              24                                    
    } querycb;  

    querycb * pcb = (querycb * ) p2;

    printf("P1 %i\n",*p1);
    printHex(stdout,p2,32); 
    printf("Now the structure\n")
    printHex(stdout,pcb -> pBuffer,32); 
    return 0 ;
}

The output

P1 3
00000000 : D8D9D7C2 00200001 00000000 00000000  ..... .......... EYEC............ 
00000010 : 00000FA0 00000000 00000050 0901BCB0  ...........P.... ...........&.... 
Now the structure
00000000 : 61626364 65666700 00000000 00000000  abcdefg......... /............... 
00000010 : 00000000 00000000 00000000 00000000  ................ ................

Where

EYEC is the passed in eye catcher
00000FA0 is the length of 4000
00000050 0901BCB0 is the 64 address of the structure
abcdefg is the data used to initialise the buffer

Observations

It took me a couple of hours to get this to work. I found it hard to get the cast, and the ctype…. functions to work successfully. There may be a better way of coding it, if so please tell me. The code works, which is the objective – but there may be better more correct ways of doing it.

Benefits

By using this technique I was able to move code from my C program to set up the structure needed by the z/OS service into C. My C program was just parse input parameters, set up the linkage for the z/OS service, and invoke the service.

If course I did not have the constants available from the C header file for the service, but that’s a different problem.

C calling an “assembler” function, setting the high order bit on, and passing parameters.

Since days of old when knights were bold, the standard parameter list to call an assembler function was to pass the addresses of the parameters, and set on the top bit of the address for the last address.
This way the called function knows how many parameters have been passed, and you do not need to pass a parameter count.

Setting the high order bit on, for the last parameter

I had to ask for help to remind me how to do it from C, so I could call “Assembler” functions.

You can get C to do this using

#pragma linkage(IRRSPK00 ,OS)

Example

The syntax of the routine from the RACF callable services documentation is

CALL IRRSPK00 (Work_area,
  ALET, SAF_return_code,
  ALET, RACF_return_code,
  ALET, RACF_reason_code,
  ALET, Function_code,
  Option_word,
  Ticket_area,
  Ticket_options,
  Ticket_principal_userid,
  Application_Id
)

Here is part of my C program.

#pragma linkage(IRRSPK00 ,OS)
...
long SAF_RC,RACF_RC,RACF_RS; 
SAF_RC=0 ; 
long ALET = 0; 
// ticket options needs special treatment, see below
int Ticket_options = 1; 
int * pTO = & Ticket_options; 

rc=IRRSPK00( 
         &work_area, 
         &ALET , &SAF_RC, 
         &ALET , &RACF_RC, 
         &ALET , &RACF_RS, 
         &ALET ,&Function_code, 
         &Option_word, 
         &ticket, // length followed by area 
         &pTO, 
         &userid, 
         &appl 
         );

If you use #pragma linkage(IRRSPK00 ,OS) it sets on the high order bit. You pass the address of the parameters. I just used &variable, there are other ways.

Passing variables

Most of the parameters are passed by address for example &ALET inserts the address of the variable, conforming to the z/OS standards.

There is a field Ticket_principal_userid which is the name of a 10-byte area that consists of a 2-byte length field followed by the userid id for whom a PassTicket operation is to be performed followed by an 8-byte PassTicket field.

I defined a structures for each variable like

struct {
  short length;
  char value[8];
} ticket;

In the program I used &ticket.

Ticket option

The documentation says

Ticket_options: The name of a fullword containing the address of a binary bit string that identifies the ticket-specific processing to be performed.

It took me a while to understand what this meant. I had to use

int Ticket_options = 1; 
int * pTO = & Ticket_options;

and use it

int Ticket_options = 1; 
int * pTO = & Ticket_options; 
...
 &ticket, // length followed by area 
 &pTO,

Whoops R_GenSec (IRRSGS00 or IRRSGS64): Generic security API interface

I had great problems getting this to work. The documentation said

The address double words from 31 bit callers should have the first word filled with
zeros and the second word filled with the 31 bit address. Sub-parameter addresses will be in the format of the AMODE of the caller.

I do not know what this means. When I coded it as expected I got

CEE3250C The system or user abend S0E0 R=00000029

Which means invalid ALET supplied.

I converted the program to 64 bit and it still failed!