Stackoverflow: What throughput can a standalone Java program achieve?

There was a question on the MQ section on StackOverflow

I have a standalone multi threaded java application which listen messages from IBM MQ.
Current system take around 500ms for processing of 1 message after it read from queue and till it commit.
I want to know how many messages I can consume

Concurrently:

Max number of messages can be processed? or throttle limit

A good meaty performance question I thought. Let me break this into pieces.

Current system take around 500ms for processing of 1 message after it read from queue and till it commit.

Processing one messages and commit should take about 10 milliseconds or less( say 30 ms for a two phase commit). There is clearly something else going on. Fix this first.

A long database call. This could be due to database locking, or a badly designed statement, for example a query which needs to access thousands or millions of rows.
A request to a server far far away
A file system with the speed of writing an illuminated letter to parchment

How many messages I can consume: Concurrently:

Take the worst case of using persistent messages, which require log IO during commit.

For one thread, processing multiple messages before doing a commit means the thread can do more work. Consider a get taking 1 millisecond, and a commit taking 10 ms. This is one message processed every 11 ms. If you did 50 gets – taking 50 ms and a commit taking 10 ms, this is 50 messages in 50 + 10 ms which equates to one message every 1.2 milliseconds almost 10 times faster. This is how channels can send messages efficiently. There is a “sweet spot” of messages per commit to give you maximum data processed per second. This depends on the message size, logging rates and other factors. For a 100MB message it is one message per commit. For 10KB messages, this may be 1000 messages per commit.

This may be selfish

This is clearly a great improvement, but possibly selfish. If the application logic is a get followed by a database insert, followed by a commit, then doing 50 gets, 50 inserts and a commit, will work much faster. The down side is that the database requests will keep locks until the commit. These locks may prevent other applications from accessing data, either the recently inserted records, page locks, or index locks. So overall MQ throughput goes up – but the business transaction suffers. You need to understand the database and find the optimum number of requests per commit for your business transaction.

How long before the data is visible?

Rather than have one thread process 1000 messages per commit (taking 1010 ms) you may want to have multiple threads processing 10 messages per commit – taking 20 ms. This means that the data in the database (or replies etc) are visible earlier. This may be important to your business transaction if you have to worry about response time.

Parallel threads

Using more threads should improve throughput, unless this is delayed by external factors – such as database locks.
One customer found one thread was optimum because there was no database delays.

How many messages I can consume: Max number of messages can be processed? or throttle limit

There are papers written on this but here is a one minute overview

As fast as the queue manager can process data

The rate at which MQ can write its logs
Keep queue data in memory – ( buffer pools on z/OS, queue buffer on midrange), so few messages on the queue.

Threads

Having parallel threads gives you better throughput than one thread. You get overlapped writing to the log, the units of work are shorter in duration, you can get parallel IO.
You may be limited by the network. Having multiple threads from an application means the network can be better utilized. One thread can be receiving data down the wire, while another thread is waiting in commit.
You may be limited by where your programs run – eg short of CPU, or slow IO (for your System.out.println statements)

Application design

You may get delays due to serialization if all thread are using the same queue.
Remove the debug printf or System.out.println statements.
Using a queue per business application is better than all applications sharing the same queue
Using one reply to queue per web server may be better than a shared reply to queue – especially if you use Apache Camel.
Use get first if possible. Avoid scans of the queues.

The short answer….

You should be able to get thousands of 1KB messages a second through your Java application when using multiple threads.

What’s the difference between an MQ Message and a JMS Message

I had problems using the MQI Interface to create a message for a JMS program to receive.

To see what was in the JMS message, I used a Java program using JMS to write a message, and used my trusty C program to display it.

I could see that there were message properties in the message

Property 0 name <mcd.Msd> value <jms_text>
Property 1 name <jms.Dst> value <queue:///JMSQ1>
Property 2 name <jms.Rto> value <queue:///JMSQ2>
Property 3 name <jms.Tms> value <1571902099742>
Property 4 name <jms.Dlv> value <2>

These are described here.

The mcd.Msd value is one of jms_none, jms_text, jms_bytes, jms_map, jms_stream, jms_object. This depends on whether you use Message message, BytesMessage message etc to define your message type. The jms program receiving the message may be expecting a particular type

The jms.Rto comes from the message.setJMSReplyTo(…). It was set in the MQMD.ReplyToQ as well as the message property.

It took me some time to find how to specify value such as for deliveryMode. I found it here. For example message.setDeliveryMode(DeliveryMode.NON_PERSISTENT). (This comes from javax.jms.DeliveryMode.NON_PERSISTENT,not a com.ibm…. file).

I converted my simple program from JMQI to JMS, in a couple of hours, and was surprised to find it used fewer lines of code than using the JMQI. Of course I may find I omitted some work, such as error handling, but it seems to be working OK.

Magic methods to decode Java MQ constants to strings.

I had been struggling with MQ and java, and decoding what the return codes numbers were, and found some well gem methods here.

String reasonCode = MQConstants.lookup(2035, “MQRC_.*”); gave MQRC_NOT_AUTHORIZED

and

String decode = MQConstants.decodeOptions(gmo.options,”MQGMO_.*”); gave me

MQGMO_WAIT | MQGMO_SYNCPOINT_IF_PERSISTENT | MQGMO_FAIL_IF_QUIESCING

I wish I had these a couple of years ago – it would have saved me a lot of time!

The methods are

`static java.lang.String`	`decodeOptions(int optionsP, java.lang.String optionPattern)` This helper method takes an integer representing a set of IBM MQ options for an MQI structure, and converts them into a string displaying the constants that the options represent.
`static int`	`getIntValue(java.lang.String name)` Returns the value of the named MQSeries constant as an int.
`static java.lang.Object`	`getValue(java.lang.String name)` Returns the value of the named MQSeries constant.
`static java.lang.String`	`lookup(int value, java.lang.String filter)` Returns the MQSeries constant name or names for the supplied int value.
`static java.lang.String`	`lookup(java.lang.Object value, java.lang.String filter)` Returns the MQSeries constant name or names for the supplied value of type `Integer`, `String`, `byte[]`, or `char[]`.
`static java.lang.String`	`lookupCompCode(int reason)` Convenience method for finding the constant name for a completion code.
`static java.lang.String`	`lookupReasonCode(int reason)` Convenience method for finding the constant name for a reason code.
`static void`	`main(java.lang.String[] args)`

MQRC_DATA_LENGTH_ERROR with client

We had an application working on one system, and we moved it to another system, and we got MQ RC 2010 data length error. It turns out that the

SYSTEM.DEF.SVRCONN had MAXMSGL of 1 – so the maximum message sized allowed on this channel was 1 bytes.

You can specify the maximum msg length on the client for example the MQCD or client table – but I think the negotiation is the lower of the values at each end.

Setting the value to one on the z/OS end was part of stopping people using the default channel definitons.

Any port in a storm? No.

Ive just spent a day resolving a problem with specifying a port value trying to connect to MQ.

I had

public long port = 1414;
String channel = “MYCHANNEL”;
String hostname = “127.0.0.1”;
Hashtable<String, Object> h = new Hashtable<String, Object>();
h.put(MQConstants.PORT_PROPERTY, dd.port);h.put(MQConstants.CHANNEL_PROPERTY, channel);
h.put(MQConstants.TRANSPORT_PROPERTY, MQConstants.TRANSPORT_MQSERIES_CLIENT);
h.put(MQConstants.HOST_NAME_PROPERTY, hostname);
queueManager = new MQQueueManager(“QMA”,h);

(did you spot the problem?)

This failed with

MQConnection to QMA com.ibm.mq.MQException: MQJE001: Completion Code ‘2’, Reason ‘2538’.
Caused by: com.ibm.mq.jmqi.JmqiException: CC=2;RC=2538;AMQ9204: Connection to host ‘127.0.0.1(0)’ rejected.

This is saying it tried to connect with port 0!

I tried

String port = “1414”;, that failed the same way.

If I used

MQEnvironment.port=”1414″; it worked.

This was tough to resolve, as there is no documentation to help me.

Someone suggested public int port = 1414; and it worked! What a way to spend a nice autumn day.

Whoops -deploying MDB in weblogic

I was quite happily using my MDB in webLogic, but when I changed its configuration, it did not pick up the new changes. It took a day to find out why, and I have learned much more about deploying MDBS.

My connection factory was using SYSTEM.DEF.SVRCONN, I changed it to use a different client channel. I stopped SYSTEM.DEF.SVRCONN, ( so I could check the change had worked), and restarted the webLogic instance. I was surprised when my MDB failed to start, because the channel was stopped. The MDB was trying to use that channel. It took a lot of head scratching to get it to work as I expected.

I had messages like <BEA-015073> Message-Driven Bean … is configured with unknown activation-config-property name failIfQuiesce. This message is wrong, failIfQuiesce is supported by the IBM Resource adapter.
I had the same message with activation-config-property name cfLookup. This was my problem. I should have specified connectionFactoryLookup.
If you have <activation-config-property-name>connectionFactoryLookup… (specified in the ejb-jar) any other parameters you specify in the ejb-jar.xml file are ignored.
If you do not specify a connectionFactoryLookup, nor properties in the ejb-jar.xml file, defaults are provided, see Configuring the resource adapter for inbound communication. In my case I had not specified activation-config-property-name channel, and this defaulted to SYSTEM.DEF.SVRCONN, which is why it continued to use that channel.
It worth putting <activation-config-property-name>applicationName … in your definitions so you can see what you are using.
- dis qstatus(JMSQ3) type(handle) gave me APPLTAG(CF3Name) so I can tell which definitions are being used.
- If you get APPLTAG(weblogic.Server) then you are taking the defaults.
The Oracle documentation says the precedence order is as below. I do not think this is 100% accurate. (I could not specify some of the parameters on the weblogic-ejb-jar.xml file). I didnt try the java program.
1. properties set in the weblogic-ejb-jar.xml deployment descriptor
2. activation-config-property properties set in the ejb-jar.xml deployment descriptor
3. activationConfigProperty annotation properties in the java program.

What do I need to specify?

As a minimum you need to use connectionFactoryLookup or specify

applicationName – so you can identify which definitions are being used
channel – which channel to use
failIfQuiesce
hostName
port

The ejb-jar.xml file is in the META-INF directory. Change the ejb-jar.xml or weblogic-ejb-jar.xml file. IUpdate the jar file using a command like jar -uvf MDB4.jar META-INF/ejb-jar.xml, and redeploy it.

How do I get a client to disconnect?

I had a question from a customer who asked how they can reduce the number of client connections in use. They had tried setting a disconnect interval (DISCINT) on the channel, but the connections were like weeds – you kill them off, and they grow back again.

DISCINT is “the length of time after which a channel closes down, if no message arrives during that period”. This sounds perfect for most people. The application is in an MQGET, and if no messages arrive, the channel can be disconnected, and the application gets connection broken. The application can then decide to disconnect or reconnect.
If the application is not in an MQGET, then it will get notified of the broken connection next time it tries to use MQ.

Independent applications

Many applications are well written in that when they get Connection Broken, they just reconnect again, and so the DISCINT has no effect on reducing the number of connections. This may be good for availability but not for resource usage. It may be good to have 1000 application instances running the day, but perhaps not overnight when there is no work to do. Ive seen instances where the applications do an MQGET every minute, and with 1000 instances this can use a lot of CPU and doing no useful work. In this case you want unused application instances to stop, and be restarted when needed.

You cannot use triggering with client connections (unless you have a very smart trigger monitor to produce an event which says start a client program over there).

Use automation periodically check the queue depth, and number of input handles. If there is a high queue depth, or a low number of handles(eg 2) then start more application instances, across your back-end servers. Your applications can then disconnect if they have not received a message within say 10 minutes. This should keep the right number of application instances active.

An administrator should be able to get this automation set up, but getting the application to connect could be a challenge, as this requires the application developer to change the code!

Running under a web server

If your applications are running under a web server you may have mis-configured connection pools. You can specify the initial size of the pool, and this many connections are made. As more connections are needed, then more can be added to the pool until the pool maximum is reached. You should specify a time out value, so periodically the pool gets cleaned up, and unused connections are removed, until the pool is back to the initial size. You should review the initial size of the pools ( is it too large), and the value of the time out value.

This should just be an administrative change.

Good luck, you may be successful in reducing the number of client connections, but do not set your hopes too high.

WebLogic message does not have authorization to view the logs

I was using JMX to display the connectionPool statistics from webLogic, and kept getting messages

I solved this by using the webLogic console

Click “Security Realms” in the Domain Stucture box on the left hand side of the home page
Click on the name of your realm (myrealm)
Click on “Roles and Policies”
Click on “Realm Policies”
Expand Domain and select “View Policy Conditions” for “View Log”
Click on Add Condition
Select the role
Click on Finish
Click on Save

The change is available immediately on “Save”.

Did the JMS architects get it wrong? Possibly!

I was looking into a performance problem where a web server was doing 1 million MQCONNects day! During my investigations I found that the original designers did not design it properly because of the performance overhead, and so the JMS architects had to fix it up by creating connection factories.

Below, I cover

the performance problem
writing my own connection factory
the problems of writing my own connection factory
it might just be easier to use a connection factory provided by your web server or JMS provider.

What is an EJB and an MDB?

In a web server you can have Enterprise Java Beans (EJBs) which are a package of java applications doing enterprise type work ( get a message, update a database, send a reply) conforming to an specific Programming Interface.

A Message Driven Bean is an EJB which responds to messages. A message listener applications gets messages from a queue – and passes the message to the MDB. In non EJB terms this is just like a trigger monitor, starting a transaction and passing the data.

When you create the MDB you have a configuration file which allows a message listener to be created, and the parameters to use to connect to MQ, and how many threads etc can be running concurrently.

The MDB, to process the message has a basic structure of

method onMessage – this is given the message
ejbCreate – when the EJB is created, you can do initialisation here
ejbRemove – when the EJB is about to be deleted, you can do clean up here

and looks like

public class MDB
implements MessageListener, MessageDrivenBean
{
  private static final long serialVersionUID = -8070254332864574796L;
  public void onMessage(Message message)
  {
    Connection connection = null;
    Session session = null;
    MessageProducer producer = null;

    try
    {
      InitialContext ctx = new InitialContext();
      ConnectionFactory cf = (ConnectionFactory)ctx.lookup("CF2");
      Destination dest = message.getJMSReplyTo();
     
      connection = cf.createConnection();
      session = connection.createSession(false, 1);
      producer = session.createProducer(dest);
      
      TextMessage response = session.createTextMessage("test response message from the WMQ_IVT_MDB");
      response.setJMSCorrelationID(message.getJMSMessageID());
      producer.send(response);
      return;
    }
    catch (Exception je)
    {
      System.out.println("Something went wrong." + je);
    }
  }

  public void ejbCreate() { }
  public void ejbRemove() { }
 }

Within the onMessage method is logic connection = cf.createConnection();
Under the covers this createConnection() does an MQCONN. With 1 million messages a day – this is 1 million MQCONNs!
There was also an MQOPEN, an MQCLOSE, and an MQDISC before the onMessage() method returned. This results in a huge performance overhead.

Attempts to fix the performance

People quickly found that this model was not efficient, and they came up with ways to improve the performance.

The idea of a connection pool was developed. Instead of doing an the createConnection() doing an MQCONN etc. this code was changed, to not do the MQDISC, but to keep the Queue Manager handle. When the next MQCONN occurred, if there was one of these spare Queue Manager handles it would use it.

These connection pools are sophisticated. For example they can free up connections if they had not been used for a time period, so reducing the overall number of Queue Manager connections in use, they can slowly increase the number of connections in the pool so there is not a sudden spike in requests.

I found it difficult to set up a connection pooling in Oracle’s webLogic webserver, so I thought I would write my own. This was pretty easy, but then I discovered some complications.

Writing my own “connection pooling”.

It was clear to me that having connection = cf.createConnection(); from the ejbCreate() method rather than in the onMessage() method would be much more efficient.

Move variables from being method variables to instance variables so they persisted across onMessage() calls.
Create a connect() method which actually does the createConnection(). This can be called in the ejbCreate setup routine, and from the onMessage() if the connection variable is null.
Have code in the ejbRemove to the connection, session etc.

public class IVTMDB
implements MessageListener, MessageDrivenBean
{
  private static final long serialVersionUID = -337338331639L;
  // create long lasting instance variables 
  Connection connection = null;
  Session session = null;
  MessageProducer producer = null;
  InitialContext ctx = null;
  ConnectionFactory cf = null;
  // new method to do the connect
  public void connect(){
  try {
     ctx = new InitialContext();
     cf = (ConnectionFactory)ctx.lookup("CF2");
     connection = cf.createConnection();
     session = connection.createSession(false, 1); 
    }
    catch (Exception je)
    {...
    } 
 }

 public void ejbCreate() {
   connect();
 }
...
}

This worked well. The number of MQCONN dropped to about 10 per day for many puts.

The problem looked solved, until I tried to shut down the queue manager.

If no work was being processed, then the MDBs were not being called, and so there were no MQ requests being made, and so the connections stayed active, and the queue manager did not shut down.

To solve this you can set cf.setExceptionListener(..) which will get notified if there are problems with the connection, and so you can close the connection etc.
You need to consider what to do if the connection cannot be made; do you wait for a short time period, or do you return an error.

I found that this “simple” way of avoiding all of the MQCONNects was quickly getting much more complex. It was going to be much easier in the long term to get the provided connection factories working. This was another challenge, taking about a week. See the next few blog entries on how I did this for webLogic.

MDBs activation specs and @things in the java program

While struggling with getting MDBs working, and looking at examples, I saw examples where they defined JMS resources within the java program using @…. statements, and could not see how they worked. These are called annotations. The documentation on the web assumes you know about annotations when explaining annotations! They, in fact, are pretty simple, let me explain.

Annotations start with an @ character, and the information can be stored within the .class file as meta-data. Programs can extract and use this meta data.

You can have java code like
@Resource(lookup="java:customerMQ") private javax.jms... myMQ;

A program, for example, your program, an analysis program or a web server, can issue request like

load class information
from the meta data list all fields with @resource defined.
do things with the list

One example would be to specify a JNDI lookup of java:customerMQ and return it into the field myMQ.

Another example from the IBM documentation

@MessageDriven(
  name = "JMSSampleMDB",
  activationConfig = 
  {
    @ActivationConfigProperty(
       propertyName  = "destinationType", 
       propertyValue = "javax.jms.Queue"),
 
    @ActivationConfigProperty(
       propertyName  = "destination", 
	propertyValue = "jndi_INPUT_Q")                         
   }
)

The resource adapter has code which does

load your MDB program
get the MessageDriven stuff.
- within this, locate the activationConfig records
  - within these, locate the ActivationConfigProperty propertyName and propertyValue, and merge the data with the data in the ejb-jar.xml file.

With the definitions in your java program, and the definitions in the MDB configuration you can configure a complete set of options for MDB. I think the definitions in the java program override the MDB configuration.

How do I see what data there is?

You can extract this meta-data using a method like (see here)

public void getAnnotations(Class inclass){
    for(Field field : inclass.getDeclaredFields()){
        Class type = field.getType();
        String name = field.getName();
        field.getDeclaredAnnotations(); //do something with these
    }

Use the javap command to display the data.

To display the annotations you can usethe command, where ….class is the name of your class file.

javap -v .....class

My java program had

import javax.annotation.Resource;
.....
@Resource(lookup = "java:app/jms/myappTopic")
String colin = "ZZZZZ";

The javap command gave

java.lang.String colin;
  descriptor: Ljava/lang/String;
  flags:
  RuntimeVisibleAnnotations:
  0: #14(#15=s#16)
...
#14 = Utf8 Ljavax/annotation/Resource;
#15 = Utf8 lookup
#16 = Utf8 java:app/jms/myappTopic

from which we get

java.lang.String colin ... 
  javax/annotation/Resource (lookup = java:app/jms/myappTopic).

which matches the source code.

Different annotation types are confusing.

As well as providing meta-information on variables and classes, java also uses annotations to modify the java compiler behaviour. For example

By putting @Deprecated infront of a method, the method can be flagged when used, as deprecated, and you should not use it
@SuppressWarnings(“unchecked”) tells the java compiler NOT to produce an error message for the unchecked condition. See here for a list of warning conditions.