ClusterJ is part of the MySQL Cluster Connector for Java which is currently in beta as part of MySQL Cluster 7.1. It is designed to provide a high performance method for Java applications to store and access data in a MySQL Cluster database. It is also designed to be easy for Java developers to use and is “in the style of” Hibernate/Java Data Objects (JDO) and JPA. It uses the Domain Object Model DataMapper pattern:
- Data is represented as domain objects
- Domain objects are separate from business logic
- Domain objects are mapped to database tables
The purpose of ClusterJ is to provide a mapping from the table-oriented view of the data stored in MySQL Cluster to the Java objects used by the application. This is achieved by annotating interfaces representing the Java objects; where each persistent interface is mapped to a table and each property in that interface to a column. By default, the table name will match the interface name and the column names match the property names but this can be overridden using annotations.
If the table does not already exist (for example, this is a brand new application with new data) then the table must be created manually – unlike OpenJPA, ClusterJ will not create the table automatically.
Figure 2 shows an example of an interface that has been created in order to represent the data held in the ‘employee’ table.
ClusterJ uses the following concepts:
-
SessionFactory: There is one instance per MySQL Cluster instance for each Java Virtual Machine (JVM). The SessionFactory object is used by the application to get hold of sessions. The configuration details for the ClusterJ instance are defined in the Configuration properties which is an artifact associated with the SessionFactory.
- Session: There is one instance per user (per Cluster, per JVM) and represents a Cluster connection
- Domain Object: Objects representing the data from a table. The domain objects (and their relationships to the Cluster tables) are defined by annotated interfaces (as shown in the right-hand side of Figure 2.
- Transaction: There is one transaction per session at any point in time. By default, each operation (query, insert, update, or delete) is run under a new transaction. . The Transaction interface allows developers to aggregate multiple operations into a single, atomic unit of work.
ClusterJ will be suitable for many Java developers but it has some restrictions which may make OpenJPA with the ClusterJPA plug-in more appropriate. These ClusterJ restrictions are:
- Persistent Interfaces rather than persistent classes. The developer provides the signatures for the getter/setter methods rather than the properties and no extra methods can be added.
- No Relationships between properties or between objects can be defined in the domain objects. Properties are primitive types.
- No Multi-table inheritance; there is a single table per persistent interface
- No joins in queries (all data being queried must be in the same table/interface)
- No Table creation – user needs to create tables and indexes
- No Lazy Loading – entire record is loaded at one time, including large object (LOBs).
Tutorial
This tutorial uses MySQL Cluster 7.1.2a on Fedora 12. If using earlier or more recent versions of MySQL Cluster then you may need to change the class-paths as explained in http://dev.mysql.com/doc/ndbapi/en/mccj-using-clusterj.html
It is necessary to have MySQL Cluster up and running. For simplicity all of the nodes (processes) making up the Cluster will be run on the same physical host, along with the application.
These are the MySQL Cluster configuration files being used :
config.ini:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
|
<span style="color: #3366ff;">[ndbd default]noofreplicas=2 datadir=/home/billy/mysql/my_cluster/data [ndbd] hostname=localhost id=3 [ndbd] hostname=localhost id=4 [ndb_mgmd] id = 1 hostname=localhost datadir=/home/billy/mysql/my_cluster/data [mysqld] hostname=localhost id=101 [api] hostname=localhost</span> |
my.cnf:
1
2
3
4
|
<span style="color: #3366ff;">[mysqld] ndbcluster datadir=/home/billy/mysql/my_cluster/data basedir=/usr/local/mysql</span> |
This tutorial focuses on ClusterJ rather than on running MySQL Cluster; if you are new to MySQL Cluster then refer to running a simple Cluster before trying this tutorial.
ClusterJ needs to be told how to connect to our MySQL Cluster database; including the connect string (the address/port for the management node), the database to use, the user to login as and attributes for the connection such as the timeout values. If these parameters aren’t defined then ClusterJ will fail with run-time exceptions. This information represents the “configuration properties” shown in Figure 3. These parameters can be hard coded in the application code but it is more maintainable to create a clusterj.properties file that will be imported by the application. This file should be stored in the same directory as your application source code.
clusterj.properties:
1
2
3
4
5
6
7
8
|
<span style="color: #3366ff;">com.mysql.clusterj.connectstring=localhost:1186 com.mysql.clusterj.database=clusterdb com.mysql.clusterj.connect.retries=4 com.mysql.clusterj.connect.delay=5 com.mysql.clusterj.connect.verbose=1 com.mysql.clusterj.connect.timeout.before=30 com.mysql.clusterj.connect.timeout.after=20 com.mysql.clusterj.max.transactions=1024</span> |
As ClusterJ will not create tables automatically, the next step is to create ‘clusterdb’ database (referred to in clusterj.properties) and the ‘employee’ table:
1
2
3
4
5
6
7
8
9
10
11
12
13
|
<span style="color: #800000;">[billy@ws1 ~]$ mysql -u root -h 127.0.0.1 -P 3306 -u root mysql> create database clusterdb;use clusterdb; mysql> CREATE TABLE employee ( -> id INT NOT NULL PRIMARY KEY, -> first VARCHAR(64) DEFAULT NULL, -> last VARCHAR(64) DEFAULT NULL, -> municipality VARCHAR(64) DEFAULT NULL, -> started VARCHAR(64) DEFAULT NULL, -> ended VARCHAR(64) DEFAULT NULL, -> department INT NOT NULL DEFAULT 1, -> UNIQUE KEY idx_u_hash (first,last) USING HASH, -> KEY idx_municipality (municipality) -> ) ENGINE=NDBCLUSTER;</span> |
The next step is to create the annotated interface:
Employee.java:
1
2
3
4
|
<span style="color: #3366ff;">import com.mysql.clusterj.annotation.Column; import com.mysql.clusterj.annotation.Index; import com.mysql.clusterj.annotation.PersistenceCapable; import com.mysql.clusterj.annotation.PrimaryKey;</span> |
1
2
3
|
<span style="color: #3366ff;">@PersistenceCapable(table="employee") @Index(name="idx_uhash") public interface Employee {</span> |
1
2
3
|
<span style="color: #3366ff;">@PrimaryKey int getId(); void setId(int id);</span> |
1
2
3
4
5
|
<span style="color: #3366ff;">String getFirst(); void setFirst(String first); String getLast(); void setLast(String last);</span> |
1
2
3
4
|
<span style="color: #3366ff;">@Column(name="municipality") @Index(name="idx_municipality") String getCity(); void setCity(String city);</span> |
1
2
|
<span style="color: #3366ff;">String getStarted(); void setStarted(String date);</span> |
1
2
|
<span style="color: #3366ff;">String getEnded(); void setEnded(String date);</span> |
1
2
3
|
<span style="color: #3366ff;">Integer getDepartment(); void setDepartment(Integer department); }</span> |
The name of the table is specified in the annotation @PersistenceCapable(table=”employee”) and then each column from the employee table has an associated getter and setter method defined in the interface. By default, the property name in the interface is the same as the column name in the table – the column name has been overridden for the City property by explicitly including the @Column(name=”municipality”) annotation just before the associated getter method. The @PrimaryKey annotation is used to identify the property whose associated column is the Primary Key in the table. ClusterJ is made aware of the existence of indexes in the database using the @Index annotation.
The next step is to write the application code which we step through here block by block; the first of which simply contains the import statements and then loads the contents of the clusterj.properties defined above:
Main.java (part 1):
1
2
3
4
5
6
|
<span style="color: #3366ff;">import com.mysql.clusterj.ClusterJHelper; import com.mysql.clusterj.SessionFactory; import com.mysql.clusterj.Session; import com.mysql.clusterj.Query; import com.mysql.clusterj.query.QueryBuilder; import com.mysql.clusterj.query.QueryDomainType;</span> |
1
2
3
4
|
<span style="color: #3366ff;">import java.io.File; import java.io.InputStream; import java.io.FileInputStream; import java.io.*;</span> |
1
2
|
<span style="color: #3366ff;">import java.util.Properties; import java.util.List;</span> |
1 |
<span style="color: #3366ff;">public class Main {</span> |
1 |
<span style="color: #3366ff;">public static void main (String[] args) throws java.io.FileNotFoundException,java.io.IOException {</span> |
1 |
<span style="color: #3366ff;">// Load the properties from the clusterj.properties file</span> |
1
2
3
4
|
<span style="color: #3366ff;">File propsFile = new File("clusterj.properties"); InputStream inStream = new FileInputStream(propsFile); Properties props = new Properties(); props.load(inStream);</span> |
1
2
3
|
<span style="color: #3366ff;">//Used later to get userinput BufferedReader br = new BufferedReader(new InputStreamReader(System.in));</span> |
The next step is to get a handle for a SessionFactory from the ClusterJHelper class and then use that factory to create a session (based on the properties imported from clusterj.properties file.
Main.java (part 2):
1
2
3
|
<span style="color: #3366ff;">// Create a session (connection to the database) SessionFactory factory = ClusterJHelper.getSessionFactory(props); Session session = factory.getSession();</span> |
Now that we have a session, it is possible to instantiate new Employee objects and then persist them to the database. Where there are no transaction begin() or commit() statements, each operation involving the database is treated as a separate transaction.
Main.java (part 3):
1
2
3
4
5
6
7
|
<span style="color: #3366ff;">// Create and initialise an Employee Employee newEmployee = session.newInstance(Employee.class); newEmployee.setId(988); newEmployee.setFirst("John"); newEmployee.setLast("Jones"); newEmployee.setStarted("1 February 2009"); newEmployee.setDepartment(666);</span> |
1
2
|
<span style="color: #3366ff;">// Write the Employee to the database session.persist(newEmployee);</span> |
At this point, a row will have been added to the ‘employee’ table. To verify this, a new Employee object is created and used to read the data back from the ‘employee’ table using the primary key (Id) value of 998:
Main.java (part 4):
1
2
|
<span style="color: #3366ff;">// Fetch the Employee from the database Employee theEmployee = session.find(Employee.class, 988);</span> |
1
2
3
4
5
6
7
8
9
10
|
<span style="color: #3366ff;">if (theEmployee == null) {System.out.println("Could not find employee");} else {System.out.println ("ID: " + theEmployee.getId() + "; Name: " + theEmployee.getFirst() + " " + theEmployee.getLast()); System.out.println ("Location: " + theEmployee.getCity()); System.out.println ("Department: " + theEmployee.getDepartment()); System.out.println ("Started: " + theEmployee.getStarted()); System.out.println ("Left: " + theEmployee.getEnded()); }</span> |
This is the output seen at this point:
1
2
3
4
5
6
|
<span style="color: #800000;">ID: 988; Name: John Jones Location: null Department: 666 Started: 1 February 2009 Left: null Check the database before I change the Employee - hit return when you are done</span> |
The next step is to modify this data but it does not write it back to the database yet:
Main.java (part 5):
1
2
3
|
<span style="color: #3366ff;">// Make some changes to the Employee & write back to the database theEmployee.setDepartment(777); theEmployee.setCity("London");</span> |
1
2
3
|
<span style="color: #3366ff;">System.out.println("Check the database before I change the Employee - hit return when you are done"); String ignore = br.readLine();</span> |
The application will pause at this point and give you chance to check the database to confirm that the original data has been added as a new row but the changes have not been written back yet:
1
2
3
4
5
6
|
<span style="color: #800000;">mysql> select * from clusterdb.employee; +-----+-------+-------+--------------+-----------------+-------+------------+ | id | first | last | municipality | started | ended | department | +-----+-------+-------+--------------+-----------------+-------+------------+ | 988 | John | Jones | NULL | 1 February 2009 | NULL | 666 | +-----+-------+-------+--------------+-----------------+-------+------------+</span> |
After hitting return, the application will continue and write the changes to the table, using an automatic transaction to perform the update.
Main.java (part 6):
1 |
<span style="color: #3366ff;">session.updatePersistent(theEmployee);</span> |
1
2
3
|
<span style="color: #3366ff;">System.out.println("Check the change in the table before I bulk add Employees - hit return when you are done"); ignore = br.readLine();</span> |
The application will again pause so that we can now check that the change has been written back (persisted) to the database:
1
2
3
4
5
6
|
<span style="color: #800000;">mysql> select * from clusterdb.employee; +-----+-------+-------+--------------+-----------------+-------+------------+ | id | first | last | municipality | started | ended | department | +-----+-------+-------+--------------+-----------------+-------+------------+ | 988 | John | Jones | London | 1 February 2009 | NULL | 777 | +-----+-------+-------+--------------+-----------------+-------+------------+</span> |
The application then goes onto create and persist 100 new employees. To improve performance, a single transaction is used to that all of the changes can be written to the database at once when the commit() statement is run:
Main.java (part 7):
1
2
3
|
<span style="color: #3366ff;">// Add 100 new Employees - all as part of a single transaction newEmployee.setFirst("Billy"); newEmployee.setStarted("28 February 2009");</span> |
1 |
<span style="color: #3366ff;">session.currentTransaction().begin();</span> |
1
2
3
4
5
6
|
<span style="color: #3366ff;">for (int i=700;i<800;i++) { newEmployee.setLast("No-Mates"+i); newEmployee.setId(i+1000); newEmployee.setDepartment(i); session.persist(newEmployee); }</span> |
1 |
<span style="color: #3366ff;">session.currentTransaction().commit();</span> |
The 100 new employees will now have been persisted to the database. The next step is to create and execute a query that will search the database for all employees in department 777 by using a QueryBuilder and using that to build a QueryDomain that compares the ‘department’ column with a parameter. After creating the, the department parameter is set to 777 (the query could subsequently be reused with different department numbers). The application then runs the query and iterates through and displays each of employees in the result set:
Main.java (part 8):
1
2
3
4
5
6
7
8
|
<span style="color: #3366ff;">// Retrieve the set all of Employees in department 777 QueryBuilder builder = session.getQueryBuilder(); QueryDomainType<Employee> domain = builder.createQueryDefinition(Employee.class); domain.where(domain.get("department").equal(domain.param( "department"))); Query<Employee> query = session.createQuery(domain); query.setParameter("department",777);</span> |
1
2
3
4
5
6
7
8
9
|
<span style="color: #3366ff;">List<Employee> results = query.getResultList(); for (Employee deptEmployee: results) { System.out.println ("ID: " + deptEmployee.getId() + "; Name: " + deptEmployee.getFirst() + " " + deptEmployee.getLast()); System.out.println ("Location: " + deptEmployee.getCity()); System.out.println ("Department: " + deptEmployee.getDepartment()); System.out.println ("Started: " + deptEmployee.getStarted()); System.out.println ("Left: " + deptEmployee.getEnded()); }</span> |
1
2
3
|
<span style="color: #3366ff;">System.out.println("Last chance to check database before emptying table - hit return when you are done"); ignore = br.readLine();</span> |
At this point, the application will display the following and prompt the user to allow it to continue:
1
2
3
4
5
6
7
8
9
10
|
<span style="color: #800000;">ID: 988; Name: John Jones Location: London Department: 777 Started: 1 February 2009 Left: null ID: 1777; Name: Billy No-Mates777 Location: null Department: 777 Started: 28 February 2009 Left: null</span> |
We can compare that output with an SQL query performed on the database:
1
2
3
4
5
6
7
|
<span style="color: #800000;">mysql> select * from employee where department=777; +------+-------+-------------+--------------+------------------+-------+------------+ | id | first | last | municipality | started | ended | department | +------+-------+-------------+--------------+------------------+-------+------------+ | 988 | John | Jones | London | 1 February 2009 | NULL | 777 | | 1777 | Billy | No-Mates777 | NULL | 28 February 2009 | NULL | 777 | +------+-------+-------------+--------------+------------------+-------+------------+</span> |
Finally, after pressing return again, the application will remove all employees:
Main.java (part 9):
1
2
3
|
<span style="color: #3366ff;">session.deletePersistentAll(Employee.class); } }</span> |
As a final check, an SQL query confirms that all of the rows have been deleted from the ‘employee’ table.
1
2
|
<span style="color: #800000;">mysql> select * from employee; Empty set (0.00 sec)</span> |
Compiling and running the ClusterJ tutorial code
1 |
<span style="color: #800000;">javac -classpath /usr/local/mysql/share/mysql/java/clusterj-api.jar:. Main.java Employee.java</span> |
1
2
|
<span style="color: #800000;">java -classpath /usr/local/mysql/share/mysql/java/clusterj.jar:. -Djava.library.path=/usr/local/mysql/lib Main </span><span style="color: #800000;"> </span> |
Download the source code for this tutorial from here (together with the code for the up-coming ClusterJPA tutorial).