Class DefaultTableLoader
- All Implemented Interfaces:
AutoCloseable
- Author:
- Christopher Mindus
Field Summary
Modifier and TypeFieldDescriptionprotected final String[]
The column names.final Pattern
The comma delimiter string pattern "[whitespace],[whitespace]": '\s*,\s*'.protected final Connection
The connection.final Pattern
Matches a date as YYYY-MM-DD or YYYY/MM/DD where YYYY is 4 digits between 1000 and 2999, MM is two digits between 01 and 12, DD is two digits between 01 and 31.final Pattern
Pattern for date/time/timestamp separator characters '-/:.' and whitespace: '[-/:.\s]+'.final int
The default batch count of transactions before a batch-commit is done when inserting multiple rows.final Pattern
The Quoted String string pattern as: 'single \'quoted\' string', "double 'quoted' string", "double \"quoted\" string" The pattern is: '(['"])(?:.*?)(?<!\\)(?>\\\\)*?\1'.protected final PreparedStatement
The prepared statement.protected final ResultSetMetaData
The meta data for the table columns.final Pattern
The "NULL" string pattern with whitespace ignored around it, case insensitive.final Pattern
The Quoted String pattern '"string where ""doubled-double-quotes"" are made double-quotes"': '"((?:""|[^"])*)"'.protected final BufferedReader
The reader.The replacement strings.protected Pattern
The pattern for quoted strings: MUST BE one of:{@link #quotedStringPattern}
or{@link #escapedQuotedStringPattern}
.final Pattern
Matches a valid 24-hour time from 00:00:00-23:59:59 in hh:mm:ss or hh.mm.ss formats: '([01]?\d|2[0-3])[:.]([0-5]\d)[:.]([0-5]\d)', and allows time as e.g.final Pattern
Matches a timestamp as "YYYY-MM-DD hh:mm:ss.fff" or "YYYY/MM/DD hh.mm.ss.fffffffff" where YYYY is 4 digits between 1000 and 2999, MM is two digits between 01 and 12, DD is two digits between 01 and 31.final Pattern
Whitespace pattern.Constructor Summary
ConstructorDescriptionDefaultTableLoader
(InputStream in, Charset charSet, Connection conn, String tableName, boolean clearBefore, String... columns) Creates the table loader.DefaultTableLoader
(Reader reader, Connection conn, String tableName, boolean clearBefore, String... columns) Creates the table loader.Method Summary
Modifier and TypeMethodDescriptionvoid
close()
Closes the input stream or reader and the prepared SQL statement for row insertion: both are attempted to be closed, and if the input stream or reader throws an IOException the SQL statement will be closed.Gets the meta data for the columns of the table.protected void
processLine
(String line) Processes a line of data.void
Processes the reader.void
processReader
(int batchCount) Processes the reader.protected String
scanLineToStatement
(String line, Scanner scanner) Retrieves and sets all the column values in the prepared statement using the scanner.void
setStringPattern
(Pattern stringPattern) The pattern for quoted strings: MUST BE one of:{@link #quotedStringPattern}
or{@link #escapedQuotedStringPattern}
.void
setStringReplacement
(Map<String, String> replace) Adds a potential string replacement, but default it is the two characters "\n
" inside a double-quoted string.
Field Details
nullPattern
The "NULL" string pattern with whitespace ignored around it, case insensitive.commaPattern
The comma delimiter string pattern "[whitespace],[whitespace]": '\s*,\s*'.quotedStringPattern
The Quoted String pattern '"string where ""doubled-double-quotes"" are made double-quotes"': '"((?:""|[^"])*)"'.timePattern
Matches a valid 24-hour time from 00:00:00-23:59:59 in hh:mm:ss or hh.mm.ss formats: '([01]?\d|2[0-3])[:.]([0-5]\d)[:.]([0-5]\d)', and allows time as e.g. 10:12:12, 13.12.59, 13:56:00 or 8.14.00.datePattern
Matches a date as YYYY-MM-DD or YYYY/MM/DD where YYYY is 4 digits between 1000 and 2999, MM is two digits between 01 and 12, DD is two digits between 01 and 31. '[12]\d{3}[-/](0[1-9]|1[0-2])[-/](0[1-9]|[12]\d|3[01])'.timestampPattern
Matches a timestamp as "YYYY-MM-DD hh:mm:ss.fff" or "YYYY/MM/DD hh.mm.ss.fffffffff" where YYYY is 4 digits between 1000 and 2999, MM is two digits between 01 and 12, DD is two digits between 01 and 31. The time part is a valid 24-hour time from 00:00:00-23:59:59 in hh:mm:ss or hh.mm.ss formats with fractional nanoseconds in 'fffffffff' that may be omitted. '[12]\d{3}[-/](0[1-9]|1[0-2])[-/](0[1-9]|[12]\d|3[01])\s+([01]?\d|2[0-3])[:.]([0-5]\d)[:.]([0-5]\d)(\.[0-9]{1,9})?'.escapedQuotedStringPattern
The Quoted String string pattern as:- 'single \'quoted\' string',
- "double 'quoted' string",
- "double \"quoted\" string"
whitespacePattern
Whitespace pattern.dateTimeSeparatorPattern
Pattern for date/time/timestamp separator characters '-/:.' and whitespace: '[-/:.\s]+'.DEFAULT_BATCH_COUNT
public final int DEFAULT_BATCH_COUNTThe default batch count of transactions before a batch-commit is done when inserting multiple rows.- See Also:
reader
The reader.insertStatement
The prepared statement.conn
The connection.metaData
The meta data for the table columns.columns
The column names.replace
The replacement strings.stringPattern
The pattern for quoted strings: MUST BE one of:{@link #quotedStringPattern}
or{@link #escapedQuotedStringPattern}
.
{@link #quotedStringPattern}
.
Constructor Details
DefaultTableLoader
public DefaultTableLoader(InputStream in, Charset charSet, Connection conn, String tableName, boolean clearBefore, String... columns) throws SQLException Creates the table loader.- Parameters:
in
- The input stream.charSet
- The character set to use.conn
- The JDBC connection: the connection is never closed by this class. It is up to the called to close it. A finalcommit()
will be performed however.tableName
- The table name.clearBefore
- Flag to clear the table of all rows prior to inserting the data. Please note that this flag sometime cannot betrue
if there are constraints on e.g. other tables.columns
- The columns.- Throws:
NullPointerException
- If any parameter isnull
.IllegalArgumentException
- If thecolumns
are not specified.SQLException
- For SQL errors.
DefaultTableLoader
public DefaultTableLoader(Reader reader, Connection conn, String tableName, boolean clearBefore, String... columns) throws SQLException Creates the table loader.- Parameters:
reader
- The reader.conn
- The JDBC connection: the connection is never closed by this class. It is up to the called to close it. A finalcommit()
will be performed however.tableName
- The table name.clearBefore
- Flag to clear the table of all rows prior to inserting the data. Please note that this flag sometime cannot betrue
if there are constraints on e.g. other tables.columns
- The field names, leave empty in case the data contains all fields in correct ordering, thus indicies can be used instead of the names.- Throws:
NullPointerException
- If any parameter isnull
.IllegalArgumentException
- If thecolumns
are not specified.SQLException
- For SQL errors.
Method Details
getMetaData
Gets the meta data for the columns of the table.- Returns:
- The meta data of the result set from
SELECT column1, column2, ... FROM tableName
requested before rows are inserted into the table, but after a potential clear of the table.
close
Closes the input stream or reader and the prepared SQL statement for row insertion: both are attempted to be closed, and if the input stream or reader throws an IOException the SQL statement will be closed. If that one throws an SQLException, the potential IOException will be added as a suppressed exception. Otherwise the potential IOException is thrown.- Specified by:
close
in interfaceAutoCloseable
- Throws:
IOException
- For I/O errors.SQLException
- For SQL errors.
setStringReplacement
Adds a potential string replacement, but default it is the two characters "\n
" inside a double-quoted string. Doubled double-quotes inside such a string is replaced with a single double-quote.The default
replace
map would be as:replace=new HashMap<>(2); replace.put("\\n","\r\n"); replace.put("\"\"","\"");
- Parameters:
replace
- The map for String replacement.- Throws:
NullPointerException
- Ifreplace
isnull
.
setStringPattern
The pattern for quoted strings: MUST BE one of:{@link #quotedStringPattern}
or{@link #escapedQuotedStringPattern}
.
{@link #quotedStringPattern}
.- Parameters:
stringPattern
- The new String Pattern to use.- Throws:
IllegalArgumentException
- IfstringPattern
is not one of the values{@link #quotedStringPattern}
or{@link #escapedQuotedStringPattern}
.
processReader
Processes the reader.A subclass can override this method to perform specialized reader handling. The default is to read all lines until
EOF
and call{@link #processLine(line)}
for each line, skipping empty lines. Each line produces a new row in the table, then the prepared insert statement's parameters are cleared (before next row is added).- Throws:
IOException
- For stream reader I/O error.SQLException
- For SQL errors.
processReader
Processes the reader.A subclass can override this method to perform specialized reader handling. The default is to read all lines until
EOF
and call{@link #processLine(line)}
for each line, skipping empty lines. Each line produces a new row in the table, then the prepared insert statement's parameters are cleared (before next row is added).- Throws:
IOException
- For stream reader I/O error.SQLException
- For SQL errors.
processLine
Processes a line of data.A subclass can override this method to perform specialized line parsing to values.
- Parameters:
line
- The line, never empty string.- Throws:
IOException
- For stream reader I/O error.SQLException
- For SQL errors.
scanLineToStatement
Retrieves and sets all the column values in the prepared statement using the scanner.Subclasses can override this method to provide the line scanning to set the columns with its data using the indicies of the columns.
- Parameters:
line
- The line being scanned.scanner
- The scanner.- Returns:
- A warning string that will be logged, null for none.
- Throws:
IOException
- For stream reader I/O error.SQLException
- For SQL errors.