Request to improve the Export and Import of BLOB columns: the export appears to change the datatype, so the subsequent import is inaccurate.
Feedback from customer:
The operation will succeed, but the data is inaccurate. When you download to Excel, BLOB columns are exported as text (7b3030….). When you import it back, the tool assumes this is text and converts the data to (3634363137343631336……..) in the target column, which is a BLOB. The actual data is lost between source and target. I don't believe downloading a BLOB into Excel and importing it back into a BLOB in the target is the right way to handle BLOB columns.
There is no error at either step (import or export). The issue is that the export writes the BLOB as a string of hex values in Excel, and the import puts that string into the BLOB; in the process the content changes, because anything inserted into a BLOB column is converted into hex/binary format. Please try exporting and importing a BLOB column and you will see the issue: the content in the target does not match the source. The four options currently offered are not sufficient to handle BLOB columns; additional support may be needed for such columns.
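The mismatch the customer describes can be reproduced with a short sketch: the export hex-encodes the bytes, and a naive import stores the characters of that hex string rather than the decoded bytes. This is an illustration only, not ADS code; the class and helper names are made up.

import java.nio.charset.StandardCharsets;

// A minimal sketch (not ADS code) of the round trip described above.
public class BlobRoundTripDemo {

    // Convert raw bytes to a lowercase hex string, as the export does.
    static String toHex(byte[] data) {
        StringBuilder sb = new StringBuilder();
        for (byte b : data) {
            sb.append(String.format("%02x", b));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] original = {0x7b, 0x30, 0x30};               // source BLOB bytes
        String exported = toHex(original);                   // export writes "7b3030" as text

        // A naive import stores the characters of the hex string, not the
        // decoded bytes, so the stored value no longer matches the source.
        byte[] reimported = exported.getBytes(StandardCharsets.US_ASCII);

        System.out.println(exported);           // 7b3030
        System.out.println(toHex(reimported));  // 376233303330 -- content has changed
    }
}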
Which databases?
Forgot to add platforms:
Target is MySQL. The source was Oracle in this example.
I don't think we can support this as requested. Each blob value equates to a separate file, so exporting multiple rows would mean generating multiple files. IMO import/export is not the right place to handle blobs.
For this issue, we should probably add an option to exclude blob columns during export, and display a warning (e.g. "importing into a blob column will not work as intended") when importing into a blob column.
ADS currently exports BLOB columns based on the "File > Options > Results > Convert binary to hex" setting. However, the import does not take this setting into account, so the import does NOT import exactly what was exported for a BLOB column when this setting is turned ON. For example, the BLOB value is exported as a hex string but imported as a literal raw value.
For this issue, we need to fix the import so that it considers the "File > Options > Results > Convert binary to hex" setting. If this setting is OFF, the current behavior should continue. If this setting is ON, ADS needs to assume that the BLOB values in the import file are hex encoded, and import them as hex strings using the correct format for each supported database type. Note that import is implemented as a series of INSERT statements, and different database types may use different formats for inserting hex strings. For example, in DB2 LUW the insert statement looks like this: INSERT INTO some_table(..., blob_column) VALUES(..., blob(x'some_hex_string'))
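As a rough illustration of the per-database formatting this would require, the sketch below builds the VALUES fragment for a hex-encoded blob. Only the DB2 LUW blob(x'...') form comes from the comment above; the MySQL and SQL Server forms reflect their documented hex/binary literal syntax, and the method name and database labels are placeholders, not the actual ADS implementation.

// Placeholder helper, not the actual ADS import code. dbType labels are
// illustrative; verify each literal form against the vendor documentation.
static String blobLiteral(String dbType, String hex) {
    switch (dbType) {
        case "DB2 LUW":
            return "blob(x'" + hex + "')";   // INSERT ... VALUES(..., blob(x'ffd8...'))
        case "MySQL":
            return "x'" + hex + "'";         // MySQL hexadecimal literal
        case "SQL Server":
            return "0x" + hex;               // SQL Server binary constant
        default:
            return "'" + hex + "'";          // fall back to the current plain-string behavior
    }
}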
This issue will involve researching the correct format for each supported database; the import must work correctly for each of them.
Note that BLOB values can potentially be very large, so in this issue we need to handle low-memory and out-of-memory conditions gracefully. In such a condition ADS should NOT hang or crash, but should display an error dialog with the message:
There is insufficient memory to perform this operation. Please increase the amount of memory available to the application and the JVM according to the application documentation and restart.
This error dialog and message already exist in ADS, and the condition is already handled. That handling needs to be validated here, because BLOB values are large and the condition is more likely to occur when importing them.
The attached Import-Blob.patch contains the fix for a few databases (Oracle, DB2, etc.). It will need to be tested and fixed for all supported databases.
Revision no. 57503
Author: ajit.kulkarni
<aquadatastudio:#15687> Improve Export and Import for BLOBs cross databases
Changes:
1. Added support for importing the BLOB datatype during file import for some databases; work continues on the remaining databases that support the BLOB datatype or syntax.
@nhi Please find below the list of databases: those that are fixed, those that don't support BLOB (or don't support INSERT queries with the BLOB datatype), and those still being worked on.
Amazon Redshift (doesn't support the BLOB datatype)
Apache Cassandra
Apache Derby (INSERT query doesn't support the BLOB datatype)
Hive (doesn't support the BLOB datatype)
DB2
Greenplum (doesn't support the BLOB datatype)
Informix (INSERT query doesn't support the BLOB datatype)
MariaDB
MySQL
MS SQL Server (doesn't support the BLOB datatype)
Oracle
@Dev,
For MS SQL Server, we can insert BLOB data using INSERT statements. Here is a sample program; the blob data format looks like this:
String s = "0xC9CBBBCCCEB9C8CABCCCCEB9C9CBBB"; // SQL Server format
Table DDL:
CREATE TABLE dbo.test_blob ( name varchar(25) NULL, baddress varbinary(2000) NULL ) ON [PRIMARY] WITH ( DATA_COMPRESSION = NONE )
GO
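A minimal JDBC sketch of such a sample program, assuming the test_blob table from the DDL above; the connection URL and credentials are placeholders.

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

// Sketch only: inserts a binary value into dbo.test_blob using the 0x... literal
// form shown above. Connection details are placeholders.
public class SqlServerBlobInsert {
    public static void main(String[] args) throws Exception {
        String s = "0xC9CBBBCCCEB9C8CABCCCCEB9C9CBBB"; // SQL Server binary literal
        try (Connection con = DriverManager.getConnection(
                 "jdbc:sqlserver://localhost;databaseName=test", "user", "password");
             Statement stmt = con.createStatement()) {
            // The 0x... constant is not quoted, so SQL Server treats it as varbinary data.
            stmt.executeUpdate(
                "INSERT INTO dbo.test_blob(name, baddress) VALUES('row1', " + s + ")");
        }
    }
}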
@Asif,
As mentioned in the ticket, we are considering only the BLOB datatype for Import and Export, and are fixing it for the BLOB datatype.
Let us know your viewpoint on that.
Whatever changes are made to the ImportThread and ExportThread classes should also be made to the CoreLibImportThread and CoreLibExportThread classes. CoreLibImportThread/CoreLibExportThread are used to support the Fluidshell SQLImport/SQLExport commands. These should also be tested with the same test cases as ImportThread/ExportThread.
There is no corresponding class in Fluidshell for ImportExcelImportThread, as this is not supported.
@Tom, @Nhi Please find below the list of databases.
Database | BLOB Support | BLOB Insert Query Support
Amazon Redshift | N | N
Apache Cassandra | Y | Y
Apache Derby | Y | N
Apache Hive | N | N
DB2 | Y | Y
DB2 ISeries | N | N
DB2zOS | Y | Y
Generic | N | N
Google BigQuery | N | N
Greenplum | N | N
Informix | Y | N
Interbase | Y | Y
MS Excel | N | N
MSSQL | N | N
MariaDB | Y | Y
MongoDB | N | N
MySQL | Y | Y
Netezza | N | N
Oracle | Y | Y
ParAccel | N | N
PostgreSQL | N | N
SAP HANA | Y | Y
SQLite | Y | Y
Snowflake | N | N
Sybase ASE/Anywhere | N | N
Sybase IQ | Y | Y
Teradata | Y | Y
Vertica | N | N
VoltDB | N | N
@Ajit, please recheck the BLOB support for each database type where you currently indicate that it is not supported. Some databases support it but use a different name for the data type. When we say BLOB in this issue, we mean binary data types (BLOB stands for Binary Large OBject). For example, MS SQL Server calls it BINARY or VARBINARY; these mean the same as BLOB. Please see: https://www.developer.com/net/asp/article.php/3761486/Working-with-Binary-Large-Objects-BLOBs-Using-SQL-Server-and-ADONET.htm
As mentioned in the ticket, we are considering only the BLOB datatype for Import and Export and fixing it for the BLOB datatype.
@Ajit, please update the table you have listed here to include the actual blob data type name. As Nhi pointed out, we might have missed some blob datatypes.
As Tom noted, this will have implications for the Fluidshell sqlexport and sqlimport commands for blob support.
Thanks
@All Please find below the list of databases that support the BLOB datatype.
Database | BLOB | BLOB Insert Query | BINARY | BINARY Insert Query | Datatypes
Amazon Redshift | N | N | N | N |
Apache Cassandra | Y | Y | N | N |
Apache Derby | Y | Y | N | N |
Apache Hive | N | N | Y | Y | Binary
DB2 | Y | Y | N | N |
DB2 ISeries | N | N | Y | Y | Binary, Varbinary
DB2zOS | Y | Y | Y | Y | Binary, Varbinary
Generic | N | N | N | N |
Google BigQuery | N | N | N | N |
Greenplum | N | N | N | N |
Informix | Y | N | Y | N | Byte, Text
Interbase | Y | Y | N | N |
MS Excel | N | N | N | N |
MSSQL Azure | N | N | Y | Y | Binary, Varbinary
MSSQL | N | N | Y | Y | Binary, Varbinary
MariaDB | Y | Y | Y | Y | Binary
MongoDB | N | N | Y | Y | Binary
MySQL | Y | Y | Y | Y | Binary, Varbinary
Netezza | N | N | N | N |
Oracle | Y | Y | N | N |
ParAccel | N | N | N | N |
PostgreSQL | N | N | N | N |
SAP HANA | Y | Y | Y | Y | Binary, Varbinary
SQLite | Y | Y | N | N |
Snowflake | N | N | Y | Y | Binary, Varbinary
Sybase ASE/Anywhere | N | N | Y | Y | Binary, Varbinary
Sybase IQ | Y | Y | Y | Y | Binary, Varbinary
Teradata | Y | Y | Y | Y | Varbyte
Vertica | N | N | Y | Y | Binary, Varbinary, Long Varbinary
@Tom, @Asif I have added support for all databases that support the BLOB/Binary datatype, except Informix. I have gone through the Informix documentation but could not find helpful information; it seems that Informix does not support BLOBs in an INSERT query. For now, I insert a NULL value where the INSERT is not supported.
Please let me know if you find any helpful information.
@Ajit, the original patch already handles Informix. Using the latest from SVN, I've just tested importing blob data into an Informix database (Informix 172.24.1.140 v11.70), and it seems to work. The data is imported into the "t2" table in the "tom" database.
@Nhi, in the current implementation we set NULL while importing the file if the database does not support an INSERT query with BLOB (Informix). It generates a query such as INSERT INTO informix.import1(c1) VALUES(NULL).
I reverted the current implementation and applied the patch code, but the import fails with a syntax error. It generates a query such as INSERT INTO informix.import(c1) VALUES(x'd0cf11e0a1b11ae....')
I don't think that you can directly insert hex data into blob/clob/byte columns in Informix??
My test was based on Transaction Type = Batch, which is configured in the last page of the Import wizard. In this mode, the import uses prepared statements.
@Documentation: We need to document that importing BLOBs in Informix must use Transaction Type = Batch in order for the import to work correctly.
@Ajit: Please make sure that you test your code changes with all available Transaction Types in the Import wizard.
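For reference, a sketch of what the Batch (prepared statement) path amounts to, using the informix.import1 table mentioned above; the method and binding are illustrative, not the actual ImportThread code.

import java.io.ByteArrayInputStream;
import java.sql.Connection;
import java.sql.PreparedStatement;

// Illustrative sketch, not the actual ImportThread code: binding decoded blob
// bytes through a prepared statement instead of building a hex literal.
public class BatchBlobImportSketch {
    static void importRow(Connection con, byte[] blobBytes) throws Exception {
        try (PreparedStatement stmt =
                 con.prepareStatement("INSERT INTO informix.import1(c1) VALUES(?)")) {
            // Binding bytes directly sidesteps INSERT syntax limits such as Informix's.
            stmt.setBinaryStream(1, new ByteArrayInputStream(blobBytes), blobBytes.length);
            stmt.addBatch();
            stmt.executeBatch();
        }
    }
}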
Code changes look good.
So far, the import of the XLSX file into the Informix database does not work correctly. Please see the attached export-blob.xlsx file.
Note that if you import the attached export-blob.csv file into the same database table, the data will be imported correctly. You can check the correctness of the imported data as follows:
@QA: please test importing binary data into every database type that supports some form of binary data. Please import CSV, XLS and XLSX files, and verify that the binary data is imported correctly.
@Nhi, the fields in the CSV file/existing table data are longer than 32,767 characters, and Microsoft Excel supports only 32,767 characters per cell. If we try to put more than 32,767 characters into an Excel cell, the data gets truncated, and importing that file will produce incorrect results.
[Nhi]: Noted. Thanks, Ajit.
@Documentation: Please add this to the documentation.
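A small sketch of the limit described above, assuming the exported value is the hex text of the blob; the constant and method names are illustrative, not existing ADS code.

// Illustrative guard, not existing ADS code: Excel truncates cells beyond
// 32,767 characters, and each blob byte becomes two hex characters on export.
public class ExcelCellLimit {
    static final int EXCEL_MAX_CELL_CHARS = 32767;

    static boolean fitsInExcelCell(byte[] blobBytes) {
        return blobBytes.length * 2 <= EXCEL_MAX_CELL_CHARS;
    }
}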
@Nhi, the Derby INSERT query for the BLOB datatype only supports small data. If we try to insert large data, it throws the error "Caused by: ERROR 54002: Error for batch element #0: A string constant starting with 'x'ffd8ffe000104a46494600010101004800480000ffe2021c4943435f50&' is too long." with all transaction types.
After looking into the code, I found that if we add Derby support in the private boolean usePreparedStatement(ConnectionProperties cp) method of the ImportThread class and select the Batch transaction type, it works fine.
Is there any impact on other code if we add Derby support in the usePreparedStatement method so that importing large data works?
[Nhi] Should not have an impact on other code.
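A sketch of the change being proposed, assuming usePreparedStatement currently returns true only for certain database types; the getServerType accessor and the existing cases shown are assumptions, only the Derby addition is the point.

// Sketch of the proposed change; ConnectionProperties.getServerType() and the
// existing cases are assumptions -- only the Derby addition is the proposal.
private boolean usePreparedStatement(ConnectionProperties cp) {
    String serverType = cp.getServerType();
    return "Informix".equalsIgnoreCase(serverType)     // assumed existing case
        || "Interbase".equalsIgnoreCase(serverType)    // assumed existing case
        || "Derby".equalsIgnoreCase(serverType);       // proposed: use prepared statements for large blobs
}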
Refer to the link below regarding "Microsoft Excel supports only 32,767 characters per cell; adding more than 32,767 characters to an Excel cell truncates the data, so importing the same file will produce incorrect results".
Hi @Ajit,
Found issues in the Vertica database:
Note: for large binary data, I used Nhi's attached export-blob.csv and export-blob.xlsx files.
1.a) Binary datatype --> CSV --> importing small binary data (a Binary value cannot exceed 65000)
Also, for 15 rows imported using the Batch transaction type, data gets lost: the table displays different values after import.
1.b) Binary datatype --> Excel --> importing small binary data
1.c) Binary datatype --> Excel --> importing long binary data
2.a) Varbinary datatype --> Excel --> importing Varbinary data
3.a) Long Varbinary datatype --> Excel --> importing Long Varbinary data
@Tom @Nhi, in the Vertica database, if the hex data contains only digits, the code below gets executed while importing data from the file and the imported data is incorrect. Class ImportThread.java, line 1353:
if (NumberUtils.isNumber(columnInfo) && !(columnInfo.contains("x") || columnInfo.contains("X"))) {
    if (columnInfo.contains(".")) {
        stmt.setBytes(idx, AQHexUtils.hexToByte(Long.toHexString(Double.doubleToRawLongBits(Double.valueOf(columnInfo)))));
    } else {
        BigInteger bigInt = new BigInteger(columnInfo);
        stmt.setBytes(idx, bigInt.toByteArray());
    }
}
Is there any impact on any other functionality if we refactor this code as below?
stmt.setBytes(idx, AQHexUtils.hexToByte(columnObject.toString()));
[Nhi] Should be fine.
@Tom, @Nhi Hive: the Tools > Import Data menu is disabled for the Hive database, so we are not able to test importing data into Hive databases.
Could you help us enable the menu so we can test importing data?
Hi Ajit,
Let's not worry about Hive for this issue, as the Hive instances that haven't crashed don't allow ACID transactions.
Thanks,
Tom
@Tom, @Nhi We are not able to batch-import BLOB data in an InterBase database. We have tried the code changes below.
Added support for InterBase in the usePreparedStatement method of ImportThread.java.
Expressions tried:
1. stmt.setBlob(idx, new SerialBlob(bytes));
Result: import fails with "Error: Row: 3 -- javax.sql.rowset.serial.SerialBlob cannot be cast to java.lang.Long".
2. stmt.setBytes(idx, bytes);
Result: import runs continuously.
3. ByteArrayInputStream is = new ByteArrayInputStream(bytes);
stmt.setBinaryStream(idx, is, bytes.length);
Result: import runs continuously.
4. ByteArrayInputStream is = new ByteArrayInputStream(bytes);
stmt.setBinaryStream(idx, is);
Result: import runs successfully, but the blob columns end up containing an empty string.
Could you help us find the correct way to batch-import BLOB data in an InterBase database?
@Ajit: Regarding InterBase, I've experimented and it works with stmt.setBinaryStream(idx, is, length). There are other issues that need to be tweaked in order for it to work. Please see InterBase-Blob-Import.patch. Here I did a prototype and was able to successfully import the export-blob.csv file. Please use this patch as you see fit, but be sure to test it first. Also note that you'll need to make similar changes in CoreLibImportThread and ImportExcelImportThread.
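For clarity, a sketch of the binding that worked, with an explicit length; AQHexUtils.hexToByte and the column index are taken from the snippets earlier in this thread, and the class and method names here are illustrative.

import java.io.ByteArrayInputStream;
import java.sql.PreparedStatement;
import java.sql.SQLException;

// Sketch of the working InterBase binding described above; names are illustrative.
public class InterBaseBlobBinding {
    static void bindBlob(PreparedStatement stmt, int idx, byte[] bytes) throws SQLException {
        ByteArrayInputStream is = new ByteArrayInputStream(bytes);
        // The explicit length argument matters: the no-length overload left the
        // blob column empty in attempt 4 above.
        stmt.setBinaryStream(idx, is, bytes.length);
    }
}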
Functional Review:
I was trying to import CLOB into an Oracle database: Oracle 172.24.1.8 v12.2.0.2.0, Schema = TOM, Table = test_clob
When importing the export-small-blob.csv and export-small-blob.xlsx in BATCH mode, the content was imported with single quote characters around the CLOB values, which is incorrect. When I do the same import in FULL mode, it is imported correctly.
When importing the export-blob.csv file, the CLOB values are imported as TO_CLOB('ffd8fe...'), which is incorrect. The correct value should be ffd8fe...
I didn't test against other databases. Please be sure to test import of CLOB against other databases.
[Nhi] Looks good.
@Tom, @Nhi The Teradata database was working fine for the BLOB/CLOB datatypes, but recently started throwing the error below.
Hi Ajit,
Please add your test cases in the issue so I can debug them.
Thanks,
Tom
Hi Juhi,
I think this is because the database tom ran out of space. I increased the size and it now imports. Please make sure that you clean up after testing with the blob data. It can consume a lot of space.
Thanks,
Tom
@nhi
I am unable to connect to "MySQL 172.24.1.199 v8.0". Error screenshot.
Hi Viraaj,
Try it now. For some reason it was stopped.
Thanks, Tom
Hi Viraaj,
Only 50% of the automated test cases passed for me, even after multiple runs. Please see results here: https://idera.testrail.net/index.php?/runs/view/3047&group_by=cases:section_id&group_order=asc&group_id=20777
Hi All,
1.) Done with manual testing of this ticket.
2.) Added test cases in TestRail; see the link below:
3.) Results of these test cases (for the binary, varbinary, and blob datatypes) for all the supported databases are available in the Excel sheet; use the link below to see the results:
https://docs.google.com/spreadsheets/d/1WiDhxhzjLvXSw-h3GkiWKM-JRRbNa4g6GvbH_cQtjOc/edit?usp=sharing
Thanks,
Juhi
I am still not able to get all automated test cases to succeed: https://idera.testrail.net/index.php?/runs/view/3047&group_by=cases:section_id&group_order=asc&group_id=20777
Hi Tom,
There is an issue with Teradata. It throws an error like "No more room in database tom".
[TC] I cleaned up some space.
Hi @Nhi,
I have made some code updates in SVN #24105. Can you please try it again?
[Nhi] All test cases succeeded now.
QE Comment:
We have verified the fix for import/export of the BLOB datatype through both the ADS UI and Fluidshell on ADS v20.6.0-rc-2 for the following database servers:
The above databases were verified for import and export of all datatypes supported by ADS for the particular database servers on the following platforms:
--> Windows
--> Ubuntu
--> Mac OS
Test case results can be found at the link below:
https://idera.testrail.net/index.php?/cases/view/249516&group_by=cases:section_id&group_order=asc&group_id=21880
Issue #15687 | Verified | Fixed | Resolved
Completion: No due date
Fixed Build: ADS 20.6.0-dev-41-no-ofsc
No time estimate
1 issue link: relates to #10563 (Ability to edit BLOB/CLOB data in Table Editor)