Updated on 2023-04-28 GMT+08:00

Defining Inapplicable Data Row

Scenario

BulkLoad supports the function of defining inapplicable data rows. The inapplicable data rows are not stored in HBase. Instead, these data rows are stored in a specific file.

You can define multiple methods in configuration.xml for importing data in batches.

The column name consists of letters, digits, and underscores and cannot contain any special characters.

Procedure

Details about how to define inapplicable data rows are as follows:

<!-- Define bad line filter rule --> 
<badlines>SMS_ID &lt; 7000 &amp;&amp; SMS_NAME == 'HBase'</badlines>

SMS_ID < 7000 && SMS_NAME == 'HBase'

Table 1 lists the operators in <badlines> and corresponding parameter types.

Table 1 Operators and corresponding parameter types

Operator

Parameter Type

&&

The parameter type is Boolean.

&

The parameter type is integer.

|

The parameter type is integer.

^

The parameter type is integer.

/

The parameter type is digit.

==

The parameter type is string.

>=

The parameter type is digit.

>

The parameter type is digit.

<<

The parameter type is integer.

<=

The parameter type is digit.

<

The parameter type is digit.

%

The parameter type is digit.

*

The parameter type is digit.

!=

The parameter type is string.

||

The parameter type is Boolean.

+

The parameter type is digit and string.

>>

The parameter type is integer.

-

The parameter type is string.

>>>

The parameter type is integer.