Delimited File

1. Load Metadata

Source
The source of your file. This can be local upload, Amazon S3, or Azure Blob Storage.
delimiter
The delimiter character, use U+#### syntax (e.g. U+0001) for unicode characters
textQualifier
The text qualifier character, e.g. " (which would be represented as "")
headerRowsToIgnore
The number of records from the top of the file to ignore before the data starts (includes column header). See note below for more on this parameter.
encoding
The encoding of the file, defaults to UTF8 if not specified, but also supports UTF8_BOM, ASCII, and UTF16
path
The path to the source file to load, this can reference an execution parameter (e.g. @file_path)
If you use both useHeaderRecord="true" and HeaderRowsToIgnore = 1, two rows will be ignored. Refer to the below to ensure you are receiving the results you want: One row as headers: useHeaderRecord="true" and HeaderRowsToIgnore = 0
Two rows as headers: useHeaderRecord="true" and HeaderRowsToIgnore = 1 Three rows as headers: useHeaderRecord="true" and HeaderRowsToIgnore = 2
Image 1: Load the Metadata

2. Schema

  1. 1.
    Add in your applicable column(s) (Image 2). See the documentation here for further details on each column type.
Image 2: Schema

3. Filter

  1. 1.
    You may choose to use CQL to create a filter (Image 3). Review the documentation here for more on filters.
Image 3: Adding a Filter
<DelimitedDataSource
delimiter=","
textQualifier="&quot;"
headerRowsToIgnore="0"
path="C:\Users\Cinchy\Sample.csv" encoding="UTF8">
<Schema>
<Column name="firstname" ordinal="1" dataType="Text" maxLength="50"
isMandatory="false" validateData="false" trimWhitespace="true" description=""/>
<Column name="lastname" ordinal="2" dataType="Text" maxLength="50"
isMandatory="false" validateData="false" trimWhitespace="true" description=""/>
<CalculatedColumn name="lob" formula="@lob" dataType="Text" maxLength="100"
isMandatory="false" validateData="false" description="" />
<CalculatedColumn name="name" formula="CONCAT(firstname, lastname)"
dataType="Text" maxLength="100" isMandatory="false" validateData="false"
description="" />
<Column name="net worth" ordinal="3" dataType="Number" maxLength="50"
isMandatory="false" validateData="true" trimWhitespace="true" description="">
<Transformations>
<StringReplacement pattern="\$" replacement="" />
</Transformations>
</Column>
</Schema>
<Filter/>
</DelimitedDataSource>