# README

## kotlin-csv

![Version](https://img.shields.io/badge/version-1.9.3-blue.svg?cacheSeconds=2592000) [![License: Apache License 2.0](https://github.com/img.shields.io/badge/License-Apache%20License%202.0-yellow.svg) ](https://github.com/doyaaaaaken/kotlin-csv/blob/master/LICENSE/README.md)![CircleCI](https://circleci.com/gh/doyaaaaaken/kotlin-csv/tree/master.svg?style=svg) [![codecov](https://codecov.io/gh/doyaaaaaken/kotlin-csv/branch/master/graph/badge.svg) ](https://codecov.io/gh/doyaaaaaken/kotlin-csv)[![CodeFactor](https://www.codefactor.io/repository/github/doyaaaaaken/kotlin-csv/badge)](https://www.codefactor.io/repository/github/doyaaaaaken/kotlin-csv)

Pure Kotlin CSV Reader/Writer.

## Design goals

#### 1. Simple interface

* easy to setup
* use DSL so easy to read

#### 2. Automatic handling of I/O

* in Java, we always need to close file. but it's boilerplate code and not friendly for non-JVM user.
* provide interfaces which automatically close file without being aware.

#### 3. Multiplatform

* Kotlin Multiplatform projects support.

## Usage

### Download

#### Gradle

```gradle
// Gradle Kotlin DSL
implementation("com.github.doyaaaaaken:kotlin-csv-jvm:1.9.3") // for JVM platform
implementation("com.github.doyaaaaaken:kotlin-csv-js:1.9.3") // for Kotlin JS platform

// Gradle Groovy DSL
implementation 'com.github.doyaaaaaken:kotlin-csv-jvm:1.9.3' // for JVM platform
implementation 'com.github.doyaaaaaken:kotlin-csv-js:1.9.3' // for Kotlin JS platform
```

#### Maven

```maven
<dependency>
  <groupId>com.github.doyaaaaaken</groupId>
  <artifactId>kotlin-csv-jvm</artifactId>
  <version>1.9.3</version>
</dependency>
<dependency>
  <groupId>com.github.doyaaaaaken</groupId>
  <artifactId>kotlin-csv-js</artifactId>
  <version>1.9.3</version>
</dependency>
```

#### [kscript](https://github.com/holgerbrandl/kscript)

```kotlin
@file:DependsOn("com.github.doyaaaaaken:kotlin-csv-jvm:1.9.3") // for JVM platform
@file:DependsOn("com.github.doyaaaaaken:kotlin-csv-js:1.9.3") // for Kotlin JS platform
```

### Examples

#### CSV Read examples

**Simple case**

You can read csv file from `String`, `java.io.File` or `java.io.InputStream` object.\
No need to do any I/O handling. (No need to call `use`, `close` and `flush` method.)

```kotlin
// read from `String`
val csvData: String = "a,b,c\nd,e,f"
val rows: List<List<String>> = csvReader().readAll(csvData)

// read from `java.io.File`
val file: File = File("test.csv")
val rows: List<List<String>> = csvReader().readAll(file)
```

**Read with header**

```kotlin
val csvData: String = "a,b,c\nd,e,f"
val rows: List<Map<String, String>> = csvReader().readAllWithHeader(csvData)
println(rows) //[{a=d, b=e, c=f}]
```

**Read as `Sequence`**

`Sequence` type allows to execute lazily.\
It starts to process each rows before reading all row data.

Learn more about the `Sequence` type on [Kotlin's official documentation](https://kotlinlang.org/docs/reference/sequences.html).

```kotlin
csvReader().open("test1.csv") {
    readAllAsSequence().forEach { row: List<String> ->
        //Do something
        println(row) //[a, b, c]
    }
}

csvReader().open("test2.csv") {
    readAllWithHeaderAsSequence().forEach { row: Map<String, String> ->
        //Do something
        println(row) //{id=1, name=doyaaaaaken}
    }
}
```

NOTE: `readAllAsSequence` and `readAllWithHeaderAsSequence` methods can only be called within the `open` lambda block. The input stream is closed after the `open` lambda block.

**Read line by line**

If you want to handle line-by-line, you can do it by using `open` method. Use `open` method and then use `readNext` method inside nested block to read row.

```kotlin
csvReader().open("test.csv") {
    readNext()
}
```

**Read in a `Suspending Function`**

```kotlin
csvReader().openAsync("test.csv") {
    val container = mutalbeListOf<List<String>>()
    delay(100) //other suspending task
    readAllAsSequence().asFlow().collect { row ->
        delay(100) // other suspending task
        container.add(row)
    }
}
```

Note: `openAsync` can be and only be accessed through a `coroutine` or another `suspending` function

**Customize**

When you create CsvReader, you can choose read options:

```kotlin
// this is tsv reader's option
val tsvReader = csvReader {
    charset = "ISO_8859_1"
    quoteChar = '"'
    delimiter = '\t'
    escapeChar = '\\'
}
```

| Option                         | default value | description                                                                                                                                                                                                                                                                                                     |
| ------------------------------ | ------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| logger                         | *no-op*       | Logger instance for logging debug information at runtime.                                                                                                                                                                                                                                                       |
| charset                        | `UTF-8`       | Charset encoding. The value must be supported by [java.nio.charset.Charset](https://docs.oracle.com/javase/8/docs/api/java/nio/charset/Charset.html).                                                                                                                                                           |
| quoteChar                      | `"`           | Character used to quote fields.                                                                                                                                                                                                                                                                                 |
| delimiter                      | `,`           | <p>Character used as delimiter between each field.<br>Use <code>"\t"</code> if reading TSV file.</p>                                                                                                                                                                                                            |
| escapeChar                     | `"`           | <p>Character to escape quote inside field string.<br>Normally, you don't have to change this option.<br>See detail comment on <a href="https://github.com/doyaaaaaken/kotlin-csv/blob/master/src/commonMain/kotlin/com/github/doyaaaaaken/kotlincsv/dsl/context/CsvReaderContext.kt">ICsvReaderContext</a>.</p> |
| skipEmptyLine                  | `false`       | Whether to skip or error out on empty lines.                                                                                                                                                                                                                                                                    |
| autoRenameDuplicateHeaders     | `false`       | Whether to auto rename duplicate headers or throw an exception.                                                                                                                                                                                                                                                 |
| ~~skipMissMatchedRow~~         | `false`       | Deprecated. Replace with appropriate values in `excessFieldsRowBehaviour` and `insufficientFieldsRowBehaviour`, e.g. both set to `IGNORE`. ~~Whether to skip an invalid row. If `ignoreExcessCols` is true, only rows with less than the expected number of columns will be skipped.~~                          |
| excessFieldsRowBehaviour       | `ERROR`       | Behaviour to use when a row has more fields (columns) than expected. `ERROR` (default), `IGNORE` (skip the row) or `TRIM` (remove the excess fields at the end of the row to match the expected number of fields).                                                                                              |
| insufficientFieldsRowBehaviour | `ERROR`       | Behaviour to use when a row has fewer fields (columns) than expected. `ERROR` (default), `IGNORE` (skip the row) or `EMPTY_STRING` (replace missing fields with an empty string).                                                                                                                               |

#### CSV Write examples

**Simple case**

You can start writing csv in one line, no need to do any I/O handling (No need to call `use`, `close` and `flush` method.):

```kotlin
val rows = listOf(listOf("a", "b", "c"), listOf("d", "e", "f"))
csvWriter().writeAll(rows, "test.csv")

// if you'd append data on the tail of the file, assign `append = true`.
csvWriter().writeAll(rows, "test.csv", append = true)

// You can also write into OutpusStream.
csvWriter().writeAll(rows, File("test.csv").outputStream())
```

You can also write a csv file line by line by `open` method:

```kotlin
val row1 = listOf("a", "b", "c")
val row2 = listOf("d", "e", "f")

csvWriter().open("test.csv") {
    writeRow(row1)
    writeRow(row2)
    writeRow("g", "h", "i")
    writeRows(listOf(row1, row2))
}
```

**Write in a `Suspending Function`**

```kotlin
val rows = listOf(listOf("a", "b", "c"), listOf("d", "e", "f")).asSequence()
csvWriter().openAsync(testFileName) {
    delay(100) //other suspending task
    rows.asFlow().collect {
        delay(100) // other suspending task
        writeRow(it)
    }
}
```

**Write as String**

```kotlin
val rows = listOf(listOf("a", "b", "c"), listOf("d", "e", "f"))
val csvString: String = csvWriter().writeAllAsString(rows) //a,b,c\r\nd,e,f\r\n
```

**long-running write (manual control for file close)**

If you want to close a file writer manually for performance reasons (e.g. streaming scenario), you can use `openAndGetRawWriter` and get a raw `CsvFileWriter`.\
**DO NOT forget to `close` the writer!**

```kotlin
val row1 = listOf("a", "b", "c")

@OptIn(KotlinCsvExperimental::class)
val writer = csvWriter().openAndGetRawWriter("test.csv")
writer.writeRow(row1)
writer.close()
```

**Customize**

When you create a CsvWriter, you can choose write options.

```kotlin
val writer = csvWriter {
    charset = "ISO_8859_1"
    delimiter = '\t'
    nullCode = "NULL"
    lineTerminator = "\n"
    outputLastLineTerminator = true
    quote {
        mode = WriteQuoteMode.ALL
        char = '\''
    }
}
```

| Option                   | default value    | description                                                                                                                                                                                                                                                                                                                                          |
| ------------------------ | ---------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| charset                  | `UTF-8`          | Charset encoding. The value must be supported by [java.nio.charset.Charset](https://docs.oracle.com/javase/8/docs/api/java/nio/charset/Charset.html).                                                                                                                                                                                                |
| delimiter                | `,`              | <p>Character used as delimiter between each fields.<br>Use <code>"\t"</code> if reading TSV file.</p>                                                                                                                                                                                                                                                |
| nullCode                 | `(empty string)` | Character used when a written field is null value.                                                                                                                                                                                                                                                                                                   |
| lineTerminator           | `\r`             | Character used as line terminator.                                                                                                                                                                                                                                                                                                                   |
| outputLastLineTerminator | `true`           | Output line break at the end of file or not.                                                                                                                                                                                                                                                                                                         |
| prependBOM               | `false`          | Output BOM (Byte Order Mark) at the beginning of file or not.                                                                                                                                                                                                                                                                                        |
| quote.char               | `"`              | Character to quote each fields.                                                                                                                                                                                                                                                                                                                      |
| quote.mode               | `CANONICAL`      | <p>Quote mode.<br>- <code>CANONICAL</code>: Not quote normally, but quote special characters (quoteChar, delimiter, line feed). This is <a href="https://tools.ietf.org/html/rfc4180#section-2">the specification of CSV</a>.<br>- <code>ALL</code>: Quote all fields.<br>- <code>NON\_NUMERIC</code>: Quote non-numeric fields. (ex. 1,"a",2.3)</p> |

## Links

**Documents**

* [Change Logs](https://github.com/doyaaaaaken/kotlin-csv/releases)

**Libraries which use kotlin-csv**

* [kotlin-grass](https://github.com/blackmo18/kotlin-grass): Csv File to Kotlin Data Class Parser.

## Miscellaneous

### 🤝 Contributing

Contributions, [issues](https://github.com/doyaaaaaken/kotlin-csv/issues) and feature requests are welcome! If you have questions, ask away in [Kotlin Slack's](https://kotlinlang.slack.com) `kotlin-csv` room.

### 💻 Development

```sh
git clone git@github.com:doyaaaaaken/kotlin-csv.git
cd kotlin-csv
./gradlew check
```

### Show your support

Give a ⭐️ if this project helped you!

### 📝 License

Copyright © 2024 [jsoizo](https://github.com/jsoizo). This project is licensed under [Apache 2.0](https://github.com/doyaaaaaken/kotlin-csv/blob/master/LICENSE/README.md).

***

*This project is inspired ❤️ by* [*scala-csv*](https://github.com/tototoshi/scala-csv)

*This README was generated with ❤️ by* [*readme-md-generator*](https://github.com/kefranabg/readme-md-generator)

### Acknowledgments

This project was originally created by [@doyaaaaaken](https://github.com/doyaaaaaken). The initial work and contributions are greatly appreciated.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://kenta-koyama-biz.gitbook.io/kotlin-csv/readme.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
