Golang : Read large file with bufio.Scanner cause token too long error
There are couple of ways to read files with Golang and one of the common way to read in file is with bufio.NewScanner() function. For example :
package main
import (
"bufio"
"fmt"
"os"
)
func main() {
file, err := os.Open("somefile.dat")
if err != nil {
panic(err.Error())
}
defer file.Close()
reader := bufio.NewReader(file)
scanner := bufio.NewScanner(reader)
scanner.Split(bufio.ScanBytes)
for scanner.Scan() {
fmt.Println(scanner.Bytes())
}
if err := scanner.Err(); err != nil {
fmt.Println("error while reading :", err)
}
}
Most of the time, things should be fine and dandy. However, there will be cases when the program attempts to read in a very large file and it will throw out this error message :
"bufio.Scanner: token too long"
This error message was caused by the input token size that exceeded the bufio.MaxScanTokenSize
buffer size. The allocated size is MaxScanTokenSize = 64 * 1024
To read in large files, it is recommended to use bufio.Reader.ReadLine instead.
References :
See also : Golang : Scanf function weird error in Windows
By Adam Ng
IF you gain some knowledge or the information here solved your programming problem. Please consider donating to the less fortunate or some charities that you like. Apart from donation, planting trees, volunteering or reducing your carbon footprint will be great too.
Advertisement
Tutorials
+10.7k Golang : Concurrency and goroutine example
+4.4k Swift : Convert (cast) Float to Int or Int32 value
+5.2k Golang : How to search a list of records or data structures
+8.5k Golang : Create unique title slugs example
+19.9k Golang : Clean up null characters from input data
+8.1k Golang : Populate or initialize struct with values example
+9.2k Golang : Find and replace data in all files recursively
+4.9k Golang : Error handling methods
+5.2k Linux/MacOSX : Search for files by filename and extension with find command
+6.2k Golang : Get Alexa ranking data example
+15.7k Golang : File path independent of Operating System
+10k Golang : Sieve of Eratosthenes algorithm