Skip to content
GitHubXDiscord

Classifier

The Classifier resource allows you to manage AWS Glue Classifiers which are used to categorize the data in AWS Glue Data Catalog. Classifiers help AWS Glue to infer the schema of the data you are working with, making it easier to transform and analyze data.

Create a basic JSON classifier for your data.

import AWS from "alchemy/aws/control";
const jsonClassifier = await AWS.Glue.Classifier("myJsonClassifier", {
JsonClassifier: {
Name: "myJsonClassifier",
JsonPath: "$.data[*]",
Classification: "json"
}
});

Create a Grok classifier with additional settings for pattern matching.

const grokClassifier = await AWS.Glue.Classifier("myGrokClassifier", {
GrokClassifier: {
Name: "myGrokClassifier",
Classification: "log",
GrokPattern: "%{COMBINEDAPACHELOG}",
CustomPatterns: "MYCUSTOMPATTERN %{GREEDYDATA:message}"
}
});

Create an XML classifier to help with XML data parsing.

const xmlClassifier = await AWS.Glue.Classifier("myXmlClassifier", {
XMLClassifier: {
Name: "myXmlClassifier",
Classification: "xml",
RowTag: "record"
}
});

Set up a CSV classifier that can recognize CSV formatted data.

const csvClassifier = await AWS.Glue.Classifier("myCsvClassifier", {
CsvClassifier: {
Name: "myCsvClassifier",
Classification: "csv",
Delimiter: ",",
QuoteSymbol: "\"",
ContainsHeader: "PRESENT"
}
});