Working with DataFrames
The redact function can be called as a user-defined function (UDF) on a DataFrame column.
Before you do this, you must install pandas.
pip install pandas
The following example:
Reads a CSV file.
Redacts a given column.
Writes the CSV back to disk.
from tonic_textual.redact_api import TextualNer
import pandas as pd
ner = TextualNer()
df = pd.read_csv('file.csv')
# Let's say there is a notes column in the CSV containing unstructured text
df['notes'] = df['notes'].apply(lambda x: ner.redact(x).redacted_text if not pd.isnull(x) else None)
# index=False keeps the DataFrame's row index from being written as an extra column
df.to_csv('file_redacted.csv', index=False)
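The same pattern can be wrapped in a small reusable helper. The sketch below is not part of the tonic_textual SDK: redact_column and fake_redact are hypothetical names, and fake_redact is a stand-in for ner.redact(x).redacted_text so that the example runs without an API key. It redacts each distinct value only once, which may reduce the number of API calls when a column contains repeated text.

```python
import pandas as pd


def redact_column(df, column, redact_fn):
    """Return a copy of df with `column` passed through redact_fn.

    redact_fn is any callable that maps a string to a redacted string.
    Null cells are never passed to redact_fn and stay null (as NaN).
    """
    out = df.copy()
    # Redact each distinct non-null value once, then map results back to rows.
    unique_values = out[column].dropna().unique()
    mapping = {value: redact_fn(value) for value in unique_values}
    # Values missing from the mapping (the nulls) become NaN.
    out[column] = out[column].map(mapping)
    return out


# Hypothetical stand-in for the real redaction call, for illustration only.
def fake_redact(text):
    return text.replace("Alice", "[NAME]")


df = pd.DataFrame(
    {"id": [1, 2, 3], "notes": ["Alice called", None, "Alice called"]}
)
redacted = redact_column(df, "notes", fake_redact)
print(redacted)
```

With the real client, the callable would presumably be lambda text: ner.redact(text).redacted_text, using the ner object from the example above.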