The scripts are responsible for normalizing four different POS-Tagging annotated corpora to Penn TreeBank's simplified tagset.
The scripts included in this repository were developed in the paper Defining a state-of-the-art POS-tagging environment for Brazilian Portuguese clinical texts.
You are free to use and modify the scripts, as long as you cite properly.