Streak2O: Data Augmentation for Handwritten Text Recognition in Neural Networks
| dc.contributor.advisor | Teixeira, Marvi | |
| dc.contributor.author | Beltran Feliciano, Eduardo J. | |
| dc.date.accessioned | 2021-08-17T00:10:56Z | |
| dc.date.available | 2021-08-17T00:10:56Z | |
| dc.date.issued | 2021 | |
| dc.description | Design Project Article for the Graduate Programs at Polytechnic University of Puerto Rico | en_US |
| dc.description.abstract | Streak2O is a machine learning data augmentation algorithm based on the combination of two other independent algorithms: Streak and Droplet. These three augmentations are implemented as non-trainable TensorFlow custom Keras layers to optimize execution time in a GPU based environment. They generate configurable random artifacts that imitate real life handwritten historical document or manuscript water damage and document mishandling. Testing this augmentation algorithm with small subsets of the NIST-SD19 dataset on a convolutional neural network architecture shows that they can help reduce neural network overfitting falling partially into the category of synthetic data generation. Key Terms ⎯ Handwritten Text Recognition, Machine Learning, Synthetic Data Augmentation, TensorFlow. | en_US |
| dc.identifier.citation | Beltran Feliciano, E. J. (2021). Streak2O: Data Augmentation for Handwritten Text Recognition in Neural Networks [Unpublished manuscript]. Graduate School, Polytechnic University of Puerto Rico. | en_US |
| dc.identifier.uri | http://hdl.handle.net/20.500.12475/1128 | |
| dc.language.iso | en_US | en_US |
| dc.publisher | Polytechnic University of Puerto Rico | en_US |
| dc.relation.haspart | San Juan | en_US |
| dc.relation.ispartof | Computer Engineering Program; | |
| dc.relation.ispartofseries | Spring-2021; | |
| dc.rights.holder | Polytechnic University of Puerto Rico, Graduate School | en_US |
| dc.rights.license | All rights reserved | en_US |
| dc.subject.lcsh | Polytechnic University of Puerto Rico--Graduate students--Research | en_US |
| dc.subject.lcsh | Polytechnic University of Puerto Rico--Graduate students--Posters | en_US |
| dc.subject.lcsh | Writing--Identification | en_US |
| dc.subject.lcsh | Neural networks (Computer science) | |
| dc.title | Streak2O: Data Augmentation for Handwritten Text Recognition in Neural Networks | en_US |
| dc.type | Article | en_US |
Files
Original bundle
1 - 2 of 2
Loading...
- Name:
- PUPR_SP21_MCpE_Eduardo J. Beltran Feliciano_Article.pdf
- Size:
- 443.41 KB
- Format:
- Adobe Portable Document Format
- Description:
- PUPR_SP21_MCpE_Eduardo J. Beltran Feliciano_Article
Loading...
- Name:
- PUPR_SP21_MCpE_Eduardo J. Beltran Feliciano_Poster.pdf
- Size:
- 1.01 MB
- Format:
- Adobe Portable Document Format
- Description:
- PUPR_SP21_MCpE_Eduardo J. Beltran Feliciano_Poster
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.63 KB
- Format:
- Item-specific license agreed upon to submission
- Description: