Hello, I'm

Darwin Saire

I am a Machine Learning Engineer, Data Scientist, and NLP Developer; Ph.D. at Recod.ai laboratory in the Institute of Computer (IC) at the University of Campinas (UNICAMP), Brazil.

In my years of research, I improved my knowledge of machine learning and deep learning by collecting, cleaning, and interpreting data.
My skills include but are not limited to good reasoning, adaptability, responsibility, hard work, good relationships, good capacity for abstraction and problem-solving. Furthermore, I have experience with the different required frameworks (e.g., PyTorch, TensorFlow).

As part of my professional experience, I am working modeling, training, and interpreting the results of NLP models for SiDi - Samsung, located in Campinas - Brazil, developing the Bixby application for mobile, TVs, watches, and fridge devices.

About Me

I'm Darwin Saire Pilco

I have always been deeply interested in statistics, analytics, and mathematics, which led me to study data science. I graduated with a bachelor's degree in Systems Engineering (Computer Science) at the National University of San Agustin, Peru. I followed this with a master's degree in Computer Science at the University of Campinas, Brazil. Currently, I am finishing my Ph.D. at the University of Campinas, Brazil, and the University of Lorraine, France.

My research focused on semantic segmentation tasks and discovered a learning representation (i.e., context embedding) with a helpful structure and behavior to address the loss of spatial precision problem present in popular deep learning models. Specifically, I model the latent space (i.e., give a structure) through Gaussian mixing, providing a clustering behavior to the features, which positively impacts the final result of the semantic segmentation.

Interests

  Machine Learning.

  Deep Learning.

  Natural Language Processing.

  Pattern Recognition.

  Computer Vision.

Language

  Spanish (native);

  Portuguese (proficient).

  English (intermediate).

Education

2024 - Present  

MBA in Software Engineering

Currently, I'm studying under University of São Paulo (USP)/ESALQ, Brazil.

2019 - 2020  

Ph.D. internship at LORIA Institute

Ph.D. internship in partnership with INRIA, at the University of Lorraine, France.

2017 - 2022  

Ph.D. in Computer Science

Completed with GPA 3.85/4.0, at the University of Campinas, Brazil.

2015 - 2016  

M.Sc. in Computer Science

Completed with GPA 3.85/4.0, at the University of Campinas, Brazil.

2008 - 2013  

B.Sc. in Computer Science

Completed at the National University of San Agustin, Peru.

Experience

Apr. 2022 - Present  

Machine Learning, NLP Developer

Model development for classification in Samsung's virtual assistant (Bixby), for mobile, watch, TV, and fridge devices.
SiDi - Samsung Institute.

Aug. 2016 - Feb. 2017  

Software Consultant

Developed automatic tests to verify correct communication between Salesforce and JIRA, improving the efficiency in ~ 80%.
zAgile Inc.

Aug. 2013 - Jan. 2015  

Researcher - Developer

Implemented GLCM, Gabor, and LBP algorithms for feature extraction and built median and gaussian filters for generating candidate regions, achieving ~ 99% of sensitivity.

Implemented and evaluated CIEL*a*b* algorithm and color-matching functions, in quality control process,achieving mAP ~ 87%.
Incalpaca TPX S.A



Skills

AWS
Jira
Json
OpenCV
NLTK
Kubeflow
Latex
Selenium
Pytest
Pylint
RestAPI
Docker
Conda
Pandas
Scipy
Jupyter
JS
Pillow
Python
C
AndroidStudio
C++
Java
SSH
Qt
HTML
MySQL
Sklearn
TF
TensorBoard
TFLite
Pytorch
NumPy
Matplotlib
GPTchat
Tmux
GitHub
Scikit-image
Keras

Certificates

deeplearningAI

Natural Language Processing in TensorFlow

DeepLearning.AI and Coursera
Issued Dec 2023 - No Expiration Date
Credential ID XH9YJWPC3DGG

Coursera

Browser-based Models with TensorFlow.js

Coursera Course Certificates
Issued Apr 2023 - No Expiration Date
Credential ID JFQYSDSEYW2H

deeplearningAI

Generative Adversarial Networks (GANs) Specialization

DeepLearning.AI and Coursera
Issued Feb 2023 - No Expiration Date
Credential ID 3JP4XS4BQLB8

deeplearningAI

Apply Generative Adversarial Networks (GANs)

DeepLearning.AI and Coursera
Issued Feb 2023 - No Expiration Date
Credential ID JRETTJHW3M3J

deeplearningAI

Build Better Generative Adversarial Networks (GANs)

DeepLearning.AI and Coursera
Issued Dec 2022 - No Expiration Date
Credential ID P3FDAZN67FXJ

deeplearningAI

Build Basic Generative Adversarial Networks (GANs)

DeepLearning.AI and Coursera
Issued Oct 2022 - No Expiration Date
Credential ID DEUWK2HZ23SJ

Coursera

Optimizing Machine Learning Performance

Coursera Course Certificates
Issued Sep 2022 - No Expiration Date
Credential ID 8WYV6TFT6XRZ

deeplearningAI

Natural Language Processing Specialization

DeepLearning.AI and Coursera
Issued Aug 2021 - No Expiration Date
Credential ID 8XMEX9AHESPX

deeplearningAI

Natural Language Processing with Attention Models

DeepLearning.AI and Coursera
Issued Aug 2021 - No Expiration Date
Credential ID ZDV8R8QE3J5T

deeplearningAI

Natural Language Processing with Probabilistic Models

DeepLearning.AI and Coursera
Issued Jul 2021 - No Expiration Date
Credential ID AP47Z5LZCRNV

deeplearningAI

Natural Language Processing with Probabilistic Models

DeepLearning.AI and Coursera
Issued Jul 2021 - No Expiration Date
Credential ID VFE7Y9D4RDQ4

deeplearningAI

Natural Language Processing with Classification and Vector Spaces

DeepLearning.AI and Coursera
Issued Jun 2021 - No Expiration Date
Credential ID H5L73XTDZZQG

HSE University

Bayesian Methods for Machine Learning

Higher School of Economics and Coursera
Issued Aug 2018 - No Expiration Date
Credential ID A97CKSJLJBS9

deeplearningAI

Deep Learning Specialization

DeepLearning.AI and Coursera
Issued May 2018 - No Expiration Date
Credential ID VQ3XYBPMFRS2

deeplearningAI

Convolutional Neural Networks

DeepLearning.AI and Coursera
Issued May 2018 - No Expiration Date
Credential ID EGJFPMTA5QMK

deeplearningAI

Sequence Models

DeepLearning.AI and Coursera
Issued May 2018 - No Expiration Date
Credential ID GCDK8UE3LPN5

deeplearningAI

Structuring Machine Learning Projects

DeepLearning.AI and Coursera
Issued Apr 2018 - No Expiration Date
Credential ID E3EDD7BH6EWZ

deeplearningAI

Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization

DeepLearning.AI and Coursera
Issued Mar 2018 - No Expiration Date
Credential ID UBG5Q9B37GTP

deeplearningAI

Neural Networks and Deep Learning

DeepLearning.AI and Coursera
Issued Feb 2018 - No Expiration Date
Credential ID CRA92D8T4BD8

Toronto University

Neural Networks for Machine Learning

University of Toronto and Coursera
Issued Feb 2018 - No Expiration Date
Credential ID KJ4CY7EWM4L6

Stanford University

Computer Science

Stanford University
Issued Sep 2014 - No Expiration Date
Credential ID d781a200

Illinois University

Heterogeneous Parallel Programming - CUDA

University of Illinois and Coursera
Issued Mar 2014 - No Expiration Date
Credential ID DMS97ATJ42

Publications

PHGMM

Global and Local Features through Gaussian Mixture Models on Image Semantic Segmentation

Darwin Saire, Adin Ramirez Rivera

IEEE Access - journal, 2022

Paper

Code

GNNBlock

Graph Neural Blocks on Segmentation

Darwin Saire, Adin Ramirez Rivera

Workshop on International Conference on Computer Vision (ICCV), 2022

Paper

Poster

HG

Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task

Darwin Saire, Adin Ramirez Rivera

IEEE Access- journal, 2021

Paper

Code

ICPR2020

Documents Counterfeit Detection Through a Deep Learning Approach

Darwin Saire, Salvatore Tabbone

International Conference on Pattern Recognition (ICPR), 2020

Paper

WICML2020

Semantic Segmentation Through Graph Neural Network Blocks

Darwin Saire, Salvatore Tabbone, Adin Ramirez Rivera

Workshop on International Conference on Machine Learning (ICML), 2020

Paper

Poster

WNIPS2019

Semantic Segmentation on Image Using Multi-Task Hourglass Networks

Darwin Saire, Adin Ramirez Rivera

Workshop on Advances in Neural Information Processing Systems (NeurIPS), 2019

Paper

Poster

WICML2019

Graph Learning Network: A Structure Learning Algorithm

Darwin Saire, Adin Ramirez Rivera

Workshop on International Conference on Machine Learning (ICML), 2019

Paper

Poster

Code

gcn_model

Graph Convolutional Network for Semantic Segmentation Task

Darwin Saire, Adin Ramirez Rivera

Poster on Machine Learning Summer School (MLSS), 2018

Paper

SIBGRAPI

Multi-scale Morphological Image Simplification Based on Extrema Relationships: Improvements and Applications

Darwin Saire, Neucimar Jeronimo Leite

Conference on Graphics, Patterns and Images (SIBGRAPI), 2016

Paper

Projects

data-traduction-model

Neural Machine Translation

Neural Machine Translation (NMT) model to translate human readable dates (“25th of June, 2009”) into machine readable dates (“2009-06-25”).

Code

emojifier-model

NLP Emoji Generation

The emoji project will help you make your text messages more expressive. So rather than writing “Congratulations on the promotion! Let’s get coffee and talk. Love you!” the emojifier can automatically turn this into “Congratulations on the promotion!. 👍 Let’s get coffee and talk. ☕️ Love you! ❤️”

Code

music_generation

Jazz Music Generation

A sequence model (LTSM) can be used to generate musical values, which are then post-processed into midi music.

Code

dinos_generation

Generate New Dinosaur Names

We have collected a list of all the dinosaur names to create new dinosaur names. We will build a character-level language model to generate new names. Our algorithm will learn the different name patterns and randomly generate new names.

Code

style-transfer

Neural Style Transfer

Neural Style Transfer (NST) is one of the most fun techniques in deep learning. As seen below, it merges two images, namely, a “content” image (C) and a “style” image (S), to create a “generated” image (G). The generated image G combines the “content” of the image C with the “style” of image S.

Code

face-recognition

Face Recognition

Face Verification - “is this the claimed person?” and Face Recognition - “who is this person?”. By comparing two such vectors, you can then determine if two pictures are of the same person using triplet loss function.

Code

car-detection

Autonomous driving - Car detection

We use object detection using the very powerful YOLO model on a car detection dataset.

Code

sign-language-resnet

Sign language classification

We use a Resnet50 to perform the sign language classification.

Code1

Code2

cnn-cat-classifier

Deep Neural Network for Image Classification

We use the Convolutional Neural Network (CNN) for cat classification.

Code

lg-cat-classifier

Logistic Regression with Neural Network

We build a logistic regression classifier to recognize cats.

Code

suco-SLIC suco-SLIC

Parallel SLIC superpixels

Parallelization of SLIC superpixel Algorithm.

Code

suco-SLIC

Parallel K-means

Parallelization of K-Means Clustering Algorithm.

Code

place-segmentation-char-recognition

License Plate Detection

The automatic vehicle license recognition system (ALPR) has four main modules: 1) Image pre-processing, 2) Detection/location of the license plate, 3) Characters segmentation, and 4) Character recognition.

Code

Portfolio

Web Design

Web Design

portfolio

Coming Soon

portfolio

Coming Soon

portfolio

Coming Soon

WordPress

Wordpress

WordPress

Wordpress

Contact Me

Workspace

Campinas, São Paulo, Brazil