Cinque TerreHello world! I am Shazan Jabbar, a Data Scientist at Alberta Machine Intelligence Institute (amii). I am also a part of the DoMiNO research team. I have a Master's in Computing Science from the Department of Computing Science at University of Alberta under the supervision of Prof. Osmar Zaiane. I also closely work with Prof. Alvaro Vargas.

I enjoy doing research in interdiscplinary projects and applying data mining methods to solve real world problems. During my Master's study I worked with DoMiNO team, developing spatial pattern mining techniques to discover interesting associations between air pollution and adverse health effects in Canada. I also developed data visualization tools to help our interdiscplinary collaborators and stakeholders at DoMiNO team to explore the patterns I discovered. Cinque Terre My Mater's thesis is mainly based on these work. Currently I continue my work as a full time researcher at amii and DoMiNO, extending some of my methods, building predictive models, writing papers and developing the data visualization tool VizAR.

I also love coding. During my Master's I also worked as a professional programmer for amii in an Information Retrieval project. Inbetween my Master's and Bachelor's I worked as a Software Engineer (Databases and ETL) for a Syllicon Valley based startup called Leapset (Pvt.) Ltd. (now known as Cake Corporation).

My current research interests are in association rule mining, graph/network analytics, deep learning, topic models, supervised learning models such as classification & regression analysis, information retrieval & text analytics, and spatial data mining.

My Erdos number is 3! (I co-authored with Dr. Russel Greiner. Dr. Greiner co-authored with Dr. Michael Molloy and Dr. Molloy co-authored with Dr. Paul Erdos). You can find more about me in my resume, LinkedIn or in Scholar! You can also contact me via my email: mohomedj[at]ualberta[dot]ca.


Education

Cinque Terre

University of Alberta, Edmonton, Canada

Master of Science in Computing Science, GPA 4.0/4.0

Thesis: Discovering Spatial Patterns using Statistically Significant Dependencies [Link]
Advisor: Prof. Osmar R. Zaiane

Cinque Terre

University of Colombo, Colombo, Sri Lanka

Bachelor of Science in Computer Science, First Class Honors

Thesis: PageRank Based Core-Attachment Model to Detect Protein Complexes by Analysing Protein Networks [Link]
Advisor: Dr. Ruvan Weerasinghe


Experience

Cinque Terre

Data Scientist

Alberta Machine Intelligence Institute, Canada

Spatial Data Mining, Data Visualization, Machine Learning, Statistical Pattern Recognition, Information Retrieval, NLP

Cinque Terre

Graduate Research / Teaching Assistant

University of Alberta, Canada

Data Mining, Machine Learning

Cinque Terre

Software Engineer (Database / ETL)

Sysco Labs (A Sysco Company)

Database Management, Data Integration, Report Generation, ETL Job Designing


Research / Projects

Key words: Regression  Clustering Classification Association Rule Mining Graph Algorithms Graph Analytics Spatial Data Mining Communitiy Detection Co-location Patterns Conditional Random Feilds Contrast Sets Common Sets Data Visualization Mobile Virtual Reality
DoMiNO (Data Mining for Neonatal Outcome) [MSc Thesis]

This project aims to discover association patterns between air pollutant emissions and adverse health conditions in Canada. To address this goal we followed a spatial data mining approach and proposed a new technique to transform a spatial dataset into a transaction dataset to apply association analysis techniques more easily. We proposed two novel pattern discovery techniques to find spatial associations using statistical significance tests. One such proposed technique discovers a novel type of spatial patterns called spatial contrast sets, which aims to characterize a particular spatial group and contrast it from the others. The other type of pattern, spatial common sets aims to find association patterns which are common among many spatial groups. These patterns were able to discover many interesting associations between air pollutants and adverse health conditions in Canada successfully. Currently we are implementing a pattern visualization software suite, VizAR, to help practitioners and our collaborators to explore the patterns we discover.

Advisor: Prof. Osmar R. Zaiane

Cinque Terre

Cinque Terre

Golden Retriever (Question Retrieval System)

Proposed a community question retrieval system based on a weighted TF-IDF scheme, relevance heuristics and term expansions. Goal of this system is, when given a search query, to find similar questions in community forums. Compared to other well known methods such as LDA, LSI, Language Model based IR techniques, BM25 and Word2Vec based models our approach performs well in retrieving similar questions in community forums. This proposed model was placed among the finalists of IEEE International Conference on Helath Informatics Data Analytics Challenge in 2015

Advisor: Dr. Osmar R. Zaiane

Cinque Terre

OPECH - (Extracting Protein Clusters from Hierarchies)

Proposed an algorithm OPECH which builds a cluster hierarchy of proteins based on their topological distance in protein interaction networks and uses a semi-supervised schoring scheme in combination with FOSC framework to extract optimal set of clusters without defining a cut-off threshold.

Mentor: Prof. Joerg Sander

Cinque Terre

Software Energy Consumption Prediction

Developed machine learning models (i.e. regression models such as linear regression, svm regression, etc.) to predict energy consumption of software applications based on the system calls they make. The motivation of the work was to encourage developing energy efficient software applications.

Mentors: Prof. Russel Greiner / Prof. Abram Hindle

Cinque Terre

RNA Secondary Structure Prediction

Proposed a method to predict the secondary structures of RNA molecules using Conditional Random Fields (CRF). CRF models are mainly used in NLP and Biology to predict the labels of structured sequences. In contrast to ordinary classifiers advanced CRF models such as Skip-Chain CRF can take the dependency information with neighboring objects into account when predicting labels.

Mentors: Prof. Russel Greiner / Dr. Hosna Jabbari

Cinque Terre

KFGNN Queries in Time-Dependent Road Networks

Intorudced a novel alternate form of KNN queries, KFGNN (K-Fastest Group Nearest Neighbors), to query for k-set of moving objects in a road network which can reach a static query point fastest. Proposed algorithms to query KFGNN in static and time-dependent road networks. Proposed algorithms can gurantee optimal solution while achieving a 100x CPU permorance increase compared to a baseline approach.

Mentors: Prof. Mario Nascimento

Cinque Terre

Protein Interaction Network Analysis [Honors thesis]

Developed a community detection algorithm using PageRank network centrality measure to find key proteins in protein interaction networks and use them to find communities or clusters of proteins which show homogeneous behavriours. The proposed method outperforms others in recall rate while maintaining a competitive precision rate. [in media]

Advisor: Dr. Ruvan Weerasinghe

Cinque Terre

Virtual Eye

Project “Virtual Eye” is a platform which consists of a smart mobile client providing cost efficient and ubiquitous smart interaction with virtual environments deployed in a remote server. [in media]

Mentors: Dr. Chamath Keppitiyagama / Prof. Nihala Kodikara

Cinque Terre


Publications

My research works which have been published in peer-reviewed conferences and journals.

  1. Colin Bellinger, M. Shazan Mohomed Jabbar, Osmar R. Zaiane, Alvaro Osornio-Vargas, "Applications of Data Mining and Machine Learning in Air Pollution Epidemiology", Public Health, BMC, 2017 [pdf]
  2. M. Shazan Mohomed Jabbar, Colin Bellinger, Osmar R. Zaiane, Alvaro Osornio-Vargas, "Aggregated Spatial Transactions for the Discovery of Statistically Significant Co-location Patterns with Dependency Rules", International Journal on Data Science and Analytics, Springer, 2017 [pdf]
  3. M. Shazan Mohomed Jabbar, Osmar R. Zaiane, Alvaro Osornio-Vargas, "Discovering Spatial Contrast and Common Sets with Statistically Significant Co-location Patterns", The 32nd ACM Symposium on Applied Computing (Data Mining Track), Marrakesh, Morocco, April 3-7, ACM, 2017 [pdf]
  4. Hamman W. Samuel, Mi-Young Kim, Sankalp Prabhakar, M. Shazan Mohomed Jabbar, Osmar R. Zaiane, "Community Question Retrieval in Health Forums", In Proceedings of the IEEE International Conference on Biomedical and Health Informatics (BHI), Orlando, FL, USA, February 16-19 2017 [pdf]
  5. M. Shazan Mohomed Jabbar, Osmar R. Zaiane, "Learning Statistically Significant Contrast Sets", 29th Canadian Conference on Artificial Intelligence, Victoria, Canada, May 31-June 3, Springer, 2016 [pdf]
  6. Jundong Li, Aibek Adilmagambetov, M. Shazan Mohomed Jabbar, Osmar R. Zaiane, Alvaro Osornio-Vargas, Osnat Wine, "On Discovering Co-Location Patterns in Datasets: A Case Study of Pollutants and Child Cancers", in GeoInformatica, Springer, 2016 [pdf]
  7. Shaiful A. Chowdhury, Luke N. Kumar, Md Toukir Imam, M. Shazan Mohomed Jabbar, Varun Sapra, Karan Aggarwal, Abram Hindle, and Russell Greiner, "A system-call based model of software energy consumption without hardware instrumentation", in the sixth International Green Computing and Sustainable Computing Conference, IEEE, 2015 [pdf]
  8. Hamman W. Samuel, Mi-Young Kim, Sankalp Prabhakar, M. Shazan Mohomed Jabbar, "Golden Retriever: Question Retrieval System", In Proceedings of the IEEE International Conference on Healthcare Informatics (ICHI), Dallas, Texas, USA, Oct. 2015 [pdf]
  9. Warunika Ranaweera, Shazan Jabbar, Ruwan Wickramarachchi, Maheshya Weerasinghe, Naduni Gunathilake, Chamath Keppitiyagama, Damitha Sandaruwan, Prabath Samarasinghe, "A lightweight approach to simulate a 2D radar coverage for virtual maritime environments", in the 8th International Conference on Computer Science & Education, IEEE, 2013 [pdf]
  10. Warunika Ranaweera, Ruwan Wickramarachchi, Shazan Jabbar, Maheshya Weerasinghe, Naduni Gunathilake, Chamath Keppitiyagama, Damitha Sandaruwan, Prabath Samarasinghe, "Virtual Eye: A sensor based mobile viewer to aid collaborative decision making in virtual environments," in the International Conference on Advances in ICT for Emerging Regions, IEEE, 2012 [pdf]