mustafacanayter's Introduction

Data scientist with a background in linguistics who combines computational linguistics and data analysis to draw out and communicate insights. Proficient in Python, SQL, and regular expressions for data handling and analysis. Experienced with machine learning models, data visualization in Tableau and JavaScript, and research coordination.

Highlights of Work on Data Science

  • MADAIN - Mole Analysis with Deep Adam-Optimized Inception Network: We developed a convolutional neural network (CNN) based on the InceptionV3 architecture and trained with the Adam optimizer to classify skin lesions into one of seven categories, prioritizing recall for the cancerous classes to minimize false negatives. We benchmarked multiple CNN architectures and optimizers and ran extensive tests that varied the number of epochs, applied custom weight schemes, and compared multiclass and binary classifiers. The resulting model is integrated into a web app hosted on GitHub Pages. The dataset, sourced from Kaggle, contains 10,015 images. Class imbalance remains a challenge, so ongoing work includes fine-tuning with a larger number of neurons, inverse-proportional weighting, and experimental training on augmented images to improve classification accuracy and recall, particularly for underrepresented classes.
  • Effects of Climate Variability on Wine Production Metrics: We built a Pandas-based dataset that combines global wine production data with historical temperature records. We used Pandas extensively for data cleaning and integration to keep the dataset consistent, applied statistical tools from the SciPy library to check the data's integrity and accuracy, and created visualizations with Matplotlib and Seaborn to communicate the findings. The project draws on Python and its data science tooling to examine how climate variability affects wine production metrics.
  • Geospatial Visualization of Volcanic Activity: This project involved creating a dynamic, interactive web-based platform to visualize volcanic activity across the globe. A PostgreSQL database was developed to store and manage the refined data. The platform uses the interactive mapping capabilities of Folium and ipyleaflet to present geospatial and seismological data. Interactive elements written in JavaScript let users navigate between the data visualizations, and an MP4 video plays in the background. The visualizations highlight the Volcanic Explosivity Index (VEI) and the human and economic impacts of volcanic events, making the platform both an educational and an analytical resource for understanding the significance of volcanoes over time.
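
The inverse-proportional weighting mentioned for MADAIN can be illustrated with a minimal sketch. It assumes the common "balanced" heuristic, where each class weight is `n_samples / (n_classes * class_count)`; the exact scheme used in the project is not stated here, and the function name and toy labels below are hypothetical.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Return {class: weight}, with weight = n_samples / (n_classes * class_count).

    Assumption: the "balanced" heuristic; the project's own scheme may differ.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical toy labels for illustration only, not the project's dataset.
labels = ["nevus"] * 6 + ["melanoma"] * 3 + ["other"] * 1
weights = inverse_frequency_weights(labels)

# Rarer classes receive larger weights, so their errors cost more in training.
for cls, weight in sorted(weights.items(), key=lambda item: item[1]):
    print(f"{cls}: {weight:.3f}")
```

With integer class indices as keys, a dictionary of this shape is the kind of value Keras accepts for the `class_weight` argument of `Model.fit`, which is one way such weights could be applied when training a classifier on an imbalanced dataset.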

Highlights of Work on Linguistics

  • Towards Accounting for L2 Accent - The Case of Turkish Vowel Space: My methodology involved collecting, structuring, and analyzing human speech data. I used Praat for formant frequency tracking and Audacity for processing and editing audio files. An Excel-based database supported data cleaning, tagging, and organizing, giving the complex dataset a consistent structure. The study aims to shed light on L2 accent and contribute to the broader understanding of language acquisition and phonetic variation, with implications for linguistic theory and for practical applications in language teaching and speech technology.

I am excited about collaborating with fellow data scientists and linguists to tackle challenges and drive innovation. Let's connect on LinkedIn or explore my projects here on GitHub to work together and make an impact in data science and linguistics.
