Skip to content
View AvaAvarai's full-sized avatar
☃️
☃️

Highlights

  • Pro

Organizations

@CWU-VKD-LAB

Block or report AvaAvarai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AvaAvarai/README.md

"I believe in intuition and inspiration. Imagination is more important than knowledge. For knowledge is limited, whereas imagination embraces the entire world, stimulating progress, giving birth to evolution. It is, strictly speaking, a real factor in scientific research.”
Albert Einstein 1929, quoted by Dr. Boris Kovalerchuk in Visual Knowledge Discovery and Machine Learning 2018.

Alice Williams

GitRoll Profile Badge

Double Bachelor's of Science senior student majoring in Computer Science and Applied Mathematics. Active research assistant at the Visual Knowledge Discovery and Imaging Lab of Central Washington University. Former system administrator, full-stack software developer, and web-developer. Next, graduate studies in Computational Science focusing on visual machine learning. With a goal of building trusted expert artificial intelligence systems.

Solving problems at the intersection of machine learning and data visualization by blending machine computation with human cognition. Computing with data visualizations directly, instead of just making visuals. While integrating visual approaches with conventional machine learning methods.

Oxford Dictionary, "Intelligence", noun, 1. the ability to acquire and apply knowledge and skills. Current research goals: Aquire knowledge by visual knowledge discovery with multidimensional lossless visualizations, then apply aquired knowledge to build skills that solve difficult tasks more interpretably and accurately.

If you find my work of interest or benefit, then please consider a supporting gesture through "Buy Me A Coffee", thank you.

Academic Research

  1. "Boosting of Classification Models with Human-in-the-Loop Computational Visual Knowledge Discovery"
  • Authors: Alice Williams & Dr. Boris Kovalerchuk.
  • Status: Preprint under review by ArXiv. Presenting at HCII2025.
  • Contributions: Introduces the Computational and Interactive Visual Learning (CIVL) framework, integrating human-in-the-loop visualization and computational techniques to improve classification model boosting, particularly in class-overlapping areas, by separating simple and complex cases, leveraging interpretable decision rules, and enhancing classifier accuracy and explainability through lossless multidimensional visualizations like Parallel Coordinates and General Line Coordinates (GLCs). This improves classification model boosting, particularly in class-overlapping areas, by separating simple and complex cases, leveraging interpretable decision rules, and enhancing classifier accuracy and explainability through lossless multidimensional visualizations achieving 100% accuracy on the Fisher Iris data with significantly fewer parameters than AdaBoost, which won the 2003 Gödel Prize, reducing the risk of overfitting by maintaining a high case-to-parameter ratio while offering enhanced interpretability.
  • Topics: Visual Knowledge Discovery, Classification Model Boosting, Interpretable Machine Learning, Human-in-the-Loop, Lossless Visualization
  • Developed software: Java_Tabular_Vis_Toolkit, HyperblockParser, InLineCoordinatesCoefficientSolver
  1. "Synthetic Data Generation and Automated Multidimensional Data Labeling for AI/ML in General and Circular Coordinates"
  • Authors: Alice Williams & Dr. Boris Kovalerchuk.
  • Status: Published in IEEE proceedings and presented at IV2024 in track of Artificial Intelligence and Visual Knowledge Discovery, earned the 'Best Paper Award'.
  • Contributions: Proposed an algorithm and implemented interactive software for labeled synthetic data generation using former General Line Coordinate methods and newly developed Circular Coordinates both Static and Dynamic with multi-class visualization and parameterized class discrimination. Addressed data balancing, demonstrated deficiencies of popular SMOTE (Synthetic Minority Oversampling Technique), and showed improvements to classifier performance across 14 standard machine learning classifiers.
  • Topics: Synthetic Data Generation, Automated Data Labeling, General Line Coordinates, Circular Coordinates, Parallel Coordinates, Shifted Paired Coordinates, Tabular AI/ML Data, Multidimensional Data Visualization, Visual Knowledge Discovery.
  • Developed software: Dynamic_Coordinates_Vis_System

Chronologically ordered from current at top to first at bottom.

Research Interests

Research Interest Description
Visual Machine Learning Building machine learning models with visual representations for better interpretability.
Visual Knowledge Discovery Extracting patterns visually from multidimensional data to solve a task like machine learning.
Data Mining Extracting valuable information from large sample count and dimensionality of data.
Multidimensional Data Visualization Representing multidimensional data in representations advantageous for the task to solve.
Natural Language Processing Analyzing and generating human language for human-computer interaction or computation.
Automated Decision-Making Developing and analyzing automated decision-making systems for limited human interaction.
Human-Computer Interaction Designing effective human-computer interaction using visually interactive interfaces.

In no preferential order.

Technical Experience

Role Organization Focus/Description
Startup Founder and Consultant (Active) AI Education Technology Stealth Startup LLCs Small business founding, startup product development, market research, and additionally, I provide AI and ML consulting.
Research Assistant (Active) CWU Visual Knowledge Discovery and Imaging Lab Researching Visual Knowledge Discovery and Machine Learning data classification, synthesization, and model interpretability.
Teaching Assistant CWU Computer Science and Mathematics Departments Assisted in Computer Architecture, Algorithm Analysis, Mathematical Computing, and as an undergraduate CS Tutor.
Web Developer Freelance Sole-Proprietor Business Supported individuals and small businesses in developing, updating, and maintaining web applications and websites.
Full-Stack Software Developer Contract Projects and Freelance Business Delivered data processing automation and business solutions for individuals, small businesses, and an enterprise contract.
Linux Game Server Administrator Game Server Distributor and Management LLCs Responsible for server sharding, setup, protection, updates, backup solutions, user registration, and software support.

Let's Connect

I'm always open to discussing new projects or opportunities. Feel free to reach out or connect with me!

Pinned Loading

  1. Dynamic_Coordinates_Vis_System Dynamic_Coordinates_Vis_System Public

    Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.

    Python 5 2

  2. Java_Tabular_Vis_Toolkit Java_Tabular_Vis_Toolkit Public

    Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorith…

    Java 2 1

  3. ML_Classifier_Comparison_Tool ML_Classifier_Comparison_Tool Public

    Machine Learning classifier comparison GUI application. Choose 21 classifiers, evaluation data (optional for evaluation of synthetic data), hyperparameters, cross-validation splits, and rng seed; t…

    Python 1 1

  4. Hyperblock_Parser Hyperblock_Parser Public

    Parses conjunctive normal form hyperblock notation to parallel coordinate graph visualizations. Hyperblocks are an interpretable way to build machine learning models.

    Python 1 1

  5. DCGAN_Custom_Architecture_Builder_and_Image_Synthesizer DCGAN_Custom_Architecture_Builder_and_Image_Synthesizer Public

    DCGAN (Deep Convolutional Generative Adversarial Network) custom architecture builder and image synthesizer to specify the architecture of the generator and discriminator, visualize the models, tra…

    Python 2

  6. Local_Small_LM_Document_RAG Local_Small_LM_Document_RAG Public

    Local semantic sentence embedding Reader-Answerer model for Retrieval Augmented Generation (RAG) of cited question answering from .pdf, .md, & .docx files using small language models.

    Python 4