The aim of this project was to detect and count the number of stars in the Pillars of Creation image captured by the James Webb Space Telescope (JWST) on October 21st, 2022. The methods and algorithms used for this procedure are covered in the documentation of the project.
- Arya Hassibi (LinkedIn, GitHub)
- Milad Bafarassat (LinkedIn, GitHub)
- Rasul Barak (LinkedIn, GitHub)
- Kourosh Sharifi (LinkedIn, GitHub)
At the time, all 4 members were sophomore computer science students (3rd semester) at Sabancı University, and completed this project for their PROJ 201 course.
This project was supervised by Prof. Ersin Göğüş from Sabancı University.
The list of libraries employed for this project:
The majority of the code is in the Notebook folder. Once cloned or downloaded, simply run each cell in a sequential manner. Make sure that the .fits
data sets are in the same directory as the Jupyter notebooks that you wish to run. They are:
For more information regarding this project, feel free to read the project report. The rest of this README
file will be the content of the report.
The goal of this project was to use computational methods to analyze and process the quantitative data provided by the James Webb Space Telescope (JWST). Specifically, the team worked with images taken by the Near-Infrared Camera (NIRCam) instrument of the JWST. These images were unique due to the use of advanced electron absorption technology, which enabled the team to capture highly detailed infrared wavelength data with unparalleled precision. The project aimed to detect and count the stars in a specific region of the Milky Way Galaxy, called the Pillars of Creation. To accomplish this, various image processing techniques were employed. The rest of the report is dedicated to the explanation of each phase of the project in more detail.
Figure: A Visual Representation of the Dataset using the m16-Bone Colormap
The James Webb Space Telescope (JWST) is a space observatory that is optimized for infrared wavelengths. This optimization for infrared wavelengths will enable scientists to go back further in time and see red-shifted light as well as inside nebulas, and other objects that are harder to observe with visible light spectrum. This information sheds light on the universe’s past, present, and future. To tackle this project, a team of four sophomore computer science students have set out to find the answer to the question below:
“How to count the number of stars in a specific region of an image taken by James Webb Space Telescope (JWST), and to categorize them based on their mass and temperature, in order to compare these findings with other academic literature for different galaxies”
To be able to record and measure these wavelengths, Webb uses the previously mentioned instruments in specific conditions. All instruments are kept cold at temperatures below 54 Kelvin, with MIRI being kept at only 7K, to reduce the unwanted noise from the instruments as much as possible. Thereafter, the light gets reflected back to the secondary mirror by the 18 primary mirrors that are precisely positioned relative to each other, which are coated with a layer of gold to maximize the reflection of infrared light. The primary goal of the JWST is to study galaxy, star, and planet formations. The JWST has a total of 4 instruments in the ISIM. Each of these instruments are specialized for a specific set of tasks. For instance, NIRCam is Webb’s primary camera, which covers the infrared wavelength range from 0.6 - 5.0 micrometers. NIRCam’s data, and its analysis, are the primary subjects of this group project. NIRCam uses its near-IR HgCdTe detector to start its electron sensing process. Meanwhile, a semiconductor absorbs an incoming photon, which generates mobile electron hole pairs. These electrons travel under the influence of pre-built and applied electric fields until they find their way to where they can be collected.
A wide variety of tools were employed to develop this project. Namely:
- Python 3.8: As the main programming language
- Astropy: For acquiring, reading and employing FIST data
- NumPy: For applying mathematical function to the data
- Matplotlib: For visualizing the data using graphs and charts
- SciPy: For optimizing part of the code concerned with computation
- Mikulski Archive for Space Telescopes: To acquire FITS data
- OpenCV: For image manipulation, algorithm implementation and image generation
- ScikitLearn: For linear regression
The algorithms utilized were many, including:
- Canny Edge Detection
- Median Blurring
- Otsu Thresholding
More information can be found in the upcoming pages, regarding algorithms used, a broad description of each, their use-case, why they were used, and what was obtained by implementing them in the code. As for a brief description on the whole process:
- The required data was collected from an archive.
- The downloaded data was processed.
- A number of statistics were obtained after the modifications.
- Conclusions were made after visualizing and reasoning the data
The first step was to obtain the information about the m16 region (also known as the Pillars of Creation) via Flexible Image Transport System (FITS) from the Mikulski Archive for Space Telescopes (MAST). By specifying the desired space telescope as James Webb, the portal then displayed the necessary parameters to choose and filter the data.
As for the camera, the Near Infrared Camera (NIRCam) was chosen as the observer. Thereafter, the file was downloaded, unzipped and then stored with the Jupyter notebook at hand. For the sake of learning, some sample runs were conducted to understand the content of the dataset, and how to visualize the data.
After that, the team began to construct the first part of the program using the Astropy library. This library is a collection of software packages for astronomy and astrophysics. It provides a wide range of tools and functions for working with astronomical data, including functions for reading and writing data files, handling celestial coordinates, performing photometry and spectroscopy, and much more.
- Gaussian Distribution
- Median Blur
- Otsu Method
Figure: A Visual Representation of the Data in the Middle of the Pipeline
- Canny Edge Detection
- Rectangle Detection (OpenCV)
- Plotting Intensity per Pixel
Figure: The Log-Norm Interpolated Representation of the Data
In the end, we were able to detect about 10,000 (ten thousand) stars with close to 93% accuracy.
For more information regarding the exact calculations and methods used, please refer to the Notebooks folder of this repository. There, you can find in-depth explanation for all the tools used, and their corresponding results.