training-material/topics/imaging/tutorials/astronomy-source-extractor/tutorial.md at 87ee61ecfd1bb4e2019a5db9013e8cdd37821eb0 · galaxyproject/training-material

layout

tutorial_hands_on

title

Source extractor on DESI Legacy Surveys sky images

questions

How do I detect luminous sources from a dark background?

What are the required inputs and their formats?

How can I easily get sky images?

How can detections be improved?

How can I use the extracted source properties?

How can I get the seed image for the Voronoi segmentation tutorial?

objectives

How to perform luminous source extraction in Galaxy.

How to identify objects.

How to analyse sky images in Galaxy.

How to create a simple segmentation mask.

How to visualize the detected sources.

time_estimation

key_points

Source Extractor is a well-known astronomy library for detecting luminous sources in sky images.

This tutorial shows how to analyse image data for object detection and showcases how an astronomy software tool can be applied to data from several different domains.

requirements

type

topic_name

tutorials

internal

imaging

imaging-introduction

contributions

authorship

funding

Andrei-EPFL

oscars

fiesta

eurosciencegateway

Input Requirements

The source-extractor tool accepts a single image file as input, with the option to provide a mask and/or a filter. Typically, for astronomy, a sky image contains luminous sources. In addition, the tool accepts several parameters related to background estimation and source detection, which are set to the suggested default values. A subset of them is described in the subsection below.

Image:

Preferably: light sources on a dark background.
Format: a single-channel 2D array stored as .tiff or .fits (FITS is a widely used format in the astronomy community).

Mask (Optional):

Masks regions affected by bright sources (e.g. stars) to improve background estimation.
Pixels with value > maskthresh or boolean True are masked.

Format: a single-channel 2D array stored as .tiff or .fits.

Checking the metadata of an image

Tip 1: Use {% tool Show image info %} to inspect .tiff metadata. Required:

RGB = false (1)
Interleaved = false
SizeZ = 1
SizeT = 1
SizeC = 1

Tip 2: Use {% tool astropy fitsinfo %} to check .fits metadata. Required: Dimensions (N, M) , where N and M are pixel dimensions in 2D. {: .comment}
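The same shape requirement can also be checked locally once the image has been loaded into an array. The sketch below is only an illustration: the helper name `check_single_channel_2d` is hypothetical and not part of the tool, and reading the file itself (for instance with astropy or tifffile) is left out.

```python
import numpy as np

def check_single_channel_2d(array):
    """Return a list of problems; an empty list means the shape looks fine."""
    array = np.asarray(array)
    problems = []
    # The tool expects a single-channel 2D array with dimensions (N, M)
    if array.ndim != 2:
        problems.append(f"expected 2 dimensions (N, M), got shape {array.shape}")
    # Pixel values should be numbers
    if not np.issubdtype(array.dtype, np.number):
        problems.append(f"expected a numeric dtype, got {array.dtype}")
    return problems

good = np.zeros((360, 360))      # single-channel 2D image
bad = np.zeros((360, 360, 3))    # e.g. an image with three colour channels
print(check_single_channel_2d(good))
print(check_single_channel_2d(bad))
```

An empty list for `good` and one reported problem for `bad` indicate that only the first array has the layout described above.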

Filter Kernel (Optional): The filter kernel is used to smooth the input image, which can enhance the detection of faint and extended sources. However, in crowded fields, filtering may reduce performance by blending nearby objects.

If Filter Case is set to none, no filtering is applied.
If Filter Case is default, a built-in smoothing kernel is used:

1 2 1
2 4 2
1 2 1

If Filter Case is file, you must provide a custom 2D array stored as plain text file, that contains whitespace-separated values.

Checking the format of a filter file

You can check on your computer whether the filter file has the correct format by reading it with:

import numpy as np
kernel = np.loadtxt("filter.txt")

since this is the way the tool's back-end implementation loads the file. {: .comment}
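As a minimal sketch of the expected file layout, the snippet below writes the built-in 3 x 3 kernel to a whitespace-separated text file and reads it back with `np.loadtxt`. The file name `filter.txt` and the temporary directory are illustrative choices, not requirements of the tool.

```python
import os
import tempfile
import numpy as np

# The built-in smoothing kernel used when Filter Case is "default"
default_kernel = np.array([[1, 2, 1],
                           [2, 4, 2],
                           [1, 2, 1]], dtype=float)

with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "filter.txt")
    # np.savetxt separates values with a single space by default
    np.savetxt(path, default_kernel, fmt="%g")
    # Read the file back the same way the tool is described to load it
    kernel = np.loadtxt(path)

print(kernel.shape)
```

If `np.loadtxt` returns a 2D array with the values you expect, the file should be usable with the file option of Filter Case.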

Parameters for Background Estimation and Thresholding

In this subsection, we describe a subset of the tool's parameters that you can change.

Before source detection, the tool estimates the image background. This is done by dividing the image into a grid of boxes, each with a default size of:

bw = 64  # box width in pixels
bh = 64  # box height in pixels

Within each box, the pixel histogram is filtered to remove outliers, and the background level is estimated using a mode approximation based on the median and mean of the remaining pixel values. While 64 is the default value in the SEP package, the original paper suggests that, for most images, a value between 32 and 128 pixels should work fine.
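The box-wise estimate described above can be sketched with NumPy alone. This is a simplified illustration, not the tool's implementation: the function names are hypothetical, the 3-sigma clipping around the median is an assumed outlier filter, and the mode approximation 2.5 × median − 1.5 × mean is the form commonly quoted for Source Extractor. Interpolation between boxes and the handling of strongly skewed histograms are omitted.

```python
import numpy as np

def estimate_box_background(box, n_sigma=3.0, max_iter=10):
    """Clip outliers, then approximate the mode from the median and mean."""
    values = np.asarray(box, dtype=float).ravel()
    for _ in range(max_iter):
        median, std = np.median(values), values.std()
        keep = np.abs(values - median) <= n_sigma * std
        if keep.all():
            break
        values = values[keep]
    # Assumed mode approximation: 2.5 * median - 1.5 * mean
    return 2.5 * np.median(values) - 1.5 * values.mean()

def background_grid(image, bw=64, bh=64):
    """Estimate one background level per box of bh x bw pixels."""
    ny, nx = image.shape
    return np.array([[estimate_box_background(image[r:r + bh, c:c + bw])
                      for c in range(0, nx, bw)]
                     for r in range(0, ny, bh)])

# Synthetic test image: flat background of 100 with noise and one bright blob
rng = np.random.default_rng(0)
image = rng.normal(loc=100.0, scale=5.0, size=(128, 192))
image[30:34, 40:44] += 500.0

grid = background_grid(image)
print(grid.shape)
print(np.round(grid, 1))
```

Because the bright blob is clipped before the mode is approximated, the estimate for its box stays close to the true background level. Smaller boxes follow background variations more closely but are more easily biased by extended sources.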

After background estimation, the tool identifies groups of pixels that exceed a defined brightness threshold. The detection criteria below help distinguish real luminous sources from random fluctuations in the background.

Detection Criteria:

Minimum Area: The number of connected pixels required to consider something a source.

minarea = 5 # default

Threshold: The value of the pixel (j, i) must exceed:

thresh * err[j,i]

where:

thresh = 1.5 # default

The interpretation of err[j,i] depends on the err_option parameter:

err_option = 'float_globalrms'  # Use global RMS (i.e. root mean square) of the background (default)
err_option = 'array_rms'        # Use a pixel-wise RMS array of the background
err_option = 'none'             # Use 'thresh' as an absolute threshold

It is advisable to adapt the error estimation to the studied image: e.g. if the background is reasonably uniform, using a global value should be sufficient. In contrast, if the background changes drastically in different regions of the image, a pixel-wise RMS would be preferred.
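The three threshold interpretations can be compared on a small array. The sketch below only reproduces the per-pixel comparison `value > thresh * err`; the helper name `detection_mask` and the sample values are made up for illustration, and the grouping of connected pixels (minarea) is not included.

```python
import numpy as np

def detection_mask(data_sub, thresh=1.5, err=None):
    """Return True where a background-subtracted pixel exceeds the threshold."""
    if err is None:
        # err_option = 'none': thresh is an absolute pixel value
        return data_sub > thresh
    # err may be one global RMS value or a per-pixel RMS array
    return data_sub > thresh * err

data_sub = np.array([[0.0, 2.0, 10.0],
                     [1.0, 4.0, 0.5]])

global_rms = 2.0
rms_array = np.array([[1.0, 1.0, 10.0],
                      [1.0, 1.0, 1.0]])

mask_global = detection_mask(data_sub, err=global_rms)
mask_array = detection_mask(data_sub, err=rms_array)
mask_absolute = detection_mask(data_sub)

print(mask_global)
print(mask_array)
print(mask_absolute)
```

In this toy example the bright pixel in the top-right corner passes the global threshold but fails the pixel-wise one, because the local RMS there is high. This is the situation in which a pixel-wise RMS is preferable.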

Getting data from DESI Legacy Surveys

Data Acquisition

Create a new history for this tutorial. You can rename the default unnamed history.

{% snippet faqs/galaxy/histories_create_new.md %}

Run the {% tool DESI Legacy Survey %} tool.

Important: Choose the Data Product Image.

The default values are used for this tutorial. The history now contains the .fits image file that is used as input for the source-extractor tool. {: .hands_on}

Running the Source-Extractor Tool

Once you’ve selected the source-extractor tool, choose the input file named: DESI Legacy Survey -> Image fits. After the tool has finished running, several output images and data products will be available:

The background subtracted image with detected sources highlighted by red ellipses
The estimated background
The background RMS
The segmentation map
A catalog table listing the detected sources along with measured parameters such as flux (i.e. sum of member pixels), position, size, and shape
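To make the link between the segmentation map and the catalog concrete, the sketch below recomputes a flux (sum of member pixels) and a flux-weighted position for each label of a toy segmentation map. It assumes that label 0 marks the background and that sources are numbered 1, 2, and so on; the arrays are invented, and the tool's own measurements may differ in detail.

```python
import numpy as np

# Toy background-subtracted image and matching segmentation map
data = np.array([[0.1, 3.0, 2.0, 0.0],
                 [0.0, 1.0, 0.0, 0.2],
                 [0.0, 0.0, 4.0, 4.0]])
segmap = np.array([[0, 1, 1, 0],
                   [0, 1, 0, 0],
                   [0, 0, 2, 2]])

labels = segmap.ravel()
yy, xx = np.indices(data.shape)   # row (y) and column (x) index of every pixel

# Sum per label; [1:] drops label 0, assumed to be the background
flux = np.bincount(labels, weights=data.ravel())[1:]
npix = np.bincount(labels)[1:]

# Flux-weighted centroid of each source
x = np.bincount(labels, weights=(data * xx).ravel())[1:] / flux
y = np.bincount(labels, weights=(data * yy).ravel())[1:] / flux

for i in range(len(flux)):
    print(f"source {i + 1}: flux={flux[i]:.2f}, npix={npix[i]}, x={x[i]:.2f}, y={y[i]:.2f}")
```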

Example Outputs:

The original image is published by Legacy Surveys / D. Lang (Perimeter Institute). The Legacy Surveys are described in {% cite legacy-survey-astronomy %}.

Ellipse drawing

The tool already outputs an image with ellipses drawn around the detected objects. Nevertheless, if you want to create the figure yourself, you can use the table of detected sources returned by the tool (called objects below) in the following way:

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.patches import Ellipse

# 'objects' is the table of detected sources returned by the tool
fig, ax = plt.subplots()
for i in range(len(objects)):
    # Convert theta from radians to degrees for matplotlib
    e = Ellipse(xy=(objects['x'][i], objects['y'][i]),
                width=6*objects['a'][i],
                height=6*objects['b'][i],
                angle=objects['theta'][i] * 180. / np.pi)
    e.set_facecolor('none')
    e.set_edgecolor('red')
    ax.add_artist(e)

{: .hands_on}

Using a Mask to Improve Source Detection

Bright stars can skew background estimation and obscure nearby faint sources. In the previous output, some central sources were missed due to bright star interference.

A simple mask can help. Here's an example:

This mask can be easily created with:

import numpy as np
import tifffile

# Empty 360 x 360 mask (0 = not masked)
mask = np.zeros((360, 360))
# Set the regions to be masked to 1
mask[270:325, :] = 1
mask[239:, :200] = 1
tifffile.imwrite("mask.tiff", mask)

Upload the mask to Galaxy, select it in the source-extractor tool, and re-run.
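To illustrate why masking changes the background estimate, the sketch below compares the mean of a synthetic image with and without the masked pixels. The image, the bright band and the value `maskthresh = 0.0` are assumptions made for this example; only the rule that pixels with value > maskthresh are masked comes from the tool description.

```python
import numpy as np

# Synthetic 360 x 360 image: background of 10 plus noise
rng = np.random.default_rng(42)
image = rng.normal(loc=10.0, scale=1.0, size=(360, 360))
image[270:325, :] += 200.0     # synthetic band contaminated by a bright star

# Mask covering the contaminated band
mask = np.zeros((360, 360))
mask[270:325, :] = 1

maskthresh = 0.0               # assumed threshold for this example
masked = mask > maskthresh     # True where a pixel is ignored

mean_all = image.mean()
mean_unmasked = image[~masked].mean()

print(f"masked pixels: {masked.sum()}")
print(f"mean of all pixels: {mean_all:.2f}")
print(f"mean of unmasked pixels: {mean_unmasked:.2f}")
```

The mean over all pixels is pulled far above the true background by the bright band, while the mean over the unmasked pixels stays close to it.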

Improved Outputs:

You can observe that the central sources are now detected and that the dynamic range of the background has decreased thanks to the mask.

An important output of this tool is the segmentation map of the detected sources:

This map can be used as the seed image required by the [Voronoi segmentation tutorial]({% link topics/imaging/tutorials/voronoi-segmentation/tutorial.md %}). In this case, you can observe that the two bright stars still have a significant effect on the source detection. To improve the results, you can try better masking, using the array RMS as a relative error in thresholding, or different background mesh sizes.
