Previous attempts at creating images depicting cell- or nucleus-like objects have primarily been aimed at mimicking images acquired using fluorescence microscopy. Lehmussola et al. created a complex simulator, called SIMCEP, which is able to create populations of realistic two-dimensional (2D) nucleus populations 4. Svoboda et al. built on the concepts introduced in that article to create fully three-dimensional (3D) image data 5,10. Recently, they have further extended their work to also include time-lapse simulation 11. In common, for the work mentioned above is that they only simulate the nucleus appearance and did so using parametric methods for shape and texture generation. Zhao and Murphy 12 instead uses machine learning to generate the shape of the nucleus and the cytoplasm as well as the texture of the nucleus.

(Left) The three levels of texture details (Low frequency Gaussian noise, thresholded Brownian noise, simulated cytoskeleton shape). (Right) Example of a final cytoplasm result (for this example a different base texture than the one shown in Fig. 6 has been used).

The recent trend toward high-throughput screening in medical and biomedical research has led to a substantial increase of the amount of data produced. This development has rendered manual analysis of all the resulting data to no longer being a feasible approach. Therefore, computer-based evaluations using advanced image analysis have become an integral part of many research projects, for example, in 1. A good example of a field where research on the use of image analysis as a diagnostic tool has been ongoing for many years is that of Papanicolaou (Pap) test analysis. Attempts at the creation of automated systems have been made since 1950s, and the problem is still subject to active research 2.

where Aλ is the material's absorbance at a specific wavelength, λ, of incoming light, I0 is the intensity of the light before it passes through the sample, and I1 is the intensity of light that remains after passing through the sample. Essentially, for the Pap-smear application, the Beer–Lambert law relates the absorbance of the Pap-stain to its concentration

Bacilli are a fairly common occurrence in Pap-smear images, the most common kind being the Lactobacilli, or, as it is also known, Döderlein's vaginal bacillus 33. They appear as simple, single line structures, and always in clusters of varying size.

In this article, we present a synthetic image generation framework for simulating bright-field microscopy images of cervical cell populations. The simulation method accurately models object primitives as well as the characteristics and behavior of the measurement system. In our evaluation study, even experienced cytology professionals showed rather poor performance in deciding whether an image was synthetic or real when shown the images under realistic screening conditions. A simple test on nuclear texture features indicated that they came from similar distributions.

We will define speckles as point noise that is distributed in a uniform way across an image. The cause of a single speckle can vary, with common sources being dust, glass impurities, stain particles, or other small biological objects. Regardless of the source, however, they are very similar in their appearance and can thus be grouped together as a single type of simulation. The base speckle is created as a single pixel that is then extended one pixel in a random direction. Depending on the size of speckles that is required for a specific simulation this shape can then be made bigger using binary dilation 3.

For the simulation, each object type is given a specific absorbance value. These values can be directly based on measurements taken from source materials or selected based on a specific desired target appearance. In this project, we measured representative values from our cellular database.

The presented framework offers a flexible approach to image synthesis. Each block of the process is interchangeable depending on the requirements on the finished results. Furthermore, if needed, additional object types can be added with minimal effort using the methods and principles described in this article. As an example, the addition of other common cervical epithelial cell types, for example, parabasal cells or squamous superficial cells, can be achieved by creating small databases of primitives that can then be used as the basis for shape and texture generation. To illustrate this, a dataset of cells, expert classified to exhibit signs conforming to high grade intraepithelial lesions, were collected and added to the simulation pipeline. A resulting image can be seen in Figure 13. This flexibility indicates that the synthesis process could be adapted to mimic other types of cellular material commonly analyzed using bright-field microscopy, for example, lung or oral cavity smears.

The uneven illumination is simulated by calculating a distance transform, Id(x, y), initialized from the edge of the image. We scale the distance transform to [0,1] and add an exponent, Id(x, y)n. The background is initialized as a uniform image, H(x, y), with a predefined intensity, H(x, y) = b. The perturbed background illumination can then be written as

Another approach to evaluating the quality of the synthetic image generation algorithms, is to compare results from an automated image cytometry algorithm applied to synthetic images and real data with expert annotated ground truth 4. Other validation approaches have included comparing results for several image cytometry tools when used on synthetic images 4 or comparing scores for different image descriptive features from synthetic and real images 12.

As ground truth is always readily available, it is possible to do optimizations, evaluations, and validations on amounts of data not feasible when relying on manually analyzed images. Another benefit is the ability to control the characteristics of the synthetic data both concerning imaging properties such as uneven illumination and aberrations caused by the imaging system. This allows for controlled testing of an algorithm's robustness under very specific circumstances.

Image

The impulse response in the image plane is more difficult to approximate quantifiably. Generally, the PSF is smaller in the x-y dimension relative to the z-dimension,. By taking an image depicting a step function, that is, a sharp edge, and studying the first derivative of that image, it is possible to obtain an approximation of the PSF that lies close to reality. The measured PSF can then be compared to Gaussian kernels with different sigma to find the closest match. Using this approach, a suitable value for σx,y for the model system used in this project was found to be 0.9 pixels corresponding to around 0.22 microns.

To create an algorithm that is able to, for example, segment nuclei in an image, the developer is at some point going to need a validation dataset containing a representable selection of the material being analyzed, as well as a ground-truth segmentation for the objects of interest. This is used to measure the performance and robustness of developed algorithms. Acquiring this ground-truth data can, however, in many cases be a very difficult prospect. Aside from issues such as obscuring biological material, the image will have been subjected to aberrations linked to the modality of the acquisition technique 3. The process of dealing with these degenerative effects can often become very complicated. When working with images acquired from a microscope, one may have to deal with issues such as uneven illumination, detector noise, and compression artefacts 4,5. These errors are added to already existing aberrations related to the physical limitations of the optics 6.

Here, zi is a discrete random variable denoting the particular intensity level present in the image. The sum covers the range of all the image intensity levels (L). The entropy defines the amount of uncertainty in the measured data and is calculated as

Ring lights dark field are industrial LED lights for illuminating products in machine vision applications like surface defects on metal parts.

Data from more than 900 fields of view, acquired as focus-stacks from 82 specimens, were available for the development of the simulation method. The database contains specimen prepared using both the conventional method as well as the LBC method. Image acquisition was performed using an Olympus BX51 bright-field microscope equipped with a 40×, 0.95 NA lens and a Hamamatsu ORCA-05G 1.4 Mpx monochrome camera, giving a pixel size of 0.25 µm. The microscope light path was filtered using a 570-nm bandpass filter, a wavelength previously shown to maximize the contrast of nuclei in Pap-smears 14. The microscope was fitted with an E-662 Piezo server controller (Physik Instrumente GmbH & Co. KG, Karlsruhe, Germany). This allowed for z-axis step control with a 0.1 µm resolution during image acquisition.

The DC component, u0, depends only on the position of the shape, which in this setting is of no importance and can therefore be removed, u0 = 0. By normalizing the size of the shapes in the database, it is now possible to add a scale parameter to the generation of the new shapes. This gives us an increased control and makes it possible to, for example, link our shape size to a pixel resolution or add controlled random size variations to the generated shapes. A few examples of randomly generated nucleus and cytoplasm shapes can be seen in Figure 3.

For the time being, the method produces one diagnostic cell type (squamous intermediate) and four levels of debris. In the future, more cell types and other types of debris should be added. Also, other common types of distortion such as folding should be included to the cytoplasm model to add even more variety in the shapes that are produced. The greatest benefit with the use of simulated images is the availability of ground truth. Using simulated images reduces the dependency on manually generated ground-truth data, which has the drawback of being expensive and time consuming to produce. Well-designed synthetic images make it possible to reserve that valuable ground truth data for final control validation. For instance, first the endless supply of synthetic data can be used to form hypotheses about how varying amount of debris or inhomogeneous background illumination will affect the qualitative performance of a segmentation algorithm. Then brute force parameter tuning, over an arbitrary large parameter space, can point out optimal parameters, a plausible range of optimal parameters or relations and dependence between different optimal parameter settings from which the algorithm designer can gain insight and form hypotheses. Then finally, a rigorous and more data economical procedure can be used for fine tuning and cross-validation on expert annotated real data to estimate the performance of an algorithm on real life data.

20221124 — Software Installation for Pulsar Data Analysis · Ubuntu Linux (64-bit) · Contents: · Define some environmental variables · Install some ...

A flowchart depicting the simulation framework described in this article. The simulation process is divided into three phases. In the first phase shape primitives, phantoms, are created. These objects include cell nuclei, cell cytoplasms, bacilli, WBC, OOF objects, and speckles. Once the primitives are created they are distributed over the image using a weighted distribution model. Also, each object is given a depth coordinate. In the second phase image degradation similar tp, the one seen in bright-field microscopes is simulated. Finally, in phase 3, sensor noise is simulated according to the characteristics of a CCD sensor.

20231114 — LED TVs are a type of flat panel display that uses LED backlighting instead of CCFLs (cold cathode fluorescent lamps).

Advanced Illumination Machine Vision Lighting, NAICS code 335129 Advanced Illumination® (Ai) is a leading developer and manufacturer of LED lighting and ...

The process is initialized by randomly selecting an initial texture patch, B0, of predefined size from the sample texture, Ts, and placing it in a corner of the target texture,Tout. A boundary zone,, of width, wE, is defined for each patch. A random coordinate pair, (x, y), is generated in Ts and a patch candidate, B(x,y) is acquired. If the distance between the overlap regions,, is lower than a predefined distance threshold, dmax, the patch is accepted to be pasted to the generated texture. The distance metric used is usually the mean squared error. As a final step, the pixels in the overlap region need to be blended together to avoid sharp edges in the texture. This can be achieved using feathering 24 or a minimum error boundary cut 23. For our implementation, we use feathering, that is, pixel intensities are weighted relative to their distance to the edges of the patches.

The classical way to handle the ground-truth issue is to manually try to obtain ground-truth data with the help of one or, preferably, several experts. However, this approach is notoriously prone to introduce errors in itself, due to the limitations in reproducibility for human operators 7. Furthermore, to create a statistically sound ground-truth dataset, especially when working with biological data and its basically infinite variations, it is often necessary to manually analyze huge quantities of data. This can in many cases be an impossible task.

Phantom object generation refers to the creation of the primitives that constitute the main content of an image. In some cases, these objects can be quite simple point-like objects 17, whereas others entail the creation of full 3D representations 5. For some simulation frameworks, phantom generation includes not only the creation of single objects but also entire populations of object occurrences 4,18.

The final stage aims to emulate the performance of the device sensor and its conversion of incoming light into a digital representation. Sensors introduce Poisson noise 20 that can be seen with the naked eye. Furthermore, the A/D converter and amplification circuits introduce noticeable levels of noise 20. Other possible aberrations include dark current noise, fixed pattern noise, and blooming effects 5.

To finish the cytoplasm generation, three levels of details are added. Each of these are optional and can be excluded if sample specifications dictate it. The first level is a low frequency Gaussian noise that adds intensity variations to the texture. The second level is composed of thresholded Brownian noise, which is a correlated noise whose power spectrum decreases as a function of f 2. This noise appears as clouds of small flecks that are unevenly distributed over the cytoplasm, which corresponds to, for superficial squamous cells, fairly commonly seen keratin precursors. The final level of detail entails the simulation of a cytoskeleton structure. The skeleton is generated by first generating a random Voronoi diagram by randomly placing points in an image. The diagram is relaxed using Lloyd's algorithm 27 to get a more evenly spaced point distribution. A radial distance transform is generated from the edge of the image to create a downward slope from the middle of the image to the edges. Random vertices of the Voronoi diagram are then selected and a line is created by following the edges of the diagram by steepest descent according to the distance transform. A random chance of branching at each vertex results in a random tree structure that is very similar in appearance to cytoskeletons observed in reference images. In Figure 7, the three levels of details that can be added to the cytoplasm base texture have been illustrated as well as an example of a finished textured cytoplasm.

The creation process of leukocyte clusters. (Left) Shapes generated using parametric model. (Middle) Shape masks added to the final cluster. (Right) The final result with added Perlin noise.

Image

White blood cells (WBC), or leukocytes, are a typical indication of an inflammation of the cervix. They usually appear as dark elliptical objects, often clustered together in small groups spread over the image. In the case of more serious cases of inflammation, larger sheets of WBC can be found, often obscuring significant sections of a sample.

As has been previously stated, and illustrated in Figure 1, debris objects are a common problem when developing any kind of analysis algorithm. In the following sections, the types of debris that have been added to the simulation will be described in more detail.

where i is the unit imaginary number. This reduces our 2D data to at 1D problem. We can now calculate the discrete Fourier transform of z(k) as

The primary focus for this study was to try to imitate the appearance of samples produced using liquid-based cytology (LBC) 13. A description of the LBC method can be found in the reference data section below. This choice was related to LBC producing cleaner specimens. Regions with well distributed and mainly free-lying cells on conventional smears look similar to LBC:s and those are reasonably well represented by our simulator. By increasing the object distribution density, the generated images will become more similar to dense regions on smears. But, the conventional smears also contain highly dense regions with many layers of cells, mucus, and other obscuring structures. Simulating that appearance properly would have required significant additional modeling efforts without adding to the usefulness of the simulator, as there are hardly any image analysis algorithms that can extract useful information from such dense regions.

For outstanding vlogs, interviews, and influencer videos, the Angler BI-90W-RL 18-inch ring light will illuminate you with ultrabright, ...

Bright fieldmicroscope

Spot lights can vary in size and application, from small flashlights and landscape lights to large stadium lights.

As the squamous intermediate cell is the target for most studies related to Pap-smear analysis, it is of utmost importance that it is as accurately modeled as possible. The cells can, for modeling purposes, be divided into two main components, the cytoplasm and the nucleus.

where and constitute the addition of Poisson and Gaussian noise, respectively, (*) denotes a convolution operation, stands for the approximation of the PSF with a Gaussian kernel with sigma, is the background image generated by Eq. 10 and is a phantom image. The phantom image contains the set of Np phantom objects,. To simulate the depth of field, each object, pi, is assigned a z position, zi, using a median filtered Perlin noise image, scaled to encompass the desired depth in the image. The generation of the phantom image can then be written as a sum of phantom objects with appropriate amounts of out-of-focus blur,

Because of the availability of focus stacks as reference data, we have been able to quantifiably determine a suitable standard deviation for the Gaussian kernel related to the depth of focus. This was achieved by first studying a nucleus at the focus level at which it is in focus. This level was determined by summing the gradient magnitude values within the cell nuclei for all focus levels and choosing the level with the greatest sum. We can then compare each offset step in the focus stack to a Gaussian blurred version of the focused image, allowing us to decide which sigma yields a degradation that lies closest to the observed one. The quantification is achieved by taking the sum of the intensity difference for each pixel of the nucleus. In Figure 10, a plot showing the optimal ratio σ/µm, relative to each focus level offset can be seen for a sample cell. The same experiment was carried out for a representative population of 10 different cells. For small focus offsets, it is, due to discretization issues, hard to get reliable measurements. However, for offsets >1 µm it becomes apparent that a reasonable approximation of the impulse response in the z-direction is, where dz is the z-offset in µm.

The quality and realism of the synthetic images which are generated through the described procedure needs to be evaluated. As we are generating images that can be visually inspected, a natural approach is to do a visual comparison between synthetic and real images. This approach also seems relevant since the most common way of analyzing Pap-test specimens is through visual inspection. In designing such a test, it becomes important to have realistic conditions for the visual inspection. Given unlimited time and the possibility of zooming and scrolling the images, it is in most cases possible to find out which image is real and which is synthetic. But, this is far from the conditions under which this kind of images normally are scrutinized.

We have, therefore, designed a customized evaluation test. The experimental design was inspired by a study devised by Meyer et al. 36. In that study, users were asked to compare a real scene to an identical computer generated one. For the experiment, the subject was showed a tightly cropped view of the real and synthetic scene projected through a lightly frosted glass, to account for limitations of existing display devices. For our validation study, a simple user interface was designed. A patch of size 200 × 200 pixels, randomly selected, was cut out from an image that in turn was randomly selected from a database of images containing 25 synthetic images created with varying settings and 25 real images from different specimen. Before showing the image to the user, a pixel-wise Gaussian noise (σ = 0.7 graylevels) was added. This was done to make the variations in background intensity and smoothness between different real images less visually disturbing, making it easier to focus on the details in the images. The patch was shown to the test subject for 2 s, after which a nontimed prompt for an answer was displayed. In total, 120 patches were displayed to each user. However, the first two patches were training images used to get the user comfortable with the validation system and are not counted in the final result. The outcome of this study is discussed in Results section.

Such tests can give valuable evaluations of whether the feature distributions obtained from the synthetic images are the same as those obtained from real ones. There are, however, problems in that many aspects of the synthetic images are generated from feature distributions extracted from real images. So, for many simple features such as nuclear size, we can obtain perfect agreement between normal and synthetic distributions. Another issue is the fact that manually obtaining a suitable ground truth dataset is, as has previously been discussed, a far from trivial task paired with many difficulties. We have as a complement to the visual evaluation made a comparison of the distribution of one nontrivial feature, the moments of the nuclear texture distribution. The choice is motivated by the fact that from a diagnostic perspective, the key structures in Pap-smear images are the nuclei and their chromatin structure. Following the approach taken by Svoboda et al. 5, a number of central moments as well as an entropy score was calculated for several real and synthetic nuclei. The n th central moment is calculated as

This section collects any data citations, data availability statements, or supplementary materials included in this article.

In this article, we describe a framework for creating realistic synthetic images intended to be used for the development of various kinds of image analysis tasks such as preprocessing and segmentation. Simulated images can fill several functions in a development pipeline, for example:

Bright field imageinterpretation

The goal of this project was to create a simulation framework able to create realistic images containing not only cell nuclei but also cytoplasms as well as a wide variety of other objects and artifacts commonly found in Pap-test specimens. When working with biological data, one of the biggest challenges lies in trying to deal with the extensive variation present in the samples. Objects are often subjected to distortions such as overlapping and folding or obscured by material lying outside the current focus plane of the imaging system. If stains are used, it is common to experience stain intensity variations on both an intrasample and intersample basis.

Results are shown for the six test subjects 1–6 and a random result (x). The results show the number of true positives (TP), the number of false positives (FP), the number of true negatives (TN), and the number of false negatives (FN). From these numbers, the sensitivity TP/(TP+FN) and specificity TN/(TN+FP) have also been calculated.

The Q-Q plot in Figure 11 show a distinct linear relationship between the synthetic and real nuclei for all features, indicating that, as is expected, the two populations are drawn from similar distributions. However, the angle of the linear dependency shows that the real data have a wider distribution than the synthetic data. This, again, is not surprising as each synthetic nucleus texture represents a combination of data from three real nuclei. The choice of using three nuclei as a sampling base stems from an effort to make the texture of the synthetic nuclei more general and not a scrambled copy of a single. This had the side effect of making the variation in the synthesized textures somewhat smaller than in the real ones. This effect would be reduced if we used a single real nucleus as a model for the texture of a synthetic one.

Official websites use .gov A .gov website belongs to an official government organization in the United States.

The second phase of the image synthesis process is the simulation of light as it travels through the sample to the detector. Three primary factors of this problem have been considered for this project; background illumination specifications, how individual objects absorb the light, and finally the approximation of the impulse response function in three dimensions.

The most commonly used type of imaging sensor within microscopy is the charged coupled device (CCD) sensor. These sensors operate using a linear transfer function, that is, the output signal for each pixel is proportional to the number of photons it receives. Optical imaging sensors have, as has been previously stated, certain limitations. The dominant source of noise in sensors, known as photon noise, is related to the fact that the number of photons emitted from a constant light source over a finite time interval is stochastic. Under normal operating conditions, this noise is Poisson distributed and quite easy to simulate. A second type of sensor noise significant enough to be added to the simulation is called readout noise that is a product of phenomena related to the A/D converter and amplification circuits. This noise behaves as additive white Gaussian noise 35.

To generate the shape of the cytoplasm and nucleus phantoms as accurately as possible, we used our cell image database. From our database of some 12,000 segmented cell nuclei, we randomly selected 100 normal squamous intermediate cell nuclei. The nuclei were segmented through a manually seeded watershed algorithm. The corresponding cytoplasm was interactively segmented using the live-wire tool 21. For each cell, the shapes of the cytoplasm and the nucleus have been parameterized using a method for Fourier shape representation described by Zhang and Lu 22. The boundary of an object mask is sampled at a predefined number, K, points spaced at an equal arc length. Starting at an arbitrary point, (x(0), y(0)), coordinate pairs (x1, y1), (x2,y2), …, (x(K − 1), y(K − 1)) are encountered when traversing the boundary in a preselected direction. The shape boundary coordinates can be represented as a sequence of coordinates s(k) = [x(k), y(k)], for k = 0,1,2,…, K − 1. Also, each coordinate pair can be treated as a complex number so that

Bright-field microscope Diagram

2024930 — Union Park Capital Acquires iiM GmbH, Strengthening Exaktera's Posi on in the Machine Vision Market. Boston, MA – September 30, ...

Image

Quantile-quantile plot comparison of descriptors computed from real and synthetic cervical cell nuclei. The quantile-quantile plot illustrate whether the acquired measurements belong to the same distribution. A linear relationship indicates that points belong to the same probability distribution. When the trend is steeper than 45 degrees, which is the case in the plots above, it is an indication that the data plotted on the y-axis, which refers to real data in this figure, has a larger dispersion than the x-axis data. The descriptors shown above are: (Top row, left to right) second central moment, third central moment, and fourth central moment. (Bottom row, left to right) Fifth central moment, sixth central moment, and entropy. [Color figure can be viewed in the online issue, which is available at http://wileyonlinelibrary.com.]

Five central moments, n =2,…,6, as well as the entropy was calculated for a sample of 30 nuclei from real images and an equal number from synthetic images. From this, individual quantile-quantile (Q-Q) plots have been generated for each feature (Fig. 11). A Q-Q plot is a probability plot that compares two distributions by plotting their quantiles against each other 37. If the compared populations are drawn from similar distributions, the points should have an approximately linear relation.

Darkfieldmicroscope

Illustration depicting the steps of the cloth simulation process: (a) Initial shape generated according to method described in Shape generation section that is used as the target for the deformation, (b) Initial cloth mesh prior to any deformation, (c) cloth mesh after deformation, and (d) final rendered result produced by adding a transparent material to the mesh.

Although the Pap-smear has shown its worth through decades of use, it is hampered by certain difficulties, for example, variable smear thickness, uneven cell distribution across the field of view, obscuring elements such as blood and inflammatory cells, and variable fixation and staining results. To overcome some of these problems, so-called LBC preparation methods have been developed. Using LBC, the sample is immersed in a solution that is then subjected to a number of processes that work to homogenize the sample, remove unwanted components (e.g., red blood cells) and finally deposit a suitable mono-layer sample on a glass slide 13. These samples are considerably less cluttered and while thickness of the cells distribution still varies, it is overall far less than for the conventional smear. However, the types of debris objects described in the Problem Statement section earlier will still be present in samples prepared using an LBC protocol.

In bright-field microscopy an object's color is related to its light absorption properties. This behavior can be described using the well known Beer–Lambert law 34, which states that

For this article, objects that are not usable intermediate epithelial cells are referred to as debris or artifacts. A selection of the most common types of artifacts found in Pap-smear samples have been illustrated in Figure 1. These artifacts introduce an added level of difficulty when trying to achieve goals such as reliable nucleus segmentation in a high-throughput setting. When developing a synthetic image generation framework for this type of modality, it is essential that as many of these types of debris are modeled as accurately as possible and included in the final results.

Bright fieldmicroscope principle

The addition of W gives us the ability to control how generated objects are distributed over an image. There are many benefits to this approach. The weight-map allows us to both customize the distribution, perhaps in an attempt to match the distribution of a specific sample image, and create random distributions with certain characteristics, for example, single cluster, multiple clusters, or perhaps a swath of material across the image. Examples of weighted distributions can be seen in Figure 8. In our present study, we have only used these possibilities to a limited extent generating some simple distribution patterns.

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

The cytoplasm texture is difficult to define as it is mainly a result of which deformations it has been exposed to. To that end, a cloth simulation algorithm 25 has been used to simulate cytoplasm deformations. The cloth simulator starts with a circular mesh, that is, a grid-like structure consisting of a number of points (vertices),, connected by edges. This mesh is then deformed using a force-field that gradually shrinks to the generated shape's outer boundary and pushing on vertices that it touches on the way. The deformation of the cloth takes place in 3D with a ground plane limiting movements in a negative z-direction. In the end, the cloth object will have an outer boundary that corresponds to the generated shape, but the vertices that make up the central part of the mesh will be pushed to produced wrinkles and folds. The cloth simulation is an iterative physics simulation where each vertex in the cloth mesh is given acceleration depending on outer forces that affect it as well as inner forces decided by the distances to its neighbors. One iteration of the simulation is called a time-step. The index t gives the time that has passed as the initiation of the simulation, and Δt is used to indicate how much time is increased between iterations. At each iteration, a vertex's new position,, is calculated using the Verlet integration 26,

To distribute the generated objects in naturally looking clusters, we use the weighted distribution method described in the Population Distribution Generation section with randomly generated weight-maps. Finally, the leukocyte phantoms are textured using the common Perlin noise model 32.

In Figure 9, the steps of the WBC phantom generation have been illustrated. The simulation process is divided in two main steps: the creation of each cell and the construction of clusters. For cell shape generation, we have used a parametric model previously used by Lehmussola et al. 4. The shape of a cell is based on the parametric form of a circle, where each coordinate pair is written as and generated by

The distribution of generated objects is an important factor of the simulated image generation process. The simplest approach to generating a coordinate pair, (x, y), for an image with dimensions (M, N) is to draw the coordinates from a uniform distribution, U(a, b) where a and b describes the interval,

The simulation process for bacilli generation starts with the generation of randomly oriented short binary lines. These line segments are then clustered in the same way as described for the WBC generation in the end of White Blood Cells section.

where. The complex coefficients un are called the Fourier shape descriptors (FSD) of the boundary. These descriptors represent the shape of the object in the frequency domain. We carry out this kind of FSD extraction on a representative population of nuclear shapes and thus collect a set of statistical populations of FSD. We can now use naive Bayesian theory to generate new shapes. This means that we from these populations extract the mean and standard deviation for each descriptor and use that to define a statistical distribution from which we draw samples at random. These samples define a new shape which is returned to real space through inverse Fourier transform. The FSD can be normalized with respect to size by dividing the descriptors with the magnitude of the second component,, of the signal, yielding the normalized shape descriptor vector as

The algorithms described in this article were implemented in Matlab (2011b, The MathWorks, Natick, MA) using the image processing toolbox DIPimage 16.

The goal of the Papanicolaou test is the detection of neoplastic (cancerous) and proneoplastic (precancerous) cell changes in the cervix before they reach an invasive stage. Using a spatula or a brush, the area known as the transformation zone of the uterine cervix is scraped to obtain epithelial cells. The epithelium has a layered structure consisting of basal, parabasal, intermediate, and superficial cell. Out of these, the intermediate cells are the most common and have frequently been the target of automated analysis studies 13. The collected material is smeared onto a glass slide to produce the Pap-smear. The sample is fixed and stained to enhance the contrast between nucleus and cytoplasm 15.

The way the signal transmission is approximated is naturally highly dependent on the modality being simulated. Perhaps, the most important phenomenon is the impulse response of the system, also known as the point spread function (PSF), which is a central aspect of all types of microscopy 3. The PSF is most often replaced by a Gaussian kernel as its generally accepted to be a good approximation 4,8,17. Other aspects belonging to this stage are uneven illumination 10 and various kinds of chromatic aberrations 19.

Figure 2 depicts a flowchart, organized as suggested by Svoboda et al., illustrating the synthesis process of the algorithm described in this article. In the following sections, we are going to describe the steps of the method in more detail.

When the cloth mesh has undergone deformation, a surface render is performed to get a base texture for the cytoplasm. The cloth simulation process has been illustrated in Figure 6. Note that through the simulation process, a naturally looking folding pattern has been created at the edge of the cytoplasm shape which propagates into the central part of the cytoplasm phantom.

* Correspondence to: Patrik Malm; Division of Visual Information and Interaction, Department of Information Technology, Centre for Image Analysis, Uppsala University, 751 05 Uppsala, Sweden. E-mail: patma409@gmail.com

for i = 1, …, K, where K is the number of boundary points, controls the range of values drawn from the uniform distribution and s is a scale parameter.

Conceptual illustration of patch-based texture synthesis. (Left) Sample image. (Middle) Patch evaluation based on the distance between the overlap region of two patches. (Right) Finished synthesized texture. [Color figure can be viewed in the online issue, which is available at http://wileyonlinelibrary.com.]

It is common for images taken of Pap-smears to contain objects that lie well outside the current focus plane. The source of these objects varies. Sometimes it is biological material such as cells, WBC clusters or mucus. Other times, the cause is dirt on the cover glass of the samples or air bubbles in the fixation liquid. These artifacts can take on many forms, and it is an ambitious task to try to simulate all of them. However, because they generally are placed so far outside the focus plane, their appearance is very blurred. To that end, we have created an out of focus (OOF) debris group where an object is created using the same parametric model and textured the same way as the WBC described in White Blood Cells section. These objects are then taken to be placed at a focus plane well outside the defined space and blurred accordingly (see Impulse response section).

The synthesis framework described in this article is available on the MATLAB file-exchange, at http://www.mathworks.com/matlabcentral/fileexchange/48915-synthetic-bright-field-microscopy-image-generator, as an open-source code package on publication acceptance.

Bright field imageexplained

The cytoplasms appear as a mostly transparent gel-like substance. It has little stability and is thus easily deformed, resulting in variations in shape and size between cells. The cytoplasm of a healthy cell displays little in the way of a regular texture other than a fine network of fibers, these are known as the cytoskeleton.

where the exponent, n, controls the strength of the variation. We can now add the second type of aberration which is related to light scattering in the slide. To simulate these effects, we have chosen to use a Perlin noise image, Ipe(x, y): [0,1], together with a constant, cpe, that controls the strength of the variations. Perlin noise is a computationally efficient way of generating correlated noise 32. It is commonly used in computer graphics to generate natural looking textures. It is not directly based on any physical model of light transport, but the results are visually similar to that caused by the light scattering effects.

(Top) Nucleus at its focus level (Left) and its actual degeneration at a 1.2-µm offset (Middle image), compared to a Gaussian degeneration with σ = 1.2 (Right image). (Bottom) Plot discerning optimal ratio for σ relative to the z-offset (μm) as a function of the z-offset for specific offsets.

An example of a finished synthetic image compared to a real image can be seen in Figure 12. The results of the visual evaluation can be seen in Table1. Six test subjects have been used for the evaluation. The first 1 test subject is a cytologist with over 30 year experience of cervical smear screening. Test subjects 2–5 are cytometry algorithm developers with several year experiences from developing methods for automated Pap-smear analysis. The last test subject 6 is an algorithm developer with experience in life-science applications. This person had no experience with cytometry images and was included as a reference. Also for reference purposes, a random result subject (x) has been added to the result. For all subjects, the test was the first time they came into contact with the image generation method described in this article. All the subjects had prior experience with observing Pap-smear images in grayscale. For each user the number of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN) were counted. For this evaluation, a real image was counted as a positive and a synthetic image as a negative. Based on these values, the sensitivity, TP/(TP+FN), and specificity, TN/(TN+FP), for each user was also calculated. In this setting, the sensitivity value relates to the ability to accurately detect real images as being real and the specificity the ability to accurately detect synthetic images as synthetic. The sensitivity for the six human test subjects is not consistent with a random result (Mean: 0.7667, 95% confidence interval [0.671, 0.863]), but it is still low enough to indicate that experienced individuals had a difficult time separating real images from synthetic ones under these experimental conditions. The random assignment, “subject x,” achieved almost as good results as the actual persons, it deviated somewhat from the expected 0.50 in sensitivity and specificity as the population size was rather small. A different experimental design may have given different results. It is usually possible to tell the difference if you are given unlimited time and can zoom and scroll the image arbitrarily. But in routine cytology screening, the time available for this analysis is extremely limited so our conclusion is that our synthetic images are visually quite similar to real ones when studied under realistic conditions.

Although humans analyze Pap-smears in color, computer-based systems commonly operate on monochrome images. This is because color has very little diagnostic value when analyzing specimen stained using the Pap-stain. Instead, when developing automated systems, achieving a good contrast for nucleus texture is the highest priority. Therefore, specific filters are used that produce monochrome images with maximum nucleus contrast 14. The simulator described in this article is imitating images taken using such circumstances.

It is important to recognize the fact that, while synthetic images can function as a great development tool, problems such as overfitting need to be taken into account. Because synthetic images are the result of their defining parameters, there exists a limitation in the variation present in the images. Good design can, to a certain extent, alleviate the problem, but the fact remains that real data remains a necessity in the creation of any image processing algorithm aimed at real world applications. However, one can compare the risk of overfitting synthetic data with the risks of overfitting when tuning algorithms with the help of small amounts of real ground truth data. We can benefit from the great flexibility of synthetic data, while at the same time guarantee an estimate on real data if proper cross-validation is performed for the final parameter tuning using real expert annotated data.

Nucleus texture generation. (Top row) The three nuclei used as texture samples. (Bottom row) Generated nucleus texture and final generated nucleus.

To add randomness to the nucleus texture generation, each new texture is generated from three different sample nuclei randomly selected from the database. This significantly reduces the risk of recurring texture details. Furthermore, we have added weighting to the patch selection so that patches that are to be placed close to the center of the phantom are more likely to be found in central parts of the sample images. An example of a generated nucleus can be seen in Figure 5.

The authors would like to thank the participants of the user study. The work was carried out within the framework of a collaboration with a research project at the Center for Advanced Computing in Thiruvananthapuram headed by Rajesh Kumar and the Regional Cancer Center, Kerala, India, headed by Dr K Sujathan, funded by the Department of Information Technology, Government of India. Ethical permit for using stained Pap smears was obtained from Indian Council of Medical Research, permit number INDO/FRC/402/2005-IHD. Funding was also provided by the Swedish Research Council (2008-2738) and VINNOVA (2008-01712).

As digital imaging is becoming a fundamental part of medical and biomedical research, the demand for computer-based evaluation using advanced image analysis is becoming an integral part of many research projects. A common problem when developing new image analysis algorithms is the need of large datasets with ground truth on which the algorithms can be tested and optimized. Generating such datasets is often tedious and introduces subjectivity and interindividual and intraindividual variations. An alternative to manually created ground-truth data is to generate synthetic images where the ground truth is known. The challenge then is to make the images sufficiently similar to the real ones to be useful in algorithm development. One of the first and most widely studied medical image analysis tasks is to automate screening for cervical cancer through Pap-smear analysis. As part of an effort to develop a new generation cervical cancer screening system, we have developed a framework for the creation of realistic synthetic bright-field microscopy images that can be used for algorithm development and benchmarking. The resulting framework has been assessed through a visual evaluation by experts with extensive experience of Pap-smear images. The results show that images produced using our described methods are realistic enough to be mistaken for real microscopy images. The developed simulation framework is very flexible and can be modified to mimic many other types of bright-field microscopy images. © 2015 The Authors. Published by Wiley Periodicals, Inc. on behalf of ISAC

The problem with this approach is that populations generated using this method will not look natural. Objects in biological samples tend to end up in more concentrated groups 28. The problem of population generation has previously been studied as described in 4 and 29 showing meaningful results. For this study, we have instead chosen to use a different approach, a method known as rejection sampling 30, in computer graphics also known as Russian Roulette Monte Carlo sampling 31, to generate our distributions. The basic concept behind rejection sampling is that a coordinate pair drawn from a uniform distribution is accepted with a probability Wx,y, where. We call W a weight-map and a distribution created using W a weighted distribution. New coordinate pairs are drawn until a specified number of coordinates have been accepted. This approach is simple to use and produce good results as long as W contains large enough areas with a relatively high probability score. For weight-maps, where all positions have a low probability the algorithm will take a long time to execute as the rejection rate will be high. However, with proper understanding of the method's limitations the approach works exceedingly well.

For this study, nucleus textures have been created using a texture synthesis approach, well known in computer graphics, called patch-based texture generation 23. The goal of this method is to be able to create a texture image of any size based on a small texture sample. A schematic description of the texture generation process can be seen in Figure 4.

The nucleus is in contrast to the cytoplasm a more rigid structure that has a principally elliptical shape. Because of the staining involved when preparing the Pap-smear, it is generally darker relative to the cytoplasm. The appearance of the nucleus texture is related to the distribution of the chromatin within the nucleus membrane.

Bright field imageexample

where k is a constant commonly referred to as the extinction coefficient (a characteristic of the dye), c is the concentration of the dye, and l is the length of the light path.

Under perfect circumstances the background illumination should be a uniform function. This, however, is rarely the case. The most common kind of aberration for bright-field microscopy is uneven illumination, a low frequency light intensity variation over the field-of-view. Furthermore, for bright-field microscopy the light path has to pass through a glass slide. This will generate aberrations caused by the optical properties of glass as well as by scratches and dirt on the surface. To simulate these phenomena, two principal degradation steps have been added to the background generation process.

Secure .gov websites use HTTPS A lock ( Lock Locked padlock icon ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

IEC 62471 is a comprehensive standard used for evaluating the photobiological safety of lamps and lighting systems that emit light in the spectral region 200- ...

Bright fieldmicroscopeimage

The framework described in this article includes the creation of a wide variety of objects, ranging from the individual cells to clusters of bacilli, as well as an emulation of an optical system paired with a detector. Svoboda et al. 5 suggested that the simulation process can be split into three main phases: phantom object generation, signal transmission, and finally, signal detection and image formation.

The shape of the PSF for a specific bright-field microscopy setup is dependent on the NA of the objective used, the refractive index of the medium between the sample and the objective lens and the wavelength, λ, of the image forming light 6. Trying to mathematically determine the precise degenerative effects of the PSF for specific hardware specifications can be exceedingly difficult. For this study, we have instead chosen to simplify the process by approximating the impulse response function with a Gaussian function, Gσ. Furthermore, for simplicity and speed, we have elected to separate the blurring process into a depth of focus blurring, GΔz, and an image plane blurring, GΔx,y.

The feathering, or blurring, of the nucleus mask,, prevents the creation of sharp, unnatural edges around the nucleus texture.

We have generated a small population of synthetic images and also picked an equal number of real images and cropped those to the same size as the synthetic ones and supplied these two image datasets as Supporting Information allowing the reader to evaluate the similarities and differences between the images. These images can be found as a Supporting Information.

Rigid Industries Light Bars & Cubes ... Best Selling ... View ...

A selection of artifacts that commonly occur in bright-field microscopy images of Pap-smears. From top to bottom: bacilli (small linear objects), WBC (small dark elliptical objects), dye specks, and OOF objects.

View Advanced Distributors (www.advanceddistributors.com) location in Minnesota, United States , revenue, industry and description. Find related and similar ...

The simulation described in this article constitutes a highly flexible framework for emulating Pap-smear images taken using a standard bright-field microscope. This is, to the best of our knowledge, only the second attempt at creating realistic synthetic images emulating data collected using bright-field microscopes, the first one being our own previous work 8. This new framework, however, improves greatly on previous work through the addition of shape and texture generation based on data obtained from an extensive database of segmented cells, the inclusion of more debris models, a more versatile method for object distribution, and an accurate approximation of the image depth of field, allowing for simulated focus stacks to be created. The resulting images have been validated using visual inspection of images by trained individuals with experience in analyzing cell images as well as cytology experts.

The final step of the synthesis process is to remove some of the degenerative effects applied to the parts of the image displaying nucleus texture. Because the nucleus texture is a product of patch-based texture generation from source material, the texture has actually already been subjected to the degenerative effects of the optics and the sensor system. Therefore, the final image, If(x, y) is updated according to

To create synthetic images that are sufficiently similar to reality, access to real data is a necessity. For the development of the framework described in this article, a large database of Pap-smear images, acquired at several focus levels, was available. In the following sections, the source material will be discussed in more detail.