Page 26 - Forum-2021-JanToMarch
P. 26

A high thresh-
                                                                                                   old will result in
                                                                                                   many small clus-
                                                                                                   ters, whereas a low
                                                                                                   threshold will result
                                                                                                   in fewer, but larger
                                                                                                   clusters. This thresh-
                                                                                                   old can be changed
                                                                                                   by the investigator
                                                                                                   to suit the task
                                                                                                   and preferences. A
                                                                                                   cluster overview can
                                                                                                   be generated, which
                                                                                                   shows the most rel-
                                                                                                   evant image of each
                                                                                                   cluster. This enables
                                                                                                   the investigator to
                                                                                                   quickly find relevant
                                                                                                   clusters.
                                                                                                     It is then up to
                                                                                                   the investigator to
                                                                                                   classify the clusters.
                                                                                                   In ImEx, the images
                                                                                                   in a cluster are dis-
                                                                                                   played in a scrollable
                                                                                                   canvas at the bottom
        Figure 4. Overview of wreckage site 4                                                      of the screen. The
        and the location of the wreckage pieces.                                                   display size of the
        (Source: Dutch Safety Board)                                                               images can be ad-
                                                                                                   justed. The inves-
                                                                     Figure 5. The exploration-search   tigator can create
                                                                     axis with example multimedia   buckets for holding
                                                                     analytics (sub)tasks.         whole clusters or a
                                                                                                   selection of images
          Furthermore, the investigator should   categories (such as cats, dogs, houses, cars, etc.) by extracting   in order to structure
        be able to                           features from images, such as shapes and textures. Features   the image collection.
          •  Browse fluidly through the images.  extracted from an image are represented by a value, where   Relevant images or
          •  Place images in user-defined catego-  a higher value means the feature is present more frequent-  parts of images can
            ry “buckets” to structure the image   ly and more clearly in the image. In the training phase, the   be queried to find
            collection.                      neural network learns which features are best to discriminate   additional images.
                                             between categories. By finding these features, it can decide to   By generating buck-
          •  Retrieve and filter images based on                                                   ets, and by adding
            these buckets.                   which category an image belongs.
                                               ImEx works slightly different. As noted, training a neural   images to these
          •  Gain information about progress   network requires a lot of training examples, which are usually   buckets, the image
            made in the structuring of the image   not available for crash sites or other accident sites. Therefore,   collection is given
            collection.
                                             rather than classifying images (deciding to which category   structure by the user.
          We developed ImEx (Incident Image   an image belongs), ImEx only calculates whether images   A second window
        Explorer) with these tasks and features   look similar or not. ImEx still makes use of a neural network   shows the progress
        in mind to assist investigators in inves-  trained to classify everyday objects and scenes (such as differ-  of structuring the
        tigations with large image collections.   ent types of animals, sceneries, intact airplanes, other modes   image collection
        To cluster images based on similarity,   of transportation, instruments, etc.).            and a Sankey
        ImEx makes use of a convolutional neural   The neural network used in ImEx extracts 2,048 features   diagram to show
        network. A brief description follows, as   per image. The similarity between two images can then be   relations between
        a full explanation of neural networks   calculated by correlating the 2,048 features of one image with   the buckets. Based
        goes beyond the scope of this paper. In   the 2,048 features of another image. If this correlation is higher   on images contained
        short, convolutional neural networks are   than a user-defined threshold, the two images are placed in   in multiple buckets,
        the current state of the art in computer   the same cluster. If other images also correlate higher than   the Sankey diagram
        vision. By making use of large collections   this threshold, these images are also placed in the same   shows a breakdown
        of labeled training data, a neural net-  cluster. This process is repeated until all images are placed in   of each bucket, e.g.,
        work is trained to discriminate between   a cluster.                                       upon close inspec-
        26  •   January-March 2021 ISASI Forum
   21   22   23   24   25   26   27   28   29   30   31