Manual Testing Guide

By following this guide, you will be able to test all the significant elements of the application. All of the files required for testing are in the repository under “source/test_resources”.

Testing Projects

Project Creation

  1. When you start up the application, you should see a landing page with a button for adding a new project.

    1. Click the new project button.
    2. Provide a name and a description for the project on the next page, and press submit.
    3. Verify that your new project shows up on the project page list.
    4. Click on the new project and go to the project page. Verify that there are no crawls, models, or datasets yet.

Project Settings

  1. Click the “pencil” icon next to the name of the project on the project overview page.

    1. Supply a different name and description for the project, and hit “submit”.
    2. Verify that the project was edited successfully by checking the success message at the top of the page.
  2. Go back to the settings page.

    1. Click on the “trashcan” icon. Verify that there is a popup asking you whether you want to delete the project.
    2. Click on the trash icon and click yes.
    3. Verify that you are taken to the landing page, and that there are no projects listed on the landing page.

Testing Indices

Index Creation

  1. Create a new project.

  2. Click on the “Add Index” button either in the sidebar or under the list of indices on the project page.

    1. Add an index. Give the index a name and a zip file. There are two zipfiles in the repository to use, located at “source/resources/test_resources”. Click submit.
    2. Verify that the index was added successfully by checking for the success message at the top of the page.
    3. Verify that the index was successfully created by checking the status next to the name of the index.
    _images/index_creation_success.png

Index Settings

  1. Click on the link to the index on the project overview page. This will take you to the index settings page.

    _images/edit_index_link.png
    1. Supply a new zipfile for the index creation. Use the zipfile that you did not use earlier – “sample2.zip” if you earlier used “sample.zip”.
    2. Verify that the index was updated successfully by checking the indices list.
    3. Verify that the new files were added to the newly created index.
  2. Return to the index settings page and click the “trashcan” icon. As before, confirm that the cancel button works, and then delete the index.

    1. Confirm that the index was deleted successfully by looking at the list of indices on the project overview page.

Testing Seeds

At the navbar, click on the “Seeds” tab.

  1. Create a Seeds List
    1. Create a seeds list by providing a file.
    2. Create another by pasting URLs into the textbox.
    3. Paste in invalid URL into the textbox, and verify that it is highlighted red.
  2. Edit a seeds list
    1. Click on the icon for the seeds list to access the edit seeds page.
    2. Remove some URLs and click “Reset” to return to the original seeds list.
    3. Make one of the URLs invalid, and press “Save”
    4. Verify that the invalid URL is highlighted with red.
    5. Fix or remove the URL and click “Save”

Testing Crawls

Testing Nutch Crawls

Included with the repository is a test seeds file. You can use this file to testing of nutch and ache crawls. The seeds file is located at “source/test_resources/test_crawl_data/cats.seeds”.

  1. From the project overview page, click the Add Crawl button on the list of crawls or in the sidebar dropdown.

  2. At the add crawl page, supply a name and description.

    1. Make sure that the “nutch” option is selected.
    _images/crawler_nutch.png
    1. Select one of the previously created seed lists and create the crawl.
  3. Verify that the crawl has been added successfully to the crawls list table.

  4. Go to the crawl page by following the link in the crawls list table.

    1. Verify that the crawl status and available buttons are the same as in this image.
    _images/nutch_dashboard_initial.png
    1. The following buttons should be available: “Start Crawl”, “Get Seeds List”. All other buttons should be greyed-out.
    2. The crawl status should be set to NOT STARTED with 0 rounds left to crawl.
  5. Start a crawl and verify that the crawl completes successfully.

    1. When you start the crawl, there should be two rounds left.
    2. At the end of the first round, summary statistics should list total pages crawled as between 6 and 9.
    3. After the first round is done, the status should show “SUCCESS” before going onto the next round.
    4. On the start of the next round, the crawl status should change to “STARTED”
    5. At the end of the second round, the rounds left should be zero.
    6. The pages crawled should be between 300 and 400.

Test Crawl Settings

  1. On the crawl page, click the “gears” icon to access the settings.

    1. Change the name and description of the crawl, and submit.
    2. Click the “trashcan” icon to delete the crawl.
    3. Hit cancel on the popup first, and then delete the crawl.
    4. Verify that you are brought to the project overview page.