Andal.LND/edx-platform

Fork 0

Files

Ben Patterson b8807e8163 Clarify swtiches on paver tasks.

2014-10-16 19:59:15 -04:00

19 KiB

Raw Blame History

Testing

Overview

We maintain three kinds of tests: unit tests, integration tests, and acceptance tests.

Overall, you want to write the tests that maximize coverage while minimizing maintenance. In practice, this usually means investing heavily in unit tests, which tend to be the most robust to changes in the code base.

The pyramid above shows the relative number of unit tests, integration tests, and acceptance tests. Most of our tests are unit tests or integration tests.

Unit Tests

Each test case should be concise: setup, execute, check, and teardown. If you find yourself writing tests with many steps, consider refactoring the unit under tests into smaller units, and then testing those individually.
As a rule of thumb, your unit tests should cover every code branch.
Mock or patch external dependencies. We use voidspace mock.
We unit test Python code (using unittest) and Javascript (using Jasmine)

Integration Tests

Test several units at the same time. Note that you can still mock or patch dependencies that are not under test! For example, you might test that LoncapaProblem, NumericalResponse, and CorrectMap in the capa package work together, while still mocking out template rendering.
Use integration tests to ensure that units are hooked up correctly. You do not need to test every possible input--that's what unit tests are for. Instead, focus on testing the "happy path" to verify that the components work together correctly.
Many of our tests use the Django test client to simulate HTTP requests to the server.

UI Acceptance Tests

Use these to test that major program features are working correctly.
We use lettuce to write BDD-style tests. Most of these tests simulate user interactions through the browser using splinter.
We use Bok Choy to write end-user acceptance tests directly in Python, using the framework to maximize reliability and maintainability.

Test Locations

Python unit and integration tests: Located in subpackages called tests. For example, the tests for the capa package are located in common/lib/capa/capa/tests.
Javascript unit tests: Located in spec folders. For example, common/lib/xmodule/xmodule/js/spec and {cms,lms}/static/coffee/spec For consistency, you should use the same directory structure for implementation and test. For example, the test for src/views/module.coffee should be written in spec/views/module_spec.coffee.
UI acceptance tests:
- Set up and helper methods, and stubs for external services: common/djangoapps/terrain
- Lettuce Tests: located in features subpackage within a Django app. For example: lms/djangoapps/courseware/features
- Bok Choy Tests: Artifacts are located under common/test/acceptance

Factories

Many tests delegate set-up to a "factory" class. For example, there are factories for creating courses, problems, and users. This encapsulates set-up logic from tests.

Factories are often implemented using FactoryBoy

In general, factories should be located close to the code they use. For example, the factory for creating problem XML definitions is located in common/lib/capa/capa/tests/response_xml_factory.py because the capa package handles problem XML.

Running Tests

You can run all of the unit-level tests using the command

paver test

This includes python, javascript, and documentation tests. It does not, however, run any acceptance tests.

Running Python Unit tests

We use nose through the django-nose plugin to run the test suite.

You can run all the python tests using paver commands. For example,

paver test_python

runs all the tests. It also runs collectstatic, which prepares the static files used by the site (for example, compiling Coffeescript to Javascript).

You can re-run all failed python tests by running: (see note at end of section)

paver test_python --failed

To test lms or cms python, use:

paver test_system -s lms

paver test_system -s cms

You can also run these tests without collectstatic, which is faster:

paver test_system -s lms --fasttest

paver test_system -s cms --fasttest

To run a single django test class:

paver test_system -t lms/djangoapps/courseware/tests/tests.py:ActivateLoginTest

When developing tests, it is often helpful to be able to really just run one single test without the overhead of PIP installs, UX builds, etc. In this case, it is helpful to look at the output of paver, and run just the specific command (optionally, stripping away coverage metrics). At the time of this writing, the command is:

python ./manage.py lms test --verbosity=1 lms/djangoapps/courseware/tests/test_courses.py   --traceback --settings=test

To run a single django test:

paver test_system -t lms/djangoapps/courseware/tests/tests.py:ActivateLoginTest.test_activate_login

To re-run all failing django tests from lms or cms, use the --failed,-f flag (see note at end of section)

paver test_system -s lms --failed
paver test_system -s cms --failed

There is also a --fail_fast, -x option that will stop nosetests after the first failure.

common/lib tests are tested with the test_lib task, which also accepts the --failed and --fail_fast options. For example:

paver test_lib -l common/lib/calc

paver test_lib -l common/lib/xmodule --failed

To run a single nose test file:

nosetests common/lib/xmodule/xmodule/tests/test_stringify.py

To run a single nose test:

nosetests common/lib/xmodule/xmodule/tests/test_stringify.py:test_stringify

To run a single test and get stdout, with proper env config:

python manage.py cms --settings test test contentstore.tests.test_import_nostatic -s

To run a single test and get stdout and get coverage:

python -m coverage run --rcfile=./common/lib/xmodule/.coveragerc which ./manage.py cms --settings test test --traceback --logging-clear-handlers --liveserver=localhost:8000-9000 contentstore.tests.test_import_nostatic -s # cms example
python -m coverage run --rcfile=./lms/.coveragerc which ./manage.py lms --settings test test --traceback --logging-clear-handlers --liveserver=localhost:8000-9000  courseware.tests.test_module_render -s # lms example

generate coverage report:

coverage report --rcfile=./common/lib/xmodule/.coveragerc

or to get html report:

coverage html --rcfile=./common/lib/xmodule/.coveragerc

then browse reports/common/lib/xmodule/cover/index.html

To run tests for stub servers, for example for YouTube stub server, you can do one of:

paver test_system -s cms -t common/djangoapps/terrain/stubs/tests/test_youtube_stub.py
python -m coverage run --rcfile=cms/.coveragerc `which ./manage.py` cms --settings test test --traceback common/djangoapps/terrain/stubs/tests/test_youtube_stub.py

Very handy: if you uncomment the pdb=1 line in setup.cfg, it will drop you into pdb on error. This lets you go up and down the stack and see what the values of the variables are. Check out the pdb documentation

Note: More on the --failed functionality

In order to use this, you must run the tests first. If you haven't already run the tests, or if no tests failed in the previous run, then using the --failed switch will result in all of the tests being run. See more about this in the nose documentation.
Note that paver test_python calls nosetests separately for cms and lms. This means that if tests failed only in lms on the previous run, then calling paver test_python --failed will run all of the tests for cms in addition to the previously failing lms tests. If you want it to run only the failing tests for lms or cms, use the paver test_system -s lms --failed or paver test_system -s cms --failed commands.

Running Javascript Unit Tests

We use Jasmine to run JavaScript unit tests. To run all the JavaScript tests:

paver test_js

To run a specific set of JavaScript tests and print the results to the console:

paver test_js_run -s lms
paver test_js_run -s lms-coffee
paver test_js_run -s cms
paver test_js_run -s cms-squire
paver test_js_run -s xmodule
paver test_js_run -s common

To run JavaScript tests in your default browser:

paver test_js_dev -s lms
paver test_js_dev -s lms-coffee
paver test_js_dev -s cms
paver test_js_dev -s cms-squire
paver test_js_dev -s xmodule
paver test_js_dev -s common

These paver commands call through to a custom test runner. For more info, see js-test-tool.

Running Bok Choy Acceptance Tests

We use Bok Choy for acceptance testing. Bok Choy is a UI-level acceptance test framework for writing robust Selenium tests in Python. Bok Choy makes your acceptance tests reliable and maintainable by utilizing the Page Object and Promise design patterns.

Prerequisites:

These prerequisites are all automatically installed and available in Devstack, the supported development enviornment for the edX Platform.
Chromedriver and Chrome (see Running Lettuce Acceptance Tests below for the latest tested versions)
Mongo
Memcache
mySQL

To run all the bok choy acceptance tests:

paver test_bokchoy

Once the database has been set up and the static files collected, you can use the 'fast' option to skip those tasks. This option can also be used with any of the test specs below:

paver test_bokchoy --fasttest

To run a single test, specify the name of the test file. For example:

paver test_bokchoy -t test_lms.py

Notice the test file location is relative to common/test/acceptance/tests. For example:

paver test_bokchoy -t studio/test_studio_bad_data.py

To run a single test faster by not repeating setup tasks:

paver test_bokchoy -t studio/test_studio_bad_data.py --fasttest

To test only a certain feature, specify the file and the testcase class:

paver test_bokchoy -t studio/test_studio_bad_data.py:BadComponentTest

To execute only a certain test case, specify the file name, class, and test case method:

paver test_bokchoy -t test_lms.py:RegistrationTest.test_register

During acceptance test execution, log files and also screenshots of failed tests are captured in test_root/log.

To put a debugging breakpoint in a test use:

from nose.tools import set_trace; set_trace()

By default, all bokchoy tests are run with the 'split' ModuleStore. To override the modulestore that is used, use the default_store option. The currently supported stores are: 'split' (xmodule.modulestore.split_mongo.split_draft.DraftVersioningModuleStore) and 'draft' (xmodule.modulestore.mongo.DraftMongoModuleStore). For example:

paver test_bokchoy --default_store='draft'

Running Lettuce Acceptance Tests

We use Lettuce for acceptance testing. Most of our tests use Splinter to simulate UI browser interactions. Splinter, in turn, uses Selenium to control the Chrome browser.

Prerequisite: You must have ChromeDriver installed to run the tests in Chrome. The tests are confirmed to run with Chrome (not Chromium) version 34.0.1847.116 with ChromeDriver version 2.6.232917.

To run all the acceptance tests: paver test_acceptance

To run only for lms or cms:

paver test_acceptance -s lms
paver test_acceptance -s cms

To test only a specific feature:

paver test_acceptance -s lms --extra_args="lms/djangoapps/courseware/features/problems.feature"

To test only a specific scenario

paver test_acceptance -s lms --extra_args="lms/djangoapps/courseware/features/problems.feature -s 3"

To start the debugger on failure, add the --pdb option to extra_args:

paver test_acceptance -s lms --extra_args="lms/djangoapps/courseware/features/problems.feature --pdb"

To run tests faster by not collecting static files, you can use paver test_acceptance -s lms --fasttest and paver test_acceptance -s cms --fasttest.

By default, all acceptance tests are run with the 'draft' ModuleStore. To override the modulestore that is used, use the default_store option. Currently, the possible stores for acceptance tests are: 'split' (xmodule.modulestore.split_mongo.split_draft.DraftVersioningModuleStore) and 'draft' (xmodule.modulestore.mongo.DraftMongoModuleStore). For example: paver test_acceptance --default_store='draft' Note, however, all acceptance tests currently do not pass with 'split'.

Acceptance tests will run on a randomized port and can be run in the background of paver cms and lms or unit tests. To specify the port, change the LETTUCE_SERVER_PORT constant in cms/envs/acceptance.py and lms/envs/acceptance.py as well as the port listed in cms/djangoapps/contentstore/feature/upload.py

During acceptance test execution, Django log files are written to test_root/log/lms_acceptance.log and test_root/log/cms_acceptance.log.

Note: The acceptance tests can not currently run in parallel.

Debugging Acceptance Tests on Vagrant

If you are using a local Vagrant dev environment to run acceptance tests, then you will only get console text output. To actually see what is happening, you can turn on automatic screenshots. For each step two screenshots will be taken - before, and after. To do this, simply add the step:

Given I enable capturing of screenshots before and after each step

to your scenario. This step can be added anywhere, and will enable automatic screenshots for all following steps for that scenario only. You can also use the step

Given I disable capturing of screenshots before and after each step

to turn off auto screenshots for all steps following it.

Screenshots will be placed in the folder {TEST_ROOT}/log/auto_screenshots. Each time you launch acceptance tests, this folder will be cleaned. Each screenshot will be named according to the template string {scenario_number}__{step_number}__{step_function_name}__{"1_before"|"2_after"}.

If you don't want to have screenshots be captured for all steps, but rather want fine grained control, you can use the decorator

@capture_screenshot_before_after

before any Python function in feature_name.py file. The decorator will capture two screenshots - one before the decorated function runs, and one after. Also, the function

from lettuce import world; world.capture_screenshot("image_name")

is available, and can be inserted at any point in code to capture a screenshot specifically in that place. In both cases the captured screenshots will go to the same folder as when using the step method - {TEST_ROOT}/log/auto_screenshot.

A totally different approach to visually seeing acceptance tests run in Vagrant is to redirect Vagrant X11 session to your local machine. Please see https://github.com/edx/edx-platform/wiki/Test-engineering-FAQ for instruction on how to achieve this.

Viewing Test Coverage

We currently collect test coverage information for Python unit/integration tests.

To view test coverage:

Run the test suite:
```
 paver test
```
Generate reports:
```
 paver coverage
```
Reports are located in the reports folder. The command generates HTML and XML (Cobertura format) reports.

Code Style Quality

To view code style quality (including pep8 and pylint violations):

paver run_quality

More specific options are below.

Running a particular quality report:
```
  paver run_pep8
  paver run_pylint
```
Running a report, and setting it to fail if it exceeds a given number of violations:
```
  paver run_pep8 --limit=800
```
The run_quality uses the underlying diff-quality tool (which is packaged with diff-cover). With that, the command can be set to fail if a certain diff threshold is not met. For example, to cause the process to fail if quality expectations are less than 100% when compared to master (or in other words, if style quality is worse than what's already on master):
```
  paver run_quality --percentage=100
```

Testing using queue servers

When testing problems that use a queue server on AWS (e.g. sandbox-xqueue.edx.org), you'll need to run your server on your public IP, like so.

./manage.py lms runserver 0.0.0.0:8000

When you connect to the LMS, you need to use the public ip. Use ifconfig to figure out the number, and connect e.g. to http://18.3.4.5:8000/

Acceptance Test Techniques

Element existence on the page
Do not use splinter's built-in browser methods directly for determining if elements exist. Use the world.is_css_present and world.is_css_not_present wrapper functions instead. Otherwise errors can arise if checks for the css are performed before the page finishes loading. Also these wrapper functions are optimized for the amount of wait time spent in both cases of positive and negative expectation.
Dealing with alerts
Chrome can hang on javascripts alerts. If a javascript alert/prompt/confirmation is expected, use the step 'I will confirm all alerts', 'I will cancel all alerts' or 'I will anser all prompts with "(.*)"' before the step that causes the alert in order to properly deal with it.
Dealing with stale element reference exceptions
These exceptions happen if any part of the page is refreshed in between finding an element and accessing the element. When possible, use any of the css functions in common/djangoapps/terrain/ui_helpers.py as they will retry the action in case of this exception. If the functionality is not there, wrap the function with world.retry_on_exception. This function takes in a function and will retry and return the result of the function if there was an exception
Scenario Level Constants
If you want an object to be available for the entire scenario, it can be stored in world.scenario_dict. This object is a dictionary that gets refreshed at the beginning on the scenario. Currently, the current logged in user and the current created course are stored under 'COURSE' and 'USER'. This will help prevent strings from being hard coded so the acceptance tests can become more flexible.
Internal edX Jenkins considerations
Acceptance tests are run in Jenkins as part of the edX development workflow. They are broken into shards and split across workers. Therefore if you add a new .feature file, you need to define what shard they should be run in or else they will not get executed. See someone from TestEng to help you determine where they should go.

Also, the test results are rolled up in Jenkins for ease of understanding, with the acceptance tests under the top level of "CMS" and "LMS" when they follow this convention: name your feature in the .feature file CMS or LMS with a single period and then no other periods in the name. The name can contain spaces. E.g. "CMS.Sign Up"

19 KiB Raw Blame History