These files seem to have been copied over from old XML courses, and shouldn't be in our repository.
Common test infrastructure for LMS + CMS
data/ has some test course data.
Once the course validation is separated from django, we should have scripts here that checks that a course consists only of xml that we understand.