Contact
CoCalc Logo Icon
StoreFeaturesDocsShareSupport News AboutSign UpSign In
| Download

Stylometry_exercise

Views: 924

Project 1

Stylometry

In this folder you will see two notebook files:

  1. proj1.ipynb

  2. string_processing_examples.ipynb

The only reason string_processing_examples.ipynb appears is to provide examples of the kind of things you need to do to in order to solve the problems presented in this project. You can look through that file after you get started for inspiration on what to do.

The other file proj1.ipynb contains code that implements a way of measuring the similarity between two text documents. Some crucial parts of the implementation have been removed. You have to put them back in.

This will be made easier by the fact that you are given the correct output, just not all of the code that produced it.

I have also included a "frozen" version of proj1.ipynb, namely proj1.html. This is just a non-modifiable version of proj1.ipynb which you can refer back to as you overwrite some output in proj1.ipynb in the process of solving the problems.

All you have to do is "fix" proj1.ipynb so that the outputs are roughly the same as what they are in proj1.html. You can leave your completed worksheet in this directory and it will be automatically collected.