aclImdb is a small imdb movie review dataset, which is good choice to build an experimental model for sentiment analysis. In this tutorial, we will introduce some basci feartures for sentiment analysis beginners.
aclImdb dataset contains some small txt files, we have combined them to a big txt file (aclImdb-all.txt). In this tutorial, we will introduce you how to split this file to train, test and validation set for model learning.
In this tutorial, we will introduce random.sample() function, which will capture some random elements from a python sequence. It is very helpful to generate a randomized sequence.
In python, we can use random.shuffle() to randomize all elements in a sequence. However, to use this function correctly, there are some notices you should concern. In this tutorial, we will discuss this function.
There are 50,000 labeled text files in aclImdb dataset, in order to use this dataset easily, we should combine these small text files to a big one. In this tutorial, we will introduce you how to do.
Sometimes, we need combine some text files into one file to read and process. We can implement this function easily in python. In this tutorial, we will illustrate python beginners how to do.
To sort a python list, we can use list sort() and sorted() function. What is the difference between them? In this tutorial, we will discuss this topic.
Sometimes, you may find the eclipse console is not displayed on its main window. If you want to make it attach to the main window of eclipse, this tutorial will help you.
After you have started jupyter notebook, it will load a directory as its workspace. How to change this working directory? In this tutorial, we will tell python beginners how to do.
Jupyter notebook is a very nice tool to analysis data using python. In this tutorial, we will use some steps to show python beginners how to install it and run python script.