Skip to content

Commit

Permalink
paper: rename main tex file.
Browse files Browse the repository at this point in the history
* fix grammar
* set output of generated tikz file to tmp directory.
  • Loading branch information
shuLhan committed Jun 19, 2016
1 parent 7e9604a commit 3526252
Show file tree
Hide file tree
Showing 5 changed files with 17 additions and 12 deletions.
File renamed without changes.
2 changes: 1 addition & 1 deletion paper/diagram_process.tex
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
\begin{figure}[tp!]
\centering
\resizebox{0.3\textwidth}{!} {
\tikzsetnextfilename{diagramprocess}
\mytikzinput{diagramprocess}
\begin{tikzpicture}[
framed,
nodes = {
Expand Down
18 changes: 8 additions & 10 deletions paper/introduction.tex
Original file line number Diff line number Diff line change
Expand Up @@ -24,17 +24,15 @@
assume that class distribution is balanced, while in real world cases this
rarely happened.

RF has the disadvantage in their the computation time especially when training
the classification model.
For a large dataset with more than 10,000 samples (like the PAN-WVC-10 cases)
this could lead to hours of training time.
One of the solution is by using Cascaded Random Forest (CRF) framework proposed
by Bauman et al.
\cite{baumann2013cascaded}.
Their paper state that CRF give a fast training model time and increased
performance compared to RF.
Random Forest (RF) has the disadvantages in their the computation time
especially when training the classification model. For a large dataset with
more than 10,000 samples (like the PAN-WVC-10 cases) this could lead to hours
of training time. One of the solution is by using Cascaded Random Forest (CRF)
framework proposed by Bauman et al. \cite{baumann2013cascaded}. Their paper
state that CRF give a fast training model time and increased performance
compared to RF.

This paper attempt to overcome the dataset imbalance problem on PAN-WVC-10 by
This paper attempts to overcome the dataset imbalance problem on PAN-WVC-10 by
applying resample and classifier technique that has never been used before on
the dataset.
The PAN-WVC-10 dataset is resampled using Local Neighborhood SMOTE (LNSMOTE)
Expand Down
2 changes: 1 addition & 1 deletion paper/latexmk.sh
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
#!/bin/sh

latexmk -pvc -view=pdf Muhamad_Sulhan_-_Detecting_vandalism_on_English_Wikipedia_using_CRF.tex
latexmk -pvc -view=pdf 23513014_Muhamad_Sulhan_-_Paper.tex
7 changes: 7 additions & 0 deletions paper/preamble.tex
Original file line number Diff line number Diff line change
Expand Up @@ -47,6 +47,13 @@
\usetikzlibrary{backgrounds, shapes.geometric, positioning, patterns, external}
\tikzexternalize

%% Make tikz generate PDF file to .tmp directory
\makeatletter
\newcommand{\mytikzinput}[1]{%
\tikzsetnextfilename{tmp/#1}%
}
\makeatother

%% Pgfplots
\usepackage{pgfplots}

Expand Down

0 comments on commit 3526252

Please sign in to comment.