count2huge.pl man page on DragonFly

Man page or keyword search:  
man Server   44335 pages
apropos Keyword Search (all sections)
Output format
DragonFly logo
[printable version]

COUNT2HUGE(1)	      User Contributed Perl Documentation	 COUNT2HUGE(1)

NAME
       count2huge.pl - Convert the output of count.pl to huge-count.pl.

SYNOPSIS
       count2huge.pl takes the output of count.pl and sorts the bigrams by
       alphabet order. The output bigrams have the same order with the output
       of huge-count.pl

DESCRIPTION
       count2huge.pl convert the output of the count.pl to huge-count.pl for
       the same input text and options. The reason we do this is because for
       the vector relatedness measure of UMLS-Similarity, it requires the
       bigrams which starts with the same term are grouped together. When the
       bigrams are sorted, it could optimize the processing time. For details,
       please see the vector-input.pl of UMLS-Similarity.

       See perldoc count2huge.pl

USGAE
       count2huge.pl [OPTION] SOURCE DESTINATIONDIR

INPUT
   Required Arguments:
       SOURCE

       Input to count2huge.pl is a single flat bigrams list file generated by
       count.pl.

       DESTINATIONDIR

       count2huge.pl sorts the bigrams in the alphabet order. The result file
       count2huge.output is located in the destination directory. It has the
       same result with the output of huge-count.pl which has same text and
       options of count.pl.

       --split N

       huge-split will divide the output bigrmas tokenlist generated by
       count.pl Each part created with --split N will contain N lines. Value
       of N should be chosen such that huge-sort.pl can be efficiently run on
       any part containing N lines from the file contains all bigrams file.

       We suggest that N is equal to the number of KB of memory you have. If
       the computer has 8 GB RAM, which is 8,000,000 KB, N should be set to
       8000000.

       Other Options:

       --help

       Displays the help information.

       --version

       Displays the version information.

AUTHOR
       Ying Liu, University of Minnesota, Twin Cities.	liux0395@umn.edu

COPYRIGHT
       Copyright (C) 2009-2011, Ying Liu

       Ying Liu, University of Minnesota, Twin Cities.	liux0395@umn.edu

       Ted Pedersen, University of Minnesota, Duluth.  tpederse@umn.edu

       This program is free software; you can redistribute it and/or modify it
       under the terms of the GNU General Public License as published by the
       Free Software Foundation; either version 2 of the License, or (at your
       option) any later version.  This program is distributed in the hope
       that it will be useful, but WITHOUT ANY WARRANTY; without even the
       implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR
       PURPOSE.	 See the GNU General Public License for more details.

       You should have received a copy of the GNU General Public License along
       with this program; if not, write to the Free Software Foundation, Inc.,
       59 Temple Place - Suite 330, Boston, MA	02111-1307, USA.

perl v5.20.2			  2011-03-31			 COUNT2HUGE(1)
[top]

List of man pages available for DragonFly

Copyright (c) for man pages and the logo by the respective OS vendor.

For those who want to learn more, the polarhome community provides shell access and support.

[legal] [privacy] [GNU] [policy] [cookies] [netiquette] [sponsors] [FAQ]
Tweet
Polarhome, production since 1999.
Member of Polarhome portal.
Based on Fawad Halim's script.
....................................................................
Vote for polarhome
Free Shell Accounts :: the biggest list on the net