亚洲精品久久久久久一区二区_99re热久久这里只有精品34_久久免费高清视频_一区二区三区不卡在线视频

Home
Letters to Editor
Domestic
World
Business & Trade
Culture & Science
Travel
Society
Government
Opinions
Policy Making in Depth
People
Investment
Life
Books/Reviews
News of This Week
Learning Chinese
Novel Way to Identify Author of Text

That notable quotable is instantly recognizable among people as a line from Shakespeare's Hamlet. But soon, even computers may be able to automatically identify strings of text with their appropriate authors -- and by using a free computer program already available on the Internet.

According to a report to be published in the Physical Review Letters magazine, researchers at La Sapienza University in Rome have found that a computer file compression program called Gzip provided an unusual means of analyzing strings of data.

Typically, computer compression programs such as Gzip shrink large computer files -- text files, for instance -- by searching for repetitive strings of information. By finding and identifying those patterns, the compression program can reduce the original file to a smaller one that contains just the basic "building blocks" of data and instructions on how to use those blocks to recreate the original, larger file.

But Emanuele Caglioti, an associate professor of mathematics at the university and one of the report's authors says that the program's compression process is also the key that helps identify files of unknown data.

When a program such as Gzip shrinks or "zips" a file, "it is learning something about the file," says Caglioti. Specifically, it is learning the file's so-called entropy, or the minimum number of bits needed to encode the file. Files of similar content would share similar entropies since they share the same common "building blocks."

"If you zip a file -- say one composed of English text -- while [the Gzip program] is reading the file, it's learning the statistics of English," says Caglioti. "The more it reads it, the more it can compress it." And adding additional English files wouldn't produce a great change in the file's size since the basic pattern -- its entropy -- is already known.

But, if the second file turns out to be Italian, Caglioti says the process has to start all over again and a new entropy is created. "It has to learn [the] Italian," says Caglioti. And "This effort has a cost in terms of bits. It takes more space to incorporate the Italian file because it's a different language."

And Caglioti and his team of researchers discovered that this same process and principle can be used to "identify' works by author. In their research, the Italian scientists collected 90 texts by 11 Italian authors and in 93 percent of the cases; the method correctly matched small text samples with the authors.

"It's pretty clever what they did," said James Riordon, a physicist with the American Institute of Physics, the group that publishes the Physical Review Letters. "Effectively, it's like you're training someone in a language to identify it."

And Caglioti say that there's no reason to believe that the compression process couldn't be used in other means. "Aside from text recognition, it can be used to compare Web pages and find ones that are similar," he says. In addition to creating a better Web search engine, Caglioti notes, "there is the challenge of biological DNA sequencing." He said genetic researchers have already reported in Bioinformatics of using similar zipper approaches to map the human genome.

Mark Adler, the programmer who co-created Gzip in early 1990 as an alternative to other file compression programs, said he was surprised someone had used his program in such a manner. "It is impressive and a little surprising that simply comparing the length of the compressed output from concatenated known and unknown text provides such high accuracy," he says.

But he remains skeptical that the Italians' research paves the way to foolproof text identifiers -- at least until more studies are done.

"At some point using entropy as a measure may not be fine enough to distinguish between authors with similar styles or use of words and phrases," he says. "I'd wonder how well it would work for author recognition if you tried to distinguish between a thousand authors instead of a dozen."

"Up to now, this is more theoretical than practical," Caglioti conceeds. But he says he and his team will continue to work with the program and see what else turns up. "We ought to try and see where it can work."

(China Daily January 31, 2002)

Copyright ? China Internet Information Center. All Rights Reserved
E-mail: webmaster@china.org.cn Tel: 86-10-68996214/15/16
亚洲精品久久久久久一区二区_99re热久久这里只有精品34_久久免费高清视频_一区二区三区不卡在线视频
欧美一区1区三区3区公司| 欧美大片第1页| 亚洲国产视频一区二区| 亚洲欧美在线高清| 一本色道**综合亚洲精品蜜桃冫| 在线精品视频在线观看高清| 国产一区激情| 国产三级欧美三级日产三级99| 国产精品久久久久9999| 欧美色网在线| 欧美日韩亚洲一区二区三区在线观看| 欧美激情小视频| 欧美高清视频一区| 欧美精品videossex性护士| 欧美国产成人在线| 欧美日本视频在线| 欧美日韩网址| 欧美亚韩一区| 国产精品久久久久99| 国产精品免费一区二区三区在线观看 | 欧美日韩精品二区| 欧美人与禽性xxxxx杂性| 欧美日本在线视频| 欧美视频在线观看免费网址| 欧美午夜理伦三级在线观看| 欧美视频一区二| 国产精品观看| 国产日本亚洲高清| 一色屋精品视频在线观看网站| 亚洲成人在线| 亚洲精品偷拍| 亚洲女爱视频在线| 久久大香伊蕉在人线观看热2| 亚洲欧洲精品一区二区| 日韩一区二区免费高清| 亚洲色图在线视频| 亚洲欧美在线视频观看| 久久精品日韩一区二区三区| 蜜臀av国产精品久久久久| 欧美成人一区二区三区片免费| 欧美理论电影在线观看| 国产精品久久久久久久久借妻| 国产丝袜一区二区| 伊人久久大香线蕉综合热线 | 亚洲精品日韩在线观看| 亚洲午夜国产成人av电影男同| 午夜精品久久久久| 亚洲国产精品一区制服丝袜| 一区二区免费在线播放| 欧美在线观看视频一区二区| 老司机精品视频一区二区三区| 欧美精彩视频一区二区三区| 国产精品乱码一区二三区小蝌蚪 | 在线成人h网| 日韩一本二本av| 午夜精品国产更新| 亚洲美女尤物影院| 午夜精品三级视频福利| 狼狼综合久久久久综合网 | 亚洲精品一区二区三区av| 亚洲午夜三级在线| 亚洲国产综合在线| 午夜精品久久久久久久99水蜜桃| 久久色在线观看| 欧美午夜免费电影| 在线精品一区二区| 亚洲视频成人| 最新日韩在线| 久久国产精品久久w女人spa| 欧美激情精品久久久久久大尺度 | 亚洲欧美日韩国产另类专区| 亚洲美女少妇无套啪啪呻吟| 欧美一区免费| 欧美日韩视频在线观看一区二区三区| 国内在线观看一区二区三区| av成人毛片| 最近中文字幕日韩精品| 欧美一区二区三区精品| 欧美日韩国产欧美日美国产精品| 国产在线麻豆精品观看| 宅男精品视频| 日韩视频一区二区在线观看| 久久精品亚洲精品国产欧美kt∨| 欧美精品免费在线观看| 国内精品久久久久伊人av| 这里只有精品电影| 亚洲欧洲精品一区二区三区| 欧美一乱一性一交一视频| 欧美日韩国产小视频| 精品成人国产| 羞羞视频在线观看欧美| 亚洲自拍高清| 欧美日韩亚洲天堂| 亚洲国产精品一区制服丝袜| 欧美在线观看网站| 欧美一区2区视频在线观看| 欧美日韩在线第一页| 亚洲国产精品嫩草影院| 亚洲二区免费| 久久精品视频在线免费观看| 国产精品久久影院| 一区二区国产日产| 亚洲私人影院在线观看| 欧美另类专区| 亚洲人体影院| 亚洲人成小说网站色在线| 久久一区中文字幕| 国内精品久久久久影院薰衣草| 亚洲在线观看视频网站| 亚洲欧美日本伦理| 国产精品高潮呻吟久久av黑人| 99亚洲一区二区| 99精品国产热久久91蜜凸| 欧美成人午夜激情| 亚洲国产精品免费| 亚洲精品乱码久久久久| 欧美va亚洲va国产综合| 亚洲大片精品永久免费| 亚洲区一区二区三区| 欧美成人午夜视频| 亚洲国产另类久久久精品极度| 亚洲大胆av| 老牛国产精品一区的观看方式| 禁断一区二区三区在线| 亚洲第一主播视频| 猫咪成人在线观看| 在线免费观看成人网| 亚洲国产日韩欧美| 欧美国产亚洲精品久久久8v| 亚洲国产精品成人一区二区| 亚洲三级影院| 欧美激情视频给我| 日韩一区二区久久| 亚洲一区日韩| 国产精品自拍小视频| 欧美一区在线视频| 美女主播精品视频一二三四| 亚洲国产成人在线播放| 一本在线高清不卡dvd| 欧美体内谢she精2性欧美| 亚洲午夜视频| 久久精品亚洲| 亚洲国产精品999| 一本色道久久综合| 国产精品伦理| 欧美在线视频二区| 可以看av的网站久久看| 亚洲茄子视频| 亚洲欧美在线x视频| 国产主播一区二区三区四区| 亚洲国产99| 欧美人交a欧美精品| 亚洲一区综合| 老司机成人在线视频| 91久久精品www人人做人人爽| 在线一区日本视频| 国产伦精品一区二区三区免费迷 | 久久午夜色播影院免费高清| 亚洲国产成人av在线| 亚洲综合色婷婷| 国产一区二区三区久久精品| 亚洲精品日韩激情在线电影| 欧美视频在线观看视频极品| 欧美亚洲在线观看| 欧美精品 日韩| 亚洲欧美制服另类日韩| 免费在线成人| 亚洲视频一起| 久久亚洲精品一区二区| 日韩亚洲欧美中文三级| 久久精品国产免费观看| 亚洲精品美女在线| 亚欧成人在线| 亚洲国产小视频在线观看| 西西人体一区二区| 在线精品亚洲| 欧美亚洲系列| 亚洲国产精品久久| 欧美在线视频在线播放完整版免费观看| 激情小说另类小说亚洲欧美 | 亚洲专区免费| 亚洲国产精品一区在线观看不卡| 亚洲欧美日韩系列| 亚洲福利视频网站| 久久gogo国模裸体人体| 亚洲精品一区二区三区婷婷月| 久久成人免费网| 一本久久青青| 狂野欧美一区| 午夜日韩视频| 欧美日韩在线另类| 亚洲韩国青草视频| 国产老女人精品毛片久久| 99国产精品久久久久久久| 国产日韩亚洲欧美| 一区二区三区精品国产| 韩国女主播一区| 午夜视频久久久久久| 亚洲人成人一区二区三区| 久久久久久夜|