The FQS algorithm improves on the quick search (QS) algorithm by combining the bad character rule
with a statistically maximal expected shift value, introduced in this work, and a pre-testing stage
that runs before full pattern matching. Unlike previous approaches, which blindly tested the first and
last symbols of the pattern [20,21], our pre-testing stage tests the symbol at the position with the
statistically maximal expected shift. We compared FQS against three competing algorithms: QS itself,
FJS and the Horspool algorithm. A range of text files was searched, including randomly generated text
files with different alphabet sizes (2 ≤ |Σ| ≤ 256) and practical benchmark text files, namely E. coli,
Bible and World192, from the Canterbury Corpus. Nineteen pattern lengths were tested, ranging from
10 to 1,000. We find that, statistically, FQS has the best overall performance (practical running
time, number of symbol comparisons and number of pattern shifts) of the four algorithms,
most notably for text files with alphabet sizes smaller than 128. The results suggest that FQS could have
important applications in practice, especially for genomic data sets, such as DNA or RNA sequences with
four symbols or protein sequences with 20 symbols.
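
For orientation, the baseline that FQS builds on can be sketched as follows. This is a minimal Python sketch of plain quick search with its bad character rule, not the FQS implementation itself. The function name `quick_search` and the optional `probe` argument are hypothetical and introduced only for illustration. `probe` stands in for a pre-test position; how FQS derives its statistically maximal expected shift position is not described in this text, so the caller must supply the position here.

```python
def quick_search(text, pattern, probe=None):
    """Return the start index of every occurrence of pattern in text.

    probe is a hypothetical pre-test position inside the pattern; when it is
    given, the full comparison runs only if that single symbol matches.
    """
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []

    # Bad character table for quick search: the shift is keyed on the text
    # symbol immediately to the right of the current window. Later indices
    # overwrite earlier ones, so the rightmost occurrence wins.
    shift = {symbol: m - i for i, symbol in enumerate(pattern)}

    matches = []
    pos = 0
    while pos <= n - m:
        pretest_ok = probe is None or text[pos + probe] == pattern[probe]
        if pretest_ok and text[pos:pos + m] == pattern:
            matches.append(pos)
        if pos + m >= n:
            break  # no symbol to the right of the window
        # Symbols absent from the pattern allow the maximal shift m + 1.
        pos += shift.get(text[pos + m], m + 1)
    return matches
```

For example, `quick_search("GCATCGCAGAGAGTATACAGTACG", "GCAGAGAG")` locates the single occurrence of the pattern in a DNA-like text over a four-symbol alphabet. Passing any valid `probe` changes only how much work is done per window, not the set of matches reported.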
