Each web server (and indeed any hos

Each web server (and indeed any host connected to the internet) has a unique IP address: a sequence of four bytes generally represented as four integers separated by dots; for instance 207.142.131.248 is the numerical IP address associated with the host www.wikipedia.org. Given a URL such as www.wikipedia.org in textual form, translating it to an IP address (in this case, 207.142.131.248) is a process known as DNS resolution or DNS lookup;DNS resolution here DNS stands for domain name service. During DNS resolution, the program that wishes to perform this translation(in our case,a component of the web crawler) contacts a DNS server that returns the translated IP address.DNS server (In practice, the entire translation may not occur at a single DNS server; rather, the DNS server contacted initially may recursively call upon other DNS servers to complete the translation.) For a more complex URL such as en.wikipedia.org/wiki/Domain_Name_System , the crawler component responsible for DNS resolution extracts the host name – in this case en.wikipedia.org – and looks up the IP address for the host en.wikipedia.org. DNS resolution is a well-known bottleneck in web crawling. Due to the distributed nature of the domain name service, DNS resolution may entail multiple requests and roundtrips across the Internet, requiring seconds and sometimesevenlonger.Rightaway,this puts in jeopardy our goal of fetching several hundred documents a second. A standard remedy is to introduce caching:URLs for which we have recently performed DNS lookups are likely to be found in the DNS cache, avoiding the need to go to the DNS servers on the Internet. However, obeying politeness constraints (see Section 20.2.3) limits the cache hit rate. There is another important difﬁculty in DNS resolution;the lookup imple mentations in standard libraries (likely to be used by anyone developing a crawler) are generally synchronous. This means that once a request is made to the domain name service, other crawler threads at that node are blocked

until the ﬁrst request is completed. To circumvent this, most web crawlers implement their own DNS resolver as a component of the crawler. Thread i executing the resolver code sends a message to the DNS server and then performs a timed wait: It resumes either when being signaled by another thread or when a set time quantum expires. A single, separate DNS thread listens on the standard DNS port (port 53) for incoming response packets from the name service. Upon receiving a response, it signals the appropriate crawler thread (in this case, i) and hands it the response packet if i has not yet resumed because its time quantum has expired.A crawler thread that resumes because its wait time quantum has expired retries for a ﬁxed number of attempts, sending out a new message to the DNS server and performing a timed wait each time; the designers of Mercator recommend of the order of ﬁve attempts. The time quantum of the wait increases exponentially with each of these attempts; Mercator started with one second and ended with roughly 90 seconds, in consideration of the fact that there are host names that take tens of seconds to resolve.

Each web server (and indeed any host connected to the internet) has a unique IP address: a sequence of four bytes generally represented as four integers separated by dots; for instance 207.142.131.248 is the numerical IP address associated with the host www.wikipedia.org. Given a URL such as www.wikipedia.org in textual form, translating it to an IP address (in this case, 207.142.131.248) is a process known as DNS resolution or DNS lookup;DNS resolution here DNS stands for domain name service. During DNS resolution, the program that wishes to perform this translation(in our case,a component of the web crawler) contacts a DNS server that returns the translated IP address.DNS server (In practice, the entire translation may not occur at a single DNS server; rather, the DNS server contacted initially may recursively call upon other DNS servers to complete the translation.) For a more complex URL such as en.wikipedia.org/wiki/Domain_Name_System , the crawler component responsible for DNS resolution extracts the host name – in this case en.wikipedia.org – and looks up the IP address for the host en.wikipedia.org. DNS resolution is a well-known bottleneck in web crawling. Due to the distributed nature of the domain name service, DNS resolution may entail multiple requests and roundtrips across the Internet, requiring seconds and sometimesevenlonger.Rightaway,this puts in jeopardy our goal of fetching several hundred documents a second. A standard remedy is to introduce caching:URLs for which we have recently performed DNS lookups are likely to be found in the DNS cache, avoiding the need to go to the DNS servers on the Internet. However, obeying politeness constraints (see Section 20.2.3) limits the cache hit rate. There is another important difﬁculty in DNS resolution;the lookup imple mentations in standard libraries (likely to be used by anyone developing a crawler) are generally synchronous. This means that once a request is made to the domain name service, other crawler threads at that node are blocked
 
until the ﬁrst request is completed. To circumvent this, most web crawlers implement their own DNS resolver as a component of the crawler. Thread i executing the resolver code sends a message to the DNS server and then performs a timed wait: It resumes either when being signaled by another thread or when a set time quantum expires. A single, separate DNS thread listens on the standard DNS port (port 53) for incoming response packets from the name service. Upon receiving a response, it signals the appropriate crawler thread (in this case, i) and hands it the response packet if i has not yet resumed because its time quantum has expired.A crawler thread that resumes because its wait time quantum has expired retries for a ﬁxed number of attempts, sending out a new message to the DNS server and performing a timed wait each time; the designers of Mercator recommend of the order of ﬁve attempts. The time quantum of the wait increases exponentially with each of these attempts; Mercator started with one second and ended with roughly 90 seconds, in consideration of the fact that there are host names that take tens of seconds to resolve.

0/5000

Từ: -

Sang: -

Kết quả (Việt) 1: [Sao chép]

Sao chép!

Each web server (and indeed any host connected to the internet) has a unique IP address: a sequence of four bytes generally represented as four integers separated by dots; for instance 207.142.131.248 is the numerical IP address associated with the host www.wikipedia.org. Given a URL such as www.wikipedia.org in textual form, translating it to an IP address (in this case, 207.142.131.248) is a process known as DNS resolution or DNS lookup;DNS resolution here DNS stands for domain name service. During DNS resolution, the program that wishes to perform this translation(in our case,a component of the web crawler) contacts a DNS server that returns the translated IP address.DNS server (In practice, the entire translation may not occur at a single DNS server; rather, the DNS server contacted initially may recursively call upon other DNS servers to complete the translation.) For a more complex URL such as en.wikipedia.org/wiki/Domain_Name_System , the crawler component responsible for DNS resolution extracts the host name – in this case en.wikipedia.org – and looks up the IP address for the host en.wikipedia.org. DNS resolution is a well-known bottleneck in web crawling. Due to the distributed nature of the domain name service, DNS resolution may entail multiple requests and roundtrips across the Internet, requiring seconds and sometimesevenlonger.Rightaway,this puts in jeopardy our goal of fetching several hundred documents a second. A standard remedy is to introduce caching:URLs for which we have recently performed DNS lookups are likely to be found in the DNS cache, avoiding the need to go to the DNS servers on the Internet. However, obeying politeness constraints (see Section 20.2.3) limits the cache hit rate. There is another important difﬁculty in DNS resolution;the lookup imple mentations in standard libraries (likely to be used by anyone developing a crawler) are generally synchronous. This means that once a request is made to the domain name service, other crawler threads at that node are blocked until the ﬁrst request is completed. To circumvent this, most web crawlers implement their own DNS resolver as a component of the crawler. Thread i executing the resolver code sends a message to the DNS server and then performs a timed wait: It resumes either when being signaled by another thread or when a set time quantum expires. A single, separate DNS thread listens on the standard DNS port (port 53) for incoming response packets from the name service. Upon receiving a response, it signals the appropriate crawler thread (in this case, i) and hands it the response packet if i has not yet resumed because its time quantum has expired.A crawler thread that resumes because its wait time quantum has expired retries for a ﬁxed number of attempts, sending out a new message to the DNS server and performing a timed wait each time; the designers of Mercator recommend of the order of ﬁve attempts. The time quantum of the wait increases exponentially with each of these attempts; Mercator started with one second and ended with roughly 90 seconds, in consideration of the fact that there are host names that take tens of seconds to resolve.

đang được dịch, vui lòng đợi..

Kết quả (Việt) 2:[Sao chép]

Sao chép!

Mỗi máy chủ web (và thực sự bất kỳ máy chủ kết nối với internet) có một địa chỉ IP duy nhất: là dãy số bốn byte thường đại diện là bốn số nguyên cách nhau bởi dấu chấm; ví dụ 207.142.131.248 là địa chỉ IP bằng số liên kết với các www.wikipedia.org chủ. Cho một URL như www.wikipedia.org ở dạng văn bản, dịch nó đến một địa chỉ IP (trong trường hợp này, 207.142.131.248) là một quá trình được gọi là độ phân giải DNS hoặc tra cứu DNS, độ phân giải DNS DNS ở đây là viết tắt của dịch vụ tên miền. Trong độ phân giải DNS, các chương trình muốn thực hiện bản dịch này (trong trường hợp của chúng tôi, một thành phần của web crawler) liên lạc một máy chủ DNS trả về máy chủ IP address.DNS dịch (Trong thực tế, toàn bộ dịch thuật có thể không xảy ra tại một đơn máy chủ DNS;. thay, các máy chủ DNS đã liên lạc với ban đầu có thể đệ quy gọi khi máy chủ DNS khác để hoàn thành bản dịch) Đối với một URL phức tạp hơn như en.wikipedia.org/wiki/Domain_Name_System, thành phần bánh xích trách nhiệm phân giải DNS chiết xuất từ các máy chủ tên - trong trường hợp này en.wikipedia.org - và trông lên các địa chỉ IP cho các máy chủ en.wikipedia.org. Độ phân giải DNS là một nút cổ chai nổi tiếng trong web bò. Do tính chất phân phối của các dịch vụ tên miền, độ phân giải DNS có thể kéo theo nhiều yêu cầu và roundtrips qua mạng Internet, đòi hỏi giây và sometimesevenlonger.Rightaway, điều này đặt vào nguy hiểm mục tiêu của chúng tôi lấy vài trăm tài liệu một lần thứ hai. Một biện pháp khắc phục tiêu chuẩn là để giới thiệu bộ nhớ đệm: URL mà chúng tôi đã thực hiện gần đây tra cứu DNS có khả năng được tìm thấy trong bộ nhớ cache DNS, tránh sự cần thiết để đi đến các máy chủ DNS trên Internet. Tuy nhiên, tuân theo các ràng buộc lịch sự (xem phần 20.2.3) giới hạn tốc độ hit cache. Có một quan trọng gặp khó khăn ở độ phân giải DNS; các mentations tra cứu imple trong thư viện tiêu chuẩn (có khả năng được sử dụng bởi bất cứ ai phát triển một trình thu thập) nói chung là đồng bộ. Điều này có nghĩa rằng một khi một request được gửi đến các dịch vụ tên miền, bài xích khác tại nút bị chặn cho đến khi yêu cầu fi đầu tiên được hoàn thành. Để phá vỡ này, hầu hết các trình thu thập web thực hiện DNS resolver riêng của họ như một thành phần của bánh xích. Gởi i thực hiện các mã resolver sẽ gửi một thông điệp tới các máy chủ DNS và sau đó thực hiện một chờ đợi theo thời gian: Nó lại tiếp tục, hoặc khi được báo hiệu bởi một sợi hoặc khi một lượng tử thời gian đặt hết hạn. Một đơn, sợi DNS riêng biệt lắng nghe trên các cổng tiêu chuẩn DNS (port 53) cho gói tin phản hồi đến từ các dịch vụ tên. Khi nhận được phản hồi thì tín hiệu sợi xích thích hợp (trong trường hợp này, tôi) và đưa cho nó gói tin trả lời nếu tôi vẫn chưa trở lại vì lượng tử thời gian của mình có expired.A sợi xích đó lại tiếp tục vì lượng tử thời gian chờ đợi của nó đã hết hạn thử lại cho một số cố định fi nỗ lực, việc gửi đi một thông điệp mới đến máy chủ DNS và thực hiện một timed chờ đợi từng thời gian; các nhà thiết kế của Mercator khuyên của thứ tự của fi đã nỗ lực. Thời gian lượng tử của sự gia tăng theo cấp số nhân với nhau chờ đợi của những nỗ lực này; Mercator bắt đầu với một thứ hai và kết thúc với khoảng 90 giây, trong việc xem xét thực tế là có những tên máy chủ mà phải mất vài chục giây để giải quyết.

đang được dịch, vui lòng đợi..

Kết quả (Việt) 3:[Sao chép]

Sao chép!

đang được dịch, vui lòng đợi..

Các ngôn ngữ khác

Hỗ trợ công cụ dịch thuật: Albania, Amharic, Anh, Armenia, Azerbaijan, Ba Lan, Ba Tư, Bantu, Basque, Belarus, Bengal, Bosnia, Bulgaria, Bồ Đào Nha, Catalan, Cebuano, Chichewa, Corsi, Creole (Haiti), Croatia, Do Thái, Estonia, Filipino, Frisia, Gael Scotland, Galicia, George, Gujarat, Hausa, Hawaii, Hindi, Hmong, Hungary, Hy Lạp, Hà Lan, Hà Lan (Nam Phi), Hàn, Iceland, Igbo, Ireland, Java, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Kurd, Kyrgyz, Latinh, Latvia, Litva, Luxembourg, Lào, Macedonia, Malagasy, Malayalam, Malta, Maori, Marathi, Myanmar, Mã Lai, Mông Cổ, Na Uy, Nepal, Nga, Nhật, Odia (Oriya), Pashto, Pháp, Phát hiện ngôn ngữ, Phần Lan, Punjab, Quốc tế ngữ, Rumani, Samoa, Serbia, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenia, Somali, Sunda, Swahili, Séc, Tajik, Tamil, Tatar, Telugu, Thái, Thổ Nhĩ Kỳ, Thụy Điển, Tiếng Indonesia, Tiếng Ý, Trung, Trung (Phồn thể), Turkmen, Tây Ban Nha, Ukraina, Urdu, Uyghur, Uzbek, Việt, Xứ Wales, Yiddish, Yoruba, Zulu, Đan Mạch, Đức, Ả Rập, dịch ngôn ngữ.