Highlight generation We generated h

Highlight generation We generated highlights for a test set of 600 documents. We created and
570 solved an ILP for each document. Sentences were first tokenized to separate words and punctuation,then parsed to obtain phrases and dependencies as described in Section 4 using the Stanford parser (Klein and Manning, 2003). For each phrase, features were extracted and salience scores calculated from the feature weights determined through SVM training. The distance from the SVM hyperplane represents the salience score. The ILP model (see Equation (1)) was parametrized as follows:the maximum number of highlights NS was 4,the overall limit on length LT was 75 tokens, the length of each highlight was in the range of [8,28]tokens, and the topic coverage set T contained the top 5 tf.idf words. These parameters were chosen to capture the properties seen in the majority of the training set; they were also relaxed enough to
allow a feasible solution of the ILP model (with hard constraints) for all the documents in the test set. To solve the ILP model we used the ZIB Optimization Suite software (Achterberg, 2007; Koch,2004; Wunderling, 1996). The solution was converted into highlights by concatenating the chosen leaf nodes in order. The ILP problems we created had on average 290 binary variables and 380 constraints.The mean solve time was 0.03 seconds

0/5000

Từ: -

Sang: -

Kết quả (Việt) 1: [Sao chép]

Sao chép!

Highlight generation We generated highlights for a test set of 600 documents. We created and570 solved an ILP for each document. Sentences were first tokenized to separate words and punctuation,then parsed to obtain phrases and dependencies as described in Section 4 using the Stanford parser (Klein and Manning, 2003). For each phrase, features were extracted and salience scores calculated from the feature weights determined through SVM training. The distance from the SVM hyperplane represents the salience score. The ILP model (see Equation (1)) was parametrized as follows:the maximum number of highlights NS was 4,the overall limit on length LT was 75 tokens, the length of each highlight was in the range of [8,28]tokens, and the topic coverage set T contained the top 5 tf.idf words. These parameters were chosen to capture the properties seen in the majority of the training set; they were also relaxed enough toallow a feasible solution of the ILP model (with hard constraints) for all the documents in the test set. To solve the ILP model we used the ZIB Optimization Suite software (Achterberg, 2007; Koch,2004; Wunderling, 1996). The solution was converted into highlights by concatenating the chosen leaf nodes in order. The ILP problems we created had on average 290 binary variables and 380 constraints.The mean solve time was 0.03 seconds

đang được dịch, vui lòng đợi..

Kết quả (Việt) 2:[Sao chép]

Sao chép!

Thế hệ Highlight Chúng tôi tạo điểm nhấn cho một tập kiểm tra 600 văn bản. Chúng tôi tạo ra và
570 giải quyết một ILP cho mỗi tài liệu. Câu đầu tiên đã được tokenized đến lời nói và dấu chấm câu riêng biệt, sau đó phân tích cú pháp để có được cụm từ và phụ thuộc như được mô tả trong phần 4 bằng cách sử dụng phân tích cú pháp Stanford (Klein và Manning, 2003). Đối với mỗi cụm từ, các tính năng được trích xuất và điểm nổi bật tính toán từ các trọng số tính năng xác định thông qua đào tạo SVM. Khoảng cách từ các siêu phẳng SVM đại diện cho số điểm nổi bật. Các mô hình ILP (xem phương trình (1)) được parametrized như sau: số lượng tối đa những điểm nổi bật NS là 4, giới hạn chiều dài tổng thể dài hạn là 75 thẻ, độ dài của mỗi nổi bật là trong khoảng [8,28] tokens và phạm vi bảo hiểm chủ đề thiết lập T chứa 5 từ tf.idf đầu. Những thông số đã được lựa chọn để chụp các tài sản nhìn thấy trong đa số các tập huấn luyện; họ cũng đã được nới lỏng, đủ để
cho phép một giải pháp khả thi của mô hình ILP (với những hạn chế cứng) cho tất cả các tài liệu trong tập kiểm tra. Để giải quyết các mô hình ILP chúng tôi sử dụng các phần mềm ZIB Optimization Suite (Achterberg, 2007; Koch, 2004; Wunderling, 1996). Giải pháp đã được chuyển đổi thành điểm nổi bật bằng cách kết hợp các nút lá được chọn theo thứ tự. Các vấn đề ILP chúng tôi tạo ra có trung bình 290 biến nhị phân và 380 constraints.The có nghĩa là giải quyết được thời gian 0,03 giây

đang được dịch, vui lòng đợi..

Kết quả (Việt) 3:[Sao chép]

Sao chép!

đang được dịch, vui lòng đợi..

Các ngôn ngữ khác

Hỗ trợ công cụ dịch thuật: Albania, Amharic, Anh, Armenia, Azerbaijan, Ba Lan, Ba Tư, Bantu, Basque, Belarus, Bengal, Bosnia, Bulgaria, Bồ Đào Nha, Catalan, Cebuano, Chichewa, Corsi, Creole (Haiti), Croatia, Do Thái, Estonia, Filipino, Frisia, Gael Scotland, Galicia, George, Gujarat, Hausa, Hawaii, Hindi, Hmong, Hungary, Hy Lạp, Hà Lan, Hà Lan (Nam Phi), Hàn, Iceland, Igbo, Ireland, Java, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Kurd, Kyrgyz, Latinh, Latvia, Litva, Luxembourg, Lào, Macedonia, Malagasy, Malayalam, Malta, Maori, Marathi, Myanmar, Mã Lai, Mông Cổ, Na Uy, Nepal, Nga, Nhật, Odia (Oriya), Pashto, Pháp, Phát hiện ngôn ngữ, Phần Lan, Punjab, Quốc tế ngữ, Rumani, Samoa, Serbia, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenia, Somali, Sunda, Swahili, Séc, Tajik, Tamil, Tatar, Telugu, Thái, Thổ Nhĩ Kỳ, Thụy Điển, Tiếng Indonesia, Tiếng Ý, Trung, Trung (Phồn thể), Turkmen, Tây Ban Nha, Ukraina, Urdu, Uyghur, Uzbek, Việt, Xứ Wales, Yiddish, Yoruba, Zulu, Đan Mạch, Đức, Ả Rập, dịch ngôn ngữ.