1. INTRODUCTIONItemset mining algor

1. INTRODUCTION
Itemset mining algorithms are probably the most well known algorithms in the field of frequent pattern mining. Many efficient solutions have been developed for this relatively simple class of patterns. While the task of mining frequent itemsets in a single relation is well studied, only a few solutions exist for mining frequent itemsets in arbitrary relational databases, which typically have more than one relation [4, 5, 9, 10]. These methods consider a relational itemset to be a set of items, where each item is an attribute value pair, belonging to one or more relations in the database. In order for two or more items from different relations to be in the same itemset, they must be connected. Two items are considered to be connected if there exists a join of their two relations in the database that connects them. In general, an itemset is said to occur in the database, if there exists a tuple in a join of the relations, which contains the itemset. In this paper we also adopt this notion of occurrence.
A good definition of a unit in which the support of a pattern is expressed i.e. what is being counted is a primary requirement to mine any type of frequent pattern. In existing works on relational itemset mining [4, 9, 10], the frequency of an itemset over multiple relations is expressed in the number of occurrences in a join of the database’s relations. However, this definition of itemset support is hard to interpret, because it heavily depends on how well the items in the set are connected. In this paper, we assume that key dependencies are specified in the relational scheme of the input database. We determine the support of an itemset by counting unique key values in the tuples where the itemset occurs. Consider the relational database in Figure 1, which we will use as a run- ning example throughout the paper. For this database, the
keys to be used are {Professor.PID, Course.CID, Student.SID, Study.YID}. This new support counting technique allows for
interpretable frequent itemsets, as it goes without saying that
itemsets frequent in Professor.PID have different semantics than itemsets frequent in Course.CID. This approach permits an efficient depth first algorithm that generates interesting frequent relational itemsets that are easy to understand.

0/5000

Từ: -

Sang: -

Kết quả (Việt) 1: [Sao chép]

Sao chép!

1. giới thiệu
Itemset khai thác mỏ thuật toán có thể là các thuật toán nổi tiếng nhất trong lĩnh vực khai thác mỏ mô hình thường xuyên. Nhiều giải pháp hiệu quả đã được phát triển cho lớp này tương đối đơn giản của mô hình. Trong khi nhiệm vụ khai thác mỏ itemsets thường xuyên trong một mối quan hệ duy nhất nghiên cứu, chỉ một vài giải pháp tồn tại cho khai thác mỏ itemsets thường xuyên ở bất kỳ quan hệ cơ sở dữ liệu, mà thường có nhiều hơn một mối quan hệ [4, 5, 9, 10]. Những phương pháp này xem xét một itemset quan hệ là một tập hợp các mặt hàng, nơi mỗi mục là một cặp giá trị thuộc tính, thuộc về quan hệ một hoặc nhiều trong cơ sở dữ liệu. Để cho hai hay một số mục từ quan hệ khác nhau để trong itemset tương tự, họ phải được kết nối. Hai bài được coi là được kết nối nếu có tồn tại một tham gia của quan hệ hai trong cơ sở dữ liệu kết nối chúng. Nói chung, một itemset nói để xảy ra trong cơ sở dữ liệu, nếu có tồn tại một tuple trong một tham gia của các mối quan hệ, chứa itemset. Trong bài này chúng tôi cũng áp dụng khái niệm này của sự xuất hiện.
A định nghĩa tốt của một đơn vị trong đó sự hỗ trợ của một mô hình được thể hiện tức là những gì được tính là một yêu cầu chính để khai thác bất kỳ loại mô hình thường xuyên. Trong tác phẩm hiện tại trên quan hệ itemset khai thác [4, 9, 10], tần số của một itemset qua nhiều quan hệ được thể hiện trong số lần xuất hiện trong một tham gia của cơ sở dữ liệu quan hệ. Tuy nhiên, định nghĩa này hỗ trợ itemset là khó để giải thích, bởi vì rất nhiều phụ thuộc vào tốt như thế nào các mục trong các thiết lập được kết nối. Trong bài này, chúng tôi giả định rằng quan trọng phụ thuộc được xác định trong các đề án quan hệ cơ sở dữ liệu đầu vào. Chúng tôi xác định sự hỗ trợ của một itemset bằng cách đếm duy nhất quan trọng giá trị trong tuples nơi itemset xảy ra. Xem xét cơ sở dữ liệu quan hệ trong hình 1, chúng tôi sẽ sử dụng như là một ví dụ chạy-ning trong suốt giấy. Cơ sở dữ liệu này, các
phím được sử dụng là {Professor.PID, Course.CID, Student.SID, Study.YID}. Này hỗ trợ mới đếm kỹ thuật cho phép
interpretable itemsets thường xuyên, vì nó đi mà không nói rằng
itemsets thường xuyên trong Professor.PID có ngữ nghĩa khác nhau hơn itemsets thường xuyên trong Course.CID. Cách tiếp cận này cho phép một thuật toán hiệu quả sâu đầu tiên tạo ra thú vị itemsets quan hệ thường xuyên được dễ dàng để hiểu.

đang được dịch, vui lòng đợi..

Kết quả (Việt) 2:[Sao chép]

Sao chép!

đang được dịch, vui lòng đợi..

Kết quả (Việt) 3:[Sao chép]

Sao chép!

đang được dịch, vui lòng đợi..

Các ngôn ngữ khác

Hỗ trợ công cụ dịch thuật: Albania, Amharic, Anh, Armenia, Azerbaijan, Ba Lan, Ba Tư, Bantu, Basque, Belarus, Bengal, Bosnia, Bulgaria, Bồ Đào Nha, Catalan, Cebuano, Chichewa, Corsi, Creole (Haiti), Croatia, Do Thái, Estonia, Filipino, Frisia, Gael Scotland, Galicia, George, Gujarat, Hausa, Hawaii, Hindi, Hmong, Hungary, Hy Lạp, Hà Lan, Hà Lan (Nam Phi), Hàn, Iceland, Igbo, Ireland, Java, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Kurd, Kyrgyz, Latinh, Latvia, Litva, Luxembourg, Lào, Macedonia, Malagasy, Malayalam, Malta, Maori, Marathi, Myanmar, Mã Lai, Mông Cổ, Na Uy, Nepal, Nga, Nhật, Odia (Oriya), Pashto, Pháp, Phát hiện ngôn ngữ, Phần Lan, Punjab, Quốc tế ngữ, Rumani, Samoa, Serbia, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenia, Somali, Sunda, Swahili, Séc, Tajik, Tamil, Tatar, Telugu, Thái, Thổ Nhĩ Kỳ, Thụy Điển, Tiếng Indonesia, Tiếng Ý, Trung, Trung (Phồn thể), Turkmen, Tây Ban Nha, Ukraina, Urdu, Uyghur, Uzbek, Việt, Xứ Wales, Yiddish, Yoruba, Zulu, Đan Mạch, Đức, Ả Rập, dịch ngôn ngữ.