Talk:Data mining
This is the talk page for discussing improvements to the Data mining article. This is not a forum for general discussion of the article's subject. |
Article policies
|
Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
This level-5 vital article is rated B-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Material from Data mining was split to Examples of data mining on 2015-09-21. The former page's history now serves to provide attribution for that content in the latter page, and it must not be deleted so long as the latter page exists. Please leave this template in place to link the article histories and preserve this attribution. |
1, 2, 3 |
This page has archives. Sections older than 31 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present. |
This article is substantially duplicated by a piece in an external publication. Since the external publication copied Wikipedia rather than the reverse, please do not flag this article as a copyright violation of the following sources:
|
Merge Analytics into Data mining
edit- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- To not merge given that there are different topics with different sets of readers; readers are best served by keeping the pages separate. Klbrain (talk) 17:58, 9 November 2022 (UTC)
The jargon "analytics", which could mean any sort of analysis whatsoever, appears to be trying to monopolise a generic English word for a very specific topic, hiding the real meaning. I think that any useful content there could be merged either into this article, data mining, or maybe rather Examples of data mining, since the content seems to be more about specific examples of data mining, under the name "analytics", rather than about the methods of data mining themselves. Another possible target for a merger would be Data analysis. In any case, Analytics is clearly quite poor quality currently, full of businessy jargon trying to pretend that as long as enough people follow the fashion, you can pretend that there's some new meaning there. Please state support or oppose in bold; if support, then please state which target article (data mining, examples of data mining, or data analysis) you recommend for the merger of analytics (we should in principle add {{merge from}} templates to those target articles too...) Boud (talk) 15:03, 30 June 2022 (UTC)
- Support merging the Analytics#Applications section with Examples of data mining into a more encyclopedic Applications of data mining. I think the subsections there would complement the disjointed lists of examples making up Examples of data mining currently.
- Then I would redirect Analytics to Data analysis. I think that the Analytics buzzword is intended to have a broader scope than Data mining (at least according to my reading of the convoluted Analytics#Analytics vs analysis section), which makes Data analysis a better redirect target. Felix QW (talk) 09:56, 2 July 2022 (UTC)
- Oppose It sounds like you are POV-pushing against the use of business jargon. It's better to remain neutral and just summarize the sources. Analytics is a broad term according to Gartner[1], but it is always almost used in a business context. My guess (I haven't performed a proper WP:BEFORE search) is that there is probably enough sourcing out there for a standalone article on the topic. But if editors come to consensus for a merge, then analytics is one aspect of Business intelligence and is better merged there. --
{{u|Mark viking}} {Talk}
23:08, 3 July 2022 (UTC)- I am certainly pushing in favour of Wikipedia being an encylopaedia, with entries of knowledge about the real world. Words that are common but meaningless make more sense in the Wiktionary. Boud (talk) 20:33, 3 July 2022 (UTC)
- It seems you have contempt for the business world and its jargon, and that is compromising your objectivity with respect to this topic. The encyclopedia is better served by editors summarizing reliable sources, not injecting their personal opinions into article content. Show some reliable sources that say analytics is meaningless and those that coined the term are nefariously trying to monopolize an English word, and we could add that criticism to the article. But it's also clear that there is a population of business folk, such as business analysts and consultants, who use this term and find it useful for charactering various forms of business intelligence. Summarizing their approach using RS is the best approach to developing this article. --
{{u|Mark viking}} {Talk}
23:08, 3 July 2022 (UTC)- Hostility towards business sources does make sense to me (at least hostility towards ONLY using business sources, which is currently the case in the data analytics article). Data analysis is a broad scientific field and should not be defined solely by people who have a clear incentive to generate novel definitions at cost of sensibility. 98.43.49.101 (talk) 20:02, 21 October 2022 (UTC)
- It seems you have contempt for the business world and its jargon, and that is compromising your objectivity with respect to this topic. The encyclopedia is better served by editors summarizing reliable sources, not injecting their personal opinions into article content. Show some reliable sources that say analytics is meaningless and those that coined the term are nefariously trying to monopolize an English word, and we could add that criticism to the article. But it's also clear that there is a population of business folk, such as business analysts and consultants, who use this term and find it useful for charactering various forms of business intelligence. Summarizing their approach using RS is the best approach to developing this article. --
- I am certainly pushing in favour of Wikipedia being an encylopaedia, with entries of knowledge about the real world. Words that are common but meaningless make more sense in the Wiktionary. Boud (talk) 20:33, 3 July 2022 (UTC)
- Oppose as the term "analytics" has evolved into a catch-all term for analysis of information. It seems that any person who applies information analysis to their domain can be considered to be doing "analytics." Although analytics has increasingly moved towards a role of describing statistical information in the business sphere, [2] it would not make sense as an "example of data mining" as all processes of data mining would include techniques in analytics. Meanwhile, IT and business users share interest in analytics departments, which might best make sense as an aspect of Business Intelligence, as @Mark viking mentioned. ZacharyWalkerPinto (talk) 15:58, 4 July 2022 (UTC)
- Oppose, the article "data mining" is more on the academic use; analytics is business jargon. Or to put it differently, one is about the methodology, the other about the business purpose, and the third is on example applications. One of the reasons to create examples of data mining in the first place was to make the article less crowded with a rather useless list of examples (that attracts a lot of spam). Maybe move the more concrete applications from Analytics there, too. Chire (talk) 23:48, 6 July 2022 (UTC)
New Data Mining Process models
editI have noticed that someone is keep deleting a new cited work for a process model that was published by IEEE. However, the citation of other papers was simply allowed including ones that simply compare process models to each other. In addition, the article is full with citation to a less significant work, journal articles and other type of research papers.
Somebody suggested building a consensus about this issue using the talk page. Please refere to the cited references using the following link: https://ieeexplore.ieee.org/iel7/6287639/8948470/09263253.pdf
— Preceding unsigned comment added by 176.29.83.94 (talk) 16:05, 11 July 2022 (UTC)
- This is not a place for you to self promote. Please stop spamming us. MrOllie (talk) 12:15, 1 August 2022 (UTC)
India Education Program course assignment
editThis article was the subject of an educational assignment supported by Wikipedia Ambassadors through the India Education Program.
The above message was substituted from {{IEP assignment}}
by PrimeBOT (talk) on 19:55, 1 February 2023 (UTC)
Data Mining versus Factor Analysis
editData mining is a large scale effort to increase the possibility of finding something that an investigator doesn’t know when a better procedure is Factor Analysis, a statistical analysis process developed by Dr Benjamin Fructer. This process uses both repeated linear and nonlinear regression to determine factors within the data. Factor Analysis can be used to test hypotheses or investigate a database for variables that are related. It is preferable to large scale data snooping because it provides statistical significance estimates. DoctorDuncan (talk) 01:42, 5 May 2023 (UTC)
- Assuming you mean the author of Introduction to Factor Analysis (1954), I think his name is spelled "Benjamin Fruchter". Factor analysis is much older than that, but my understanding is that the term "data snooping" is typically used pejoratively for the misuse of tools to "provide" statistical significance, not for the specific tools themselves. Perhaps I'm wrong about this, but regardless, you will need a reliable source to add any of this to the article. Wikipedia doesn't publish original research. Grayfell (talk) 05:35, 5 May 2023 (UTC)
Wiki Education assignment: IFS213-Hacking and Open Source Culture
editThis article was the subject of a Wiki Education Foundation-supported course assignment, between 5 September 2023 and 19 December 2023. Further details are available on the course page. Student editor(s): Hacksasaur (article contribs). Peer reviewers: Yaman Shqeirat.
— Assignment last updated by T57fd (talk) 00:23, 1 December 2023 (UTC)
Definition of data mining in IEEE at least 8 definition also mention books name, write name,year of publication etc.
editDefinition of data mining in IEEE at least 8 definition also mention books name, write name,year of publication etc. 203.215.178.62 (talk) 12:50, 2 November 2023 (UTC)
Data Mining Downsides
editI am adding this section to the Data Mining main page because I think it is important to note that there are cons to Data Mining. I feel that illuminating a section such as this will give users a better idea of what Data Mining is and how big of an undertaking it can be, as well as why it can be very difficult for independent persons or small business to data mine. Users who are not very experienced with technology may feel overwhelmed by Data Mining and might need something laid out to them that very easily shows them what to be wary of when it comes to Data Mining. Wkobrien2 (talk) 18:17, 21 February 2024 (UTC)
- I reverted it - advertising materials such as vendor blogs are not considered reliable sources on Wikipedia. MrOllie (talk) 18:30, 21 February 2024 (UTC)
Wiki Education assignment: IFS213-Hacking and Open Source Culture, Fall 2024
editThis article is currently the subject of a Wiki Education Foundation-supported course assignment, between 3 September 2024 and 13 December 2024. Further details are available on the course page. Student editor(s): Apollommcc2024 (article contribs). Peer reviewers: LEN2134881.
— Assignment last updated by KAN2035117 (talk) 02:35, 30 October 2024 (UTC)