Data Analysis Using SQL and Excel

Data Analysis Using SQL and Excel
by Gordon S. Linoff

Data Analysis Using SQL and Excel
List Price: $45.00
Our Price: $27.16
You Save: $17.84 (40%)
Availability: Usually ships in 1-2 business days
Buy Used: from $27.02 (click here)
Category: Book
See more book details and other editions


or

Book Summary Information

Author: Gordon S. Linoff
Edition: Paperback
Audio: English (Unknown); English (Original Language); English (Published)
Published: 2007-10-01
ISBN: 0470099518
Number of pages: 645
Publisher: Wiley

Book Reviews of Data Analysis Using SQL and Excel

Book Review: Comments from a colleague
Summary: 5 Stars

Gordon Linoff and I have written three an a half books together. (Four, if we get to count the second edition of Data Mining Techniques as a whole new book; it didn't feel like any less work.) Neither of us has written a book without the other before, so I must admit to a tiny twinge of regret upon first seeing the cover of this one without my name on it next to Gordon's. The feeling passed very quickly as recollections of the authorial life came flooding back--vacations spent at the keyboard instead of in or on the lake, opportunities missed, relationships strained. More importantly, this is a book that only Gordon Linoff could have written. His unique combination of talents and experiences informs every chapter.

I first met Gordon at Thinking Machines Corporation, a now long-defunct manufacturer of parallel supercomputers where we both worked in the late eighties and early nineties. Among other roles, Gordon managed the implementation of a parallel relational database designed to support complex analytical queries on very large databases. The design point for this database was radically different from other relational database systems available at the time in that no trade-offs were made to support transaction processing. The requirements for a system designed to quickly retrieve or update a single record are quite different from the requirements for a system to scan and join huge tables. Jettisoning the requirement to support transaction processing made for a cleaner, more efficient database for analytical processing. This part of Gordon's background means he understands SQL for data analysis literally from the inside out.

Just as a database designed to answer big important questions has a different structure from one designed to process many individual transactions, a book about using databases to answer big important questions requires a different approach to SQL. Many books on SQL are written for database administrators. Others are written for users wishing to prepare simple reports. Still others attempt to introduce some particular dialect of SQL in every detail. This one is written for data analysts, data miners, and anyone who wants to extract maximum information value from large corporate databases. Jettisoning the requirement to address all the disparate types of database user makes this a better, more focused book for the intended audience. In short, this is a book about how to use databases the way we ourselves use them.

Even more important than Gordon's database technology background, is his many years as a data mining consultant. This has given him a deep understanding of the kinds of questions businesses need to ask and of the data they are likely to have available to answer them. Years spent exploring corporate databases has given Gordon an intuitive feel for how to approach the kinds of problems that crop up time and again across many different business domains:

* How to take advantage of geographic data. A zip code field looks much richer when you realize that from zip code you can get to latitude and longitude and from latitude and longitude you can get to distance. It looks richer still when your realize that you can use it to join in census bureau data to get at important attributes such as population density, median income, percentage of people on public assistance, and the like.

* How to take advantage of dates. Order dates, ship dates, enrollment dates, birth dates. Corporate data is full of dates. These fields look richer when you understand how to turn dates into tenures, analyze purchases by day of week, and track trends in fulfillment time. They look richer still when you know how to use this data to analyze time-to-event problems such as time to next purchase or expected remaining life time of a customer relationship.

* How to build data mining models directly in SQL. This book shows you how to do things in SQL that you probably never imagined possible including generating association rules for market basket analysis, building regression models, and implementing naïve Bayes classifiers and scorecards.

* How to prepare data for use with data mining tools. Although more than most people realize can be done using just SQL and Excel, eventually you will want to use more specialized data mining tools. These tools need data in a specific format known as a customer signature. This book shows you how to create these data mining extracts.

The book is rich in examples and they all use real data. This point is worth saying more about. Unrealistic datasets lead to unrealistic results. This is frustrating to the student. In real life, the more you know about the business context, the better your data mining results will be. Subject matter expertise gives you a head start. You know what variables ought to be predictive and have good ideas about new ones to derive. Fake data does not reward these good ideas because patterns that should be in the data are missing and patterns that shouldn't be there have been introduced inadvertently. Real data is hard to come by, not least because real data may reveal more than its owners are willing to share about their business operations. As a result, many books and courses make do with artificially constructed datasets. Best of all, the datasets used in the book are all available for download at the companion web site http://www.data-miners.com/sql_companion.htm.

I reviewed the chapters of this book as they were written. This process was very beneficial to my own use of SQL and Excel. The exercise of thinking about the fairly complex queries used in the examples greatly increased my understanding of how SQL actually works. As a result, I have lost my fear of nested queries, multi-way joins, giant case statements, and other formerly daunting aspects of the language. In well over a decade of collaboration, I have always turned to Gordon for help using SQL to best advantage. Now, I can turn to this book. And you can too.

Summary of Data Analysis Using SQL and Excel

Useful business analysis requires you to effectively transform data into actionable information. This book helps you use SQL and Excel to extract business information from relational databases and use that data to define business dimensions, store transactions about customers, produce results, and more. Each chapter explains when and why to perform a particular type of business analysis in order to obtain useful results, how to design and perform the analysis using SQL and Excel, and what the results should look like.

Data Warehousing Books

Book Subjects
Most talked about in Data Warehousing Books
A generic and customizable framework for the design of ETL scenarios [An article from: Information Systems] ImageA generic and customizable framework for the design of ETL scenarios [An article from: Information Systems]
by P. Vassiliadis, A. Simitsis, P. Georgantas, Terrov
Elsevier; Digital; Book
Best price: $8.95
Mining and modeling for a metropolitan Atlanta ozone pollution decision-making framework.: An article from: IIE Transactions ImageMining and modeling for a metropolitan Atlanta ozone pollution decision-making framework.: An article from: IIE Transactions
by Zehua Yang, Victoria C.P. Chen, Michael E. Chang, Terrence E. Murphy, Julia C.C. Tsai
Thomson Gale; Published: 2007-06-01; Digital; Book
Best price: $9.95
MSMiner-a developing platform for OLAP [An article from: Decision Support Systems] ImageMSMiner-a developing platform for OLAP [An article from: Decision Support Systems]
by Z. Shi, Y. Huang, Q. He, L. Xu, S. Liu, L. Qin, Ji
Elsevier; Published: 2007-01-01; Digital; Book
Best price: $10.95
The concept of document warehousing for multi-dimensional modeling of textual-based business intelligence [An article from: Decision Support Systems] ImageThe concept of document warehousing for multi-dimensional modeling of textual-based business intelligence [An article from: Decision Support Systems]
by F.S.C. Tseng, A.Y.H. Chou
Elsevier; Published: 2006-11-01; Digital; Book
Best price: $10.95
Going for growth: capturing technology to spur top-line growth.(CEO2CEO SUMMIT): An article from: Chief Executive (U.S.) ImageGoing for growth: capturing technology to spur top-line growth.(CEO2CEO SUMMIT): An article from: Chief Executive (U.S.)
by Russ Mitchell
Chief Executive Publishing; Published: 2005-01-01; Digital; Book
Best price: $5.95
DROWNING IN DATA.: An article from: Strategic Finance ImageDROWNING IN DATA.: An article from: Strategic Finance
by Julie Smith David, Paul John Steinbart
Institute of Management Accountants; Published: 1999-12-01; Digital; Book
Best price: $5.95
Fast Track to MDX ImageFast Track to MDX
by Mark Whitehorn, Robert Zare, Mosha Pasumansky
Springer; Published: 2005-10-15; Paperback; Book
Best price: $41.00
Price in other shops: $69.95
The Microsoft Data Warehouse Toolkit: With SQL Server 2008 R2 and  the Microsoft Business Intelligence Toolset ImageThe Microsoft Data Warehouse Toolkit: With SQL Server 2008 R2 and the Microsoft Business Intelligence Toolset
by Joy Mundy, Warren Thornthwaite
Wiley; Published: 2011-03-08; Paperback; Book
Best price: $33.15
Price in other shops: $50.00
The Kimball Group Reader: Relentlessly Practical Tools for Data Warehousing and Business Intelligence ImageThe Kimball Group Reader: Relentlessly Practical Tools for Data Warehousing and Business Intelligence
by Ralph Kimball, Margy Ross
Wiley; Published: 2010-02-08; Paperback; Book
Best price: $29.99
Price in other shops: $45.00
Data Mining: A Knowledge Discovery Approach ImageData Mining: A Knowledge Discovery Approach
by Krzysztof J. Cios, Witold Pedrycz
Springer; Published: 2007-02-01; Hardcover; Book
Best price: $63.12
Price in other shops: $99.00
Similar Books and other products
Head First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions ImageHead First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions
by Michael Milton
O'Reilly Media; Published: 2009-08-04; Paperback; Book
Best price: $25.55
Price in other shops: $49.99
SQL Cookbook (Cookbooks (O'Reilly)) ImageSQL Cookbook (Cookbooks (O'Reilly))
by Anthony Molinaro
O'Reilly Media; Published: 2005-12-23; Paperback; Book
Best price: $21.99
Price in other shops: $39.95
Microsoft Excel 2010: Data Analysis and Business Modeling ImageMicrosoft Excel 2010: Data Analysis and Business Modeling
by Wayne L. Winston Ph.D.
Microsoft Press; Published: 2011-01-14; Paperback; Book
Best price: $25.98
Price in other shops: $49.99
Microsoft® Office Excel® 2007: Data Analysis and Business Modeling (Bpg -- Other) ImageMicrosoft® Office Excel® 2007: Data Analysis and Business Modeling (Bpg -- Other)
by Wayne L. Winston
Microsoft Press; Published: 2007-05-16; Paperback; Book
Best price: $22.69
Price in other shops: $39.99
Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management ImageData Mining Techniques: For Marketing, Sales, and Customer Relationship Management
by Gordon S. Linoff, Michael J. Berry
*Wiley Computer Publishing; Published: 2011-04-12; Paperback; Book
Best price: $26.82
Price in other shops: $50.00
Concepts of Epidemiology: Integrating the ideas, theories, principles and methods of epidemiology ImageConcepts of Epidemiology: Integrating the ideas, theories, principles and methods of epidemiology
by Raj Bhopal
Oxford University Press, USA; Published: 2008-10-15; Paperback; Book
Best price: $37.92
Price in other shops: $55.95
The Language of SQL: How to Access Data in Relational Databases ImageThe Language of SQL: How to Access Data in Relational Databases
by Larry Rockoff
Course Technology PTR; Published: 2010-06-03; Paperback; Book
Best price: $9.99
Price in other shops: $19.99
Learning SQL ImageLearning SQL
by Alan Beaulieu
O'Reilly Media; Published: 2009-04-27; Paperback; Book
Best price: $22.35
Price in other shops: $39.99
Sams Teach Yourself SQL in 10 Minutes (3rd Edition) ImageSams Teach Yourself SQL in 10 Minutes (3rd Edition)
by Ben Forta
Sams; Published: 2004-04-10; Paperback; Book
Best price: $12.56
Price in other shops: $24.99
Beginning SQL Joes 2 Pros: The SQL Hands-On Guide for Beginners (SQL Exam Prep Series 70-433 Volume 1 of 5) (Sql Design Series) ImageBeginning SQL Joes 2 Pros: The SQL Hands-On Guide for Beginners (SQL Exam Prep Series 70-433 Volume 1 of 5) (Sql Design Series)
by Rick A Morelan
BookSurge Publishing; Published: 2009-12-30; Paperback; Book
Best price: $17.39
Price in other shops: $25.00