<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.3.2" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>DM(X)</title>
	<link>http://www.bogdancrivat.net/dm</link>
	<description>bogdanc on SQL Server Data Mining</description>
	<pubDate>Tue, 06 Jan 2009 10:02:30 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.2</generator>
	<language>en</language>
			<item>
		<title>Querying Rules and Itemsets (like the Data Mining Viewers do)</title>
		<link>http://www.bogdancrivat.net/dm/archives/58</link>
		<comments>http://www.bogdancrivat.net/dm/archives/58#comments</comments>
		<pubDate>Tue, 06 Jan 2009 10:01:46 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/58</guid>
		<description><![CDATA[I will try to continue the series started by Jamie, presenting the other set of queries issued by the Microsoft Association Rules viewer. Recently, a question on these queries appeared on the MSDN Data Mining Forums and the poster raised a very good point: while the stored procedures were intended as internal calls for the [...]]]></description>
			<content:encoded><![CDATA[<p>I will try to continue the series started by <a href="http://www.sqldataminingbook.com/dmbook/post/2008/12/12/Querying-the-Dependency-Net.aspx" target="_blank">Jamie</a>, presenting the other set of queries issued by the Microsoft Association Rules viewer. Recently, a question on these queries appeared on the <a href="http://social.msdn.microsoft.com/Forums/en-US/sqldatamining/thread/74c8ae12-2ded-4f5a-b364-9f78cfa79d57" target="_blank">MSDN Data Mining Forums</a> and the poster raised a very good point: while the stored procedures were intended as internal calls for the built-in viewers, external applications and viewers may want to employ them. </p>
<p>So, here is how the rest of the Association Rules viewer works.</p>
<p>Once the viewer is loaded, the first call is something like:</p>
<p><font face="Courier New"><font color="#0000ff">CALL</font> System.Microsoft.AnalysisServices.System.DataMining.AssociationRules.GetStatistics(&#8217;Customers&#8217;)</font></p>
<p>The single parameter of this stored procedure is the mining model name.&#160; The result is a one-row table containing the following columns:</p>
<table cellspacing="0" cellpadding="2" width="800" border="1">
<tbody>
<tr>
<td valign="top" width="167">
<p align="center"><strong><em>Column</em></strong></p>
</td>
<td valign="top" width="127">
<p align="center"><strong><em>Sample Value</em></strong></p>
</td>
<td valign="top" width="504">
<p align="center"><strong><em>Comments</em></strong></p>
</td>
</tr>
<tr>
<td valign="top" width="167">MAX_PAGE_SIZE</td>
<td valign="top" width="127">2000</td>
<td valign="top" width="504">The maximum server supported page size for fetching rule and itemsets. This parameter ensures the the viewer will not make requests which will make the server go out of memory, details later.</td>
</tr>
<tr>
<td valign="top" width="167">MIN_SUPPORT</td>
<td valign="top" width="127">89</td>
<td valign="top" width="504">Minimum actual support for rules detected by the model</td>
</tr>
<tr>
<td valign="top" width="167">MAX_SUPPORT</td>
<td valign="top" width="127">2439</td>
<td valign="top" width="504">Maximum actual support for rules detected by the model </td>
</tr>
<tr>
<td valign="top" width="167">MIN_ITEMSET_SIZE</td>
<td valign="top" width="127">0</td>
<td valign="top" width="504">Minimum itemset size</td>
</tr>
<tr>
<td valign="top" width="167">MAX_ITEMSET_SIZE</td>
<td valign="top" width="127">3</td>
<td valign="top" width="504">Maximum itemset size</td>
</tr>
<tr>
<td valign="top" width="167">MIN_RULE_PROBABILITY</td>
<td valign="top" width="127">0.401529636711281 </td>
<td valign="top" width="504">minimum actual rule probability</td>
</tr>
<tr>
<td valign="top" width="167">MAX_RULE_PROBABILITY</td>
<td valign="top" width="127">0.993975903614458&#160;&#160;&#160; </td>
<td valign="top" width="504">maximum actual rule probability</td>
</tr>
<tr>
<td valign="top" width="167">MIN_RULE_LIFT</td>
<td valign="top" width="127">0.514182044237125 </td>
<td valign="top" width="504">minimum actual rule importance</td>
</tr>
<tr>
<td valign="top" width="167">MAX_RULE_LIFT</td>
<td valign="top" width="127">2.13833283242171</td>
<td valign="top" width="504">maximum actual rule importance</td>
</tr>
</tbody>
</table>
<p> <a href="http://www.bogdancrivat.net/dm/archives/58#more-58" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/58/feed</wfw:commentRss>
		</item>
		<item>
		<title>Update: Data Mining in the Cloud is back up</title>
		<link>http://www.bogdancrivat.net/dm/archives/53</link>
		<comments>http://www.bogdancrivat.net/dm/archives/53#comments</comments>
		<pubDate>Mon, 24 Nov 2008 19:08:01 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/53</guid>
		<description><![CDATA[To restore your settings:
- in the browser: keep using http://sqlserverdatamining.com/cloud
- in Excel (add-in): go to the Connections button in the Analyze (in the Cloud) ribbon and type the following URL: https://131.107.181.99/CloudDM/TATServices/
With this change, you Excel add-in will once again use HTTPS to secure the data transmission
Thanks for your patience!
]]></description>
			<content:encoded><![CDATA[<p>To restore your settings:</p>
<p>- in the browser: keep using <a href="http://sqlserverdatamining.com/cloud">http://sqlserverdatamining.com/cloud</a></p>
<p>- in Excel (add-in): go to the Connections button in the Analyze (in the Cloud) ribbon and type the following URL: <a href="https://131.107.181.99/CloudDM/TATServices/">https://131.107.181.99/CloudDM/TATServices/</a></p>
<p>With this change, you Excel add-in will once again use HTTPS to secure the data transmission</p>
<p>Thanks for your patience!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/53/feed</wfw:commentRss>
		</item>
		<item>
		<title>Data Mining in the Cloud is temporarily down</title>
		<link>http://www.bogdancrivat.net/dm/archives/52</link>
		<comments>http://www.bogdancrivat.net/dm/archives/52#comments</comments>
		<pubDate>Sun, 16 Nov 2008 04:22:09 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/52</guid>
		<description><![CDATA[On Saturday, November 15th, the connection to the Table Analysis in the Cloud URL is broken. Until the problem is identified and fixed, here are some workarounds:
- For the web interface, use the http://www.sqlserverdatamining.com/cloud URL
- For the Excel add-in, please change the services connection URL. To do that, click the Connections button in the &#34;Analyze [...]]]></description>
			<content:encoded><![CDATA[<p>On Saturday, November 15th, the connection to the Table Analysis in the Cloud URL is broken. Until the problem is identified and fixed, here are some workarounds:</p>
<p>- For the web interface, use the <a href="http://www.sqlserverdatamining.com/cloud">http://www.sqlserverdatamining.com/cloud</a> URL</p>
<p>- For the Excel add-in, please change the services connection URL. To do that, click the Connections button in the &quot;Analyze (in the Cloud)&quot; ribbon and change the destination URL to </p>
<p><a title="http://131.107.181.101/CloudDM/TATServices/" href="http://131.107.181.101/CloudDM/TATServices/">http://131.107.181.101/CloudDM/TATServices/</a></p>
<p><strong>NOTE: This temporary solution does not support SSL. Your data is transmitted in clear</strong></p>
<p>I&#8217;ll post here an update as soon as the servers are up again</p>
<div class="wlWriterSmartContent" id="scid:0767317B-992E-4b12-91E0-4F059A8CECA8:9be4028d-2f16-4e40-bc48-230c202a5dfe" style="padding-right: 0px; display: inline; padding-left: 0px; padding-bottom: 0px; margin: 0px; padding-top: 0px">Technorati Tags: <a href="http://technorati.com/tags/Cloud" rel="tag">Cloud</a></div>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/52/feed</wfw:commentRss>
		</item>
		<item>
		<title>Book&#8217;s Blog</title>
		<link>http://www.bogdancrivat.net/dm/archives/51</link>
		<comments>http://www.bogdancrivat.net/dm/archives/51#comments</comments>
		<pubDate>Tue, 11 Nov 2008 05:19:20 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<category><![CDATA[Book]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/51</guid>
		<description><![CDATA[The &#8220;Data Mining with SQL Server 2008&#8243; book  now has a blog. You can check it out at http://www.SqlDataMiningBook.com
Soon enough, there will be some content there:
- various data mining related postings from Jamie and me &#8212; all the product related postings on this blog will be replicated on the book&#8217;s blog . With a bit [...]]]></description>
			<content:encoded><![CDATA[<p>The &#8220;Data Mining with SQL Server 2008&#8243; book  now has a blog. You can check it out at <a href="http://www.SqlDataMiningBook.com">http://www.SqlDataMiningBook.com</a></p>
<p>Soon enough, there will be some content there:</p>
<p>- various data mining related postings from <a target="_blank" href="http://blogs.msdn.com/jamiemac/default.aspx">Jamie</a> and me &#8212; all the product related postings on this blog will be replicated on the book&#8217;s blog . With a bit of discipline, all the postings will be tagged with the relevant chapter number</p>
<p>- errata (well, hopefully that won&#8217;t be the main topic of the blog <img src='http://www.bogdancrivat.net/dm/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> )</p>
<p>- any new downloads or other information that may be relevant for the readers</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/51/feed</wfw:commentRss>
		</item>
		<item>
		<title>Data Mining with SQL Server 2008 + get your own free autographed copy!</title>
		<link>http://www.bogdancrivat.net/dm/archives/50</link>
		<comments>http://www.bogdancrivat.net/dm/archives/50#comments</comments>
		<pubDate>Sat, 08 Nov 2008 06:12:01 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/50</guid>
		<description><![CDATA[ 
&#160;
The new version of the SQL Data Mining book is finally available, at least at Amazon. If you are currently SQL DM user, you have an opportunity to get a free autographed copy&#160; by filling out a short survey about the way you use SQL Server Data Mining. For more details about the survey, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.amazon.com/gp/product/0470277742?ie=UTF8&amp;tag=sqlserverda09-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0470277742" target="_blank"><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="244" alt="DMBook" src="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/11/dmbook.jpg" width="194" border="0" /></a> </p>
<p>&#160;</p>
<p>The new version of the SQL Data Mining book is finally available, at least at Amazon. If you are currently SQL DM user, you have an opportunity to get a <strong><u>free autographed copy</u></strong>&#160; by filling out a short <a href="http://www.questionpro.com/akira/TakeSurvey?id=1090672" target="_blank">survey</a> about the way you use SQL Server Data Mining. For more details about the survey, check <a href="http://blogs.msdn.com/jamiemac/default.aspx" target="_blank">Jamie</a>&#8217;s blog <a href="http://blogs.msdn.com/jamiemac/archive/2008/11/07/get-a-free-autographed-copy-of-data-mining-with-sql-server-2008.aspx" target="_blank">post</a>.</p>
<p>More details about the book here: <a title="Data Mining with SQL Server 2008" href="http://www.amazon.com/gp/product/0470277742?ie=UTF8&amp;tag=sqlserverda09-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0470277742">Data Mining with SQL Server 2008</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/50/feed</wfw:commentRss>
		</item>
		<item>
		<title>New Tool for Cloud Data Mining &#8212; Prediction calculator</title>
		<link>http://www.bogdancrivat.net/dm/archives/48</link>
		<comments>http://www.bogdancrivat.net/dm/archives/48#comments</comments>
		<pubDate>Tue, 16 Sep 2008 07:56:02 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[2008]]></category>

		<category><![CDATA[Cloud]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/48</guid>
		<description><![CDATA[Technorati Tags: Cloud,Data Mining,SQL Server
A new SQL Server Data Mining tool is available now for you to use directly from the browser &#8212; the Prediction Calculator.
The feature (already present in the 2008 version of the Data Mining Add-ins) is a scorecard generator. The tool learns from your data and produces a simple calculator which can [...]]]></description>
			<content:encoded><![CDATA[<p style="display: inline; float: none; margin: 0px; padding: 0px" id="scid:0767317B-992E-4b12-91E0-4F059A8CECA8:bd29b204-8fa4-4117-acc6-d65b42605d28" class="wlWriterSmartContent">Technorati Tags: <a rel="tag" href="http://technorati.com/tags/Cloud">Cloud</a>,<a rel="tag" href="http://technorati.com/tags/Data%20Mining">Data Mining</a>,<a rel="tag" href="http://technorati.com/tags/SQL%20Server">SQL Server</a></p>
<p>A new SQL Server Data Mining tool is available now for you to use directly from the browser &#8212; the Prediction Calculator.</p>
<p>The feature (already present in the <a target="_blank" href="http://www.microsoft.com/sqlserver/2008/en/us/data-mining-addins.aspx">2008 version of the Data Mining Add-ins</a>) is a scorecard generator. The tool learns from your data and produces a simple calculator which can be used to compute a score (between 1 and 1000), score which describes the likelihood of a certain event. If you are not familiar with this kind of tools, think of the personality tests that appear in various magazines or about the FICO score.</p>
<p>When you launch the Prediction Calculator, you need to select a target column and a target value (or range). If you target column has categorical values, then you will need to specify <strong>exactly</strong> your target value. Example: if you want to predict whether some computer will crash in the next month, you will need to select, say, the &#8220;Yes&#8221; value of the &#8220;Will Crash in the Next Month&#8221; column. If your target is numeric, you can specify a range. For example, you can use the prediction calculator to figure out how likely it is that your insurance costs will grow between 20% and 50% in one year.</p>
<p>The result, depending on the columns in your data set, looks more or less like below. As you select different values for various columns, the score goes up or down. If it exceeds a certain threshold then you get a positive prediction, otherwise a negative prediction.</p>
<p>For example, try to play with the calculator below to figure out whether you are a likely Bike Buyer or not:</p>
<p><iframe height="470" width="420" frameBorder="0" src="/PC.htm" marginHeight="0" marginWidth="0"></iframe></p>
<p>How it works: each of the attributes gets a score, depending on the state. For example, having 0 cars may score 100 points, while having 2 cars may score 0 points (people with many cars are less likely to purchase a bike). The points are added up and, if the total score (always between 0 and 1000) exceeds a certain threshold, then the prediction is positive, otherwise negative.</p>
<p>The Prediction Calculator tool allows you to determine the threshold that maximizes your &#8220;profit&#8221;. In the lower part of the prediction calculator result (not visible in the embedded snapshot above) you can specify the costs associated with a False Positive (Type I) or False Negative (Type II) error. You can also specify any profits deriving from a correct positive or negative prediction.</p>
<p>The tool computes the total profit generated by various score thresholds based on your inputs and produces a set of diagrams, one for profit and one for cumulative costs, like below:</p>
<p><img src="http://131.107.181.99/CloudDM/temp/dcd_3/dcp_55c5535-3-2.png?guid=f216a8f4-44d7-49fe-b9fb-abdf88215038" /><img src="http://131.107.181.99/CloudDM/temp/dcd_3/dcp_55c5535-3-3.png?guid=cab030c8-29a8-476f-9134-a5b8d3f3b9a3" /></p>
<p>You can see in the profit diagram that the profit is maximized for a score around 450 or 500. The costs are also minimized in that area.</p>
<p>The tool detects the scorecard threshold that optimizes the profit (and includes it in the page).</p>
<p>Once you produce and tune a prediction calculator, you can embed it in your HTML page, just copy and paste the HTML fragment &#8212; the Bike Buyer calculator above is such a fragment.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/48/feed</wfw:commentRss>
		</item>
		<item>
		<title>Data Mining for the Cloud (or how I spent my summer)</title>
		<link>http://www.bogdancrivat.net/dm/archives/47</link>
		<comments>http://www.bogdancrivat.net/dm/archives/47#comments</comments>
		<pubDate>Fri, 29 Aug 2008 08:19:15 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[Cloud]]></category>

		<category><![CDATA[DMX]]></category>

		<category><![CDATA[Excel Add-ins]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/47</guid>
		<description><![CDATA[This week, at the KDD (Knowledge Discovery and Data Mining) conference, we (as in Microsoft SQL Server Data Mining team) presented the Table Analysis Tools for the Cloud, a preview for a technology that enables anybody to play with some of the Microsoft&#8217;s data mining tools, without any bulky downloads and with zero configuration effort.
Around [...]]]></description>
			<content:encoded><![CDATA[<p>This week, at the <a href="http://www.kdd2008.com/" target="_blank">KDD</a> (Knowledge Discovery and Data Mining) conference, we (as in Microsoft SQL Server Data Mining team) presented the Table Analysis Tools for the Cloud, a preview for a technology that enables anybody to play with some of the Microsoft&#8217;s data mining tools, without any bulky downloads and with zero configuration effort.</p>
<p>Around May this year I practically entered some sort of sabbatical: 3 months to work on an incubation project of my choice (yes, the Microsoft SQL Server organization does this kind of things! if it sounds appealing,&#160; check out our <a href="http://members.microsoft.com/careers/careerpath/technical/default.mspx" target="_blank">recruiting</a> site or, even better, contact directly our SQL Server Data Mining recruiter, Melsa Clarke - melsac <strong><em>AT</em> </strong>microsoft <strong>DOT</strong> com). With some help from <a href="http://blogs.msdn.com/jamiemac" target="_blank">Jamie</a>, various management levels and some nice guys in the <a href="http://www.microsoft.com/sql/dataservices/default.mspx" target="_blank">SQL Server Data Services</a> team, I gathered the infrastructure for a Software as a Service incubation and set up a web incarnation of the Table Analysis Tools add-in for Excel.</p>
<p>Now it is up and running. The entry page is at <a href="http://www.sqlserverdatamining.com/cloud" target="_blank">sqlserverdatamining.com/cloud</a>, so if I got you bored already and you don&#8217;t want to read the rest of this stuff, go ahead and browse that page.</p>
<h4><a href="http://www.sqlserverdatamining.com/cloud"><img style="border-top-width: 0px; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="181" alt="image" src="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/08/image.png" width="244" border="0" /></a></h4>
<h4></h4>
<h4>&#160;</h4>
<h4>What it is</h4>
<p>TAT Cloud is a set of canned data mining tasks that you can use without having SQL Server installed on your machine.&#160; It consists of encapsulations of some common data mining problems, such as detecting key influencers, forecasting, generating predictive scorecards or doing market basket analysis. The tasks can be executed directly from your browser: just go to the <a href="http://131.107.181.99/CloudDM/Default.aspx" target="_blank">web page</a>, upload your data (in CSV) format and run a tool from the toolbar. Even better, the tasks can be executed directly from Excel. For this, however, you will need to have Excel 2007 and install an add-in which can be downloaded from <a href="http://131.107.181.99/CloudDM/CloudTatSetup.msi" target="_blank">here</a>.</p>
<p>All tasks work on a table (Excel table or a table in CSV format that you upload to the web interface). All tasks produce reports that can be used to learn more about the analyzed data.</p>
<p>Here is a complete list of features:</p>
<ul>
<li>- Analyze Key Influencers: it detects the columns that impact your target column. It presents a report of those values in other columns that correlate strongly with values in your target column. </li>
<li>- Detect Categories (clustering, for data miners) &#8212; identifies groups of table rows that share similar characteristics. A categories report is generated, which details the characteristics of each category </li>
<li>- Fill From Example &#8212; to some extent, similar to Excel&#8217;s Autofill feature: it learns from a few examples and extends the learned patterns to the remaining rows in the table </li>
<li>- Forecasting &#8212; analyzes vertical series of numeric data, detects periodicity, trends and correlations between series and produces a forecast for those series </li>
<li>- Highlight Exceptions &#8212; finds the interesting (or unusual, or out-of-ordinary) rows in your table </li>
<li>- Scenario Analysis &#8212; What-If and Goal-Seek tools based on a probabilistic model built on top of your data. </li>
<li>- Prediction Calculator &#8212; a tool for generating prediction scorecards </li>
<li>- Market Basket Analysis &#8212; analyzes transaction tables to identify groups of items that appear together in transactions </li>
</ul>
<p>All features work from Excel. However, not all the features are implemented in the web interface. They will show up eventually!</p>
<p>A really <a href="http://sqlserverpedia.com/blog/analysis-services/sql-server-data-mining-in-the-cloud/" target="_blank">nice presentation</a> of how to use the tools was written by <a href="http://www.brentozar.com" target="_blank">Brent Ozar</a> on <a href="http://sqlserverpedia.com/blog/analysis-services/sql-server-data-mining-in-the-cloud/" target="_blank">SQLServerPedia</a>. Thanks, Brent!</p>
<h4>&#160;</h4>
<h4>How it works</h4>
<p>Your data (CSV file or Excel spreadsheet) is uploaded to the web service site. There, Analysis Services crunches it and produces the reports you get either in the browser or in a different spreadsheet. The data and the mining model used for analyzing it are&#160; deleted immediately after processing. If something bad happens and your session does not conclude successfully (fancy wording for &quot;if it crashes&quot;) then both data and models are removed automatically after 15-20 minutes.</p>
<p>You will notice in the Excel add-in (as well as in the web interface) a strange IP address, 131.107.181.99 &#8212; this is IP of the service. It will be changed to something cleaner very soon.</p>
<p>Excel uses HTTPS to connect to the service, in an effort to protect your data. However, you should not use this technology on sensitive data.</p>
<h4>&#160;</h4>
<h4>What it is not</h4>
<p>This is not an official shipping Microsoft technology. It is actually less than even a beta. It may crash, it may produce incorrect results, it may be shot down at any time. BTW, I would appreciate if you posted note here or on the <a href="http://forums.microsoft.com/msdn/ShowForum.aspx?ForumID=81&amp;SiteID=1" target="_blank">Microsoft data mining forums</a> if this happens (particularly about crashes, I should probably know if it gets shut down)!</p>
<h4>&#160;</h4>
<h4>What next</h4>
<p>Well, try the tools: <a href="http://www.sqlserverdatamining.com/cloud" target="_blank">sqlserverdatamining.com/cloud</a> . I recommend downloading the Excel 2007 add-in, rather than using the web application, as it is more functional. Post any questions you might have on the <a href="http://forums.microsoft.com/msdn/ShowForum.aspx?ForumID=81&amp;SiteID=1" target="_blank">Microsoft data mining forums</a>. And check out this blog periodically for announcements (typically new additions to the web interface) or for more details on how this stuff works</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/47/feed</wfw:commentRss>
		</item>
		<item>
		<title>Companion for MS Analysis Services</title>
		<link>http://www.bogdancrivat.net/dm/archives/45</link>
		<comments>http://www.bogdancrivat.net/dm/archives/45#comments</comments>
		<pubDate>Wed, 25 Jun 2008 06:06:41 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/45</guid>
		<description><![CDATA[&#160;
Today I discovered a very nice Analysis Services client tool. Produced by SQLMinds, the tool seems a great addition to Analysis Services. It provides many components, among them a performance tuning service, an OLAP cube browser for the web and a very nice web front end for data mining, the DM Companion tool which can [...]]]></description>
			<content:encoded><![CDATA[<p>&nbsp;</p>
<p>Today I discovered a very nice Analysis Services client tool. Produced by <a href="http://www.sqlminds.com" target="_blank">SQLMinds</a>, the tool seems a great addition to Analysis Services. It provides many components, among them a performance tuning service, an OLAP cube browser for the web and a very nice web front end for data mining, the <a href="http://x32.sqlminds.com/dmcompanion" target="_blank">DM Companion</a> tool which can be launched at <a href="http://x32.sqlminds.com/dmcompanion">http://x32.sqlminds.com/dmcompanion</a>.</p>
<p>So, here are a few really nice things about the DM Companion tool (BTW, a fully working demo is running at the aforementioned URL).</p>
<p>The tool works to some extent like a Data Mining -specific SQL Server Management Studio for the web. Therefore, it allows you connect to the server of your choice (through the <a href="http://www.microsoft.com/technet/prodtechnol/sql/2005/httpasws.mspx" target="_blank">Analysis Services HTTP pump</a>). The demo seems to allow anonymous connections (pretty safe&nbsp; as all the interactions offered by the tool are read-only).&nbsp; Next you get to chose your AS catalog and the mining model you want to use.</p>
<p>For each model you have the option to browse the content or execute predictions.</p>
<p>The prediction feature provides a nice interface for defining singleton predictions, as you can see below:</p>
<p><a href="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image.png"><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="214" alt="image" src="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image-thumb.png" width="244" border="0"></a> </p>
<p>The interface supports specifying multiple nested table keys as input, so the tool can perform associative predictions as well. It reminds me of the <a href="http://www.sqlserverdatamining.com/XMLAThinMiner" target="_blank">XMLA Thin Miner sample</a> running on SQLServerDatamining,com, however, it looks much better and is nicely integrated with the rest of the application. While prediction functions do not seem to be directly supported, the application is able to predict a cluster for clustering models.</p>
<p>The model browsing features are really nice. Analysis Services includes a set of sample web viewers for Naive Bayes, Trees and Clustering. This application provides some seriously better looking viewers for these algorithms, and extends the suite at least for Neural Networks (a really nice viewer), Sequence Clustering and Association Rules.&nbsp; The DM Companion viewers offer all the features in the sample viewers, with a nicer implementation which uses AJAX and has better graphics, plus a solid set of new features, the most spectacular being the interactive dependency net browser and the pie chart visualization for decision trees, which you can see below.</p>
<p><a href="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image1.png"><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="191" alt="image" src="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image-thumb1.png" width="244" border="0"></a> <a href="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image2.png"><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="191" alt="image" src="http://www.bogdancrivat.net/dm/wp-content/uploads/2008/06/image-thumb2.png" width="244" border="0"></a> </p>
<p>&nbsp;</p>
<p>Overall, DM Companion looks like a really nice tool for sharing your data mining implementation on the web</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/45/feed</wfw:commentRss>
		</item>
		<item>
		<title>DMX Pitfalls : Natural Prediction Joins</title>
		<link>http://www.bogdancrivat.net/dm/archives/38</link>
		<comments>http://www.bogdancrivat.net/dm/archives/38#comments</comments>
		<pubDate>Fri, 11 Apr 2008 07:04:16 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[DMX]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/38</guid>
		<description><![CDATA[Some very interesting, although time-consuming project kept me away from posting in the last month. Mea maxima culpa, I will try to catch up and put some new and interesting content here more often. 
Recently, trying to make sense of some prediction results, I realized that the DMX NATURAL PREDICTION JOIN statement has some obvious [...]]]></description>
			<content:encoded><![CDATA[<p>Some very interesting, although time-consuming <a href="http://www.wiley.com/WileyCDA/WileyTitle/productCd-0470277742.html" target="_blank">project</a> kept me away from posting in the last month. Mea maxima culpa, I will try to catch up and put some new and interesting content here more often. </p>
<p>Recently, trying to make sense of some prediction results, I realized that the <a href="http://msdn2.microsoft.com/en-us/library/ms132058.aspx" target="_blank">DMX</a> NATURAL PREDICTION JOIN statement has some obvious and some not so obvious dangers, which deserve some clarification.</p>
<p>To summarize, the PREDICTION JOIN operator is used to apply patterns from a trained mining model against new data. The general syntax is </p>
<blockquote><p>SELECT EducationLevel</p>
<p>FROM MyModel</p>
<p>PREDICTION JOIN </p>
<p>OPENQUERY(MyDataSource, &#8221;SELECT Height, , HairColor, ShoeSize FROM CustomerInfo&#8217;) AS T</p>
<p>ON </p>
<p>T.Height = MyModel.Height AND</p>
<p>T.HairColor = MyModel.HairColor AND</p>
<p>T.ShoeSize= MyModel.ShoeSize</p>
</blockquote>
<p>In the standard PREDICTION JOIN statement, the ON clauses specify how columns from the new data are to mapped to the input columns of the mining model.</p>
<p>The statement supports a simplified format:</p>
<blockquote><p>SELECT EducationLevel</p>
<p>FROM MyModel</p>
<p><strong>NATURAL </strong>PREDICTION JOIN </p>
<p>OPENQUERY(MyDataSource, &#8221;SELECT Height, HairColor, ShoeSize FROM CustomerInfo&#8217;) AS T</p>
</blockquote>
<p>In the simplified format, the columns from the new data are mapped to the input columns of the mining model <strong>by name</strong>. Much easier to type, resulting in a more readable statement!</p>
<p>Now, the natural mapping may end up in three different scenarios. In the first scenario, all the model&#8217;s input columns find a corresponding data column in the input. The mapping is performed correctly and everything works as expected. Nothing to blog about in this case!</p>
<p><strong>The second case</strong>: some of the inputs are mapped correctly to model columns, some are not, although you would expect them to be mapped correctly. More than a simple user error, this situation is actually &#8220;facilitated&#8221; by the way the BI Dev Studio works. Assume you start with a table containing 4 columns: Height, HairColor, ShoeSize, EducationLevel and use BI Dev Studio to create a mining structure and a model. BI Dev Studio will generate more friendly names for the columns and your mining model ends up with the structure below:</p>
<blockquote><p>MyModel</p>
<p>(</p>
<p>Height</p>
<p>[Hair Color],</p>
<p>[Shoe Size],</p>
<p>[Education Level]</p>
<p>)</p>
</blockquote>
<p>&nbsp;</p>
<p> <a href="http://www.bogdancrivat.net/dm/archives/38#more-38" class="more-link">(more&#8230;)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/38/feed</wfw:commentRss>
		</item>
		<item>
		<title>Video Tutorials for Business Intelligence with Excel 2007</title>
		<link>http://www.bogdancrivat.net/dm/archives/37</link>
		<comments>http://www.bogdancrivat.net/dm/archives/37#comments</comments>
		<pubDate>Fri, 14 Mar 2008 07:41:30 +0000</pubDate>
		<dc:creator>bogdan</dc:creator>
		
		<category><![CDATA[Excel Add-ins]]></category>

		<category><![CDATA[Add-ins]]></category>

		<category><![CDATA[Business Intelligence]]></category>

		<category><![CDATA[Excel]]></category>

		<guid isPermaLink="false">http://www.bogdancrivat.net/dm/archives/37</guid>
		<description><![CDATA[MyVBProf.com hosts a set of great video tutorials for performing business intelligence tasks with Excel 2007. Bill Burrows starts with the installation of pre-requisites. He even shows how to make sure that Excel .Net Programmability Support is available! Wow!!
Bill then follows up with a set of tutorials on pivot tables and wraps up with videos [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.myvbprof.com/2007_Version/BI_Tutorial.aspx">MyVBProf.com</a> hosts a set of great video tutorials for performing business intelligence tasks with Excel 2007. <a href="http://www.myvbprof.com/2007_Version/AboutUs.aspx">Bill Burrows</a> starts with the installation of pre-requisites. He even shows how to make sure that Excel .Net Programmability Support is available! Wow!!</p>
<p>Bill then follows up with a set of tutorials on pivot tables and wraps up with videos on using the Table Analysis Tools from the SQL Server Data Mining add-ins for Excel. His approach is accessible, his style is captivating, his examples are meaningful and very well explained.</p>
<p>Therefore:</p>
<p>1. If you have SQL Server 2005 or 2008 CTP these tutorials give you a great start for BI using Excel and Analysis Service.</p>
<p>2. If you do not have SQL Server, get a free evaluation from <a href="http://www.microsoft.com/sql/downloads/trial-software.mspx">http://www.microsoft.com/sql/downloads/trial-software.mspx</a>, then go back to the previous point</p>
<p>3. You don&#8217;t have SQL Server, don&#8217;t want to try it out, don&#8217;t care about the data mining part &#8212; fine, just watch some of the tutorials to see what you miss!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bogdancrivat.net/dm/archives/37/feed</wfw:commentRss>
		</item>
	</channel>
</rss>
