<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.2.1" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Extending Netnography: Researching Online Communities</title>
	<link>http://kozinets.net/archives/94</link>
	<description>Robert Kozinets on Marketing, Media, and Technoculture</description>
	<pubDate>Thu, 21 Aug 2008 03:44:11 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2.1</generator>

	<item>
		<title>By: rpwagner</title>
		<link>http://kozinets.net/archives/94#comment-480</link>
		<author>rpwagner</author>
		<pubDate>Mon, 12 Nov 2007 10:47:36 +0000</pubDate>
		<guid>http://kozinets.net/archives/94#comment-480</guid>
		<description>Robert,

we (me and some of my colleagues) have published two years ago a paper using data-mining in the orkut (www.orkut.com) virtual community. We have chosen 2 communities ("I love beer" with 52,000 members and "I hate beer" with 32,000 members) and made a probabilistic sample of them. We have "chosen" 400 users from each of these communities. 

From the users, we have collected all the data that they show at their profile, as the number of communities that they participate, how many friends, how many scraps, testimonials, and other information that he or she can describe about himself or herself. 
With these, we tryed to predict if the user participate in one community or in the other

We used two different types of data analysis: 1 - logistics regression; 2 - neural networks.

we got better results with the neural network, and some of the results from this analysis are these:
Training sample: 591
validation sample: 196
variables: 121
neurons: 124 with 3 occult layers
Rsquare: 0,7446
best index in training: 95% of the cases
best index in validation: 91% of the cases

it was a quite interesting paper, unfortunatly we just have it in portuguese.</description>
		<content:encoded><![CDATA[<p>Robert,</p>
<p>we (me and some of my colleagues) have published two years ago a paper using data-mining in the orkut (www.orkut.com) virtual community. We have chosen 2 communities (&#8221;I love beer&#8221; with 52,000 members and &#8220;I hate beer&#8221; with 32,000 members) and made a probabilistic sample of them. We have &#8220;chosen&#8221; 400 users from each of these communities. </p>
<p>From the users, we have collected all the data that they show at their profile, as the number of communities that they participate, how many friends, how many scraps, testimonials, and other information that he or she can describe about himself or herself.<br />
With these, we tryed to predict if the user participate in one community or in the other</p>
<p>We used two different types of data analysis: 1 - logistics regression; 2 - neural networks.</p>
<p>we got better results with the neural network, and some of the results from this analysis are these:<br />
Training sample: 591<br />
validation sample: 196<br />
variables: 121<br />
neurons: 124 with 3 occult layers<br />
Rsquare: 0,7446<br />
best index in training: 95% of the cases<br />
best index in validation: 91% of the cases</p>
<p>it was a quite interesting paper, unfortunatly we just have it in portuguese.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
