Jump to content

NetMiner: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Networkcy (talk | contribs)
No edit summary
Networkcy (talk | contribs)
No edit summary
Line 159: Line 159:
===[ Modules for data transformation ]===
===[ Modules for data transformation ]===
* Direction: Symmetrize, Transpose
* Direction: Symmetrize, Transpose

* Value: Dichotomize, Reverse, Normalize, Recode, Missing, Diagonal
* Value: Dichotomize, Reverse, Normalize, Recode, Missing, Diagonal

* NodeSet
* NodeSet

* LinkSet
* LinkSet

* Matrix
* Matrix

* Layer
* Layer

* Mode
* Mode

* Random
* Random

===[ Modules for network analysis ]===
===[ Modules for network analysis ]===

* Analysis
* Neighbor: Degree(Neighbor), Ego Networks, Structural Hole, Homophily, Assortativity, Equicentrality

* Subgraph: Dyad Census, Triad Census, Triad Combination

* Connection: Shortest Path, All Path Finding, Dependency, Node Connectivity, Link Connectivity, Maximum Flows, Pfnet, Minimum Cutset, Influence, Accessibility

* Cohesion: Component, Bi-Component, Clique, n-Clique, n-Clan, k-Plex, k-Core, Lambda Set, Community, Betweenness, Modularity, Eigenvector, Label Propagation, Cohesive Block, s-Clique

* Centrality: Degree, Coreness, Closeness, Node Betweenness, Link Betweenness, Flow Betweenness, Eigenvector, Status, Power, R.W.Betweenness, Information, Load, Effects, PageRank, HITS, Community

* Equivalence: Structural(Profile, CONCOR), Regular(REGGE, CatRE), Role(Triad, Local), Sim Rank

* Position: Blockmodel(Conventional), Brokerage, Bow-Tie Model

* Properties: Multiple Network, Group, Modularity

* Models: Dyadic Interaction(P1), ERGM(P), Blockmodel(Generalized), Influence Network(Effects, Sequence)

* Two Mode: Degree, Eigenvector Centrality, Collaborative Filtering, Maximum Matching


===[ Modules for statistic analysis ]===
===[ Modules for statistic analysis ]===

* Statistics
* Frequency, Gini Coefficient, Power Law, Descriptives, Crosstabs, ANOVA, Correlation, Autocorrelation, Regression, Logistic Regression

* Multivariate Analysis: MDS, Correspondence, Cluster, Decomposition(Eigenvector, Singular, Spectral), Covariance Matrix, Principal Component, Factor Analysis


===[ Modules for network visualization ]===
===[ Modules for network visualization ]===

* Visualize
* Spring Layouts(2D/3D, 1-mode/2-mode): Kamada & Kawai, Stress Majorization, Eades, Fruchterman & Reingold, GEM, HDE(High-Dimensional Embedding)

* MDS (2D/3D): Classical MDS, Non-metric MDS, Kn-MDS

* Clustered Layouts (2D/3D): Clustered Eades, Clustered-CoLa

* Layered Layout (2D): Dig-Cola

* Circular Laytouts (2D): Circumference, Concentric, Radial

* Others (2D): Fixed, Random


===[ Modules for charts ]===
===[ Modules for charts ]===

Revision as of 05:44, 21 August 2012

NetMiner
Developer(s)Cyram Inc.
Initial releaseDecember 21, 2001 (2001-12-21)
Stable release
4 / July 27, 2011; 13 years ago (2011-07-27)
Written inJava
Operating systemWindows
Available inEnglish
TypeSocial Network Analysis / Visualization
Websitewww.netminer.com

What is NetMiner?

NetMiner is a software tool for exploratory analysis and visualization of large network data. NetMiner 4 embed internal Python-based script engine which equipped with the automatic Script Generator for unskilled users. Then the users can operate NetMiner 4 with existing GUI or programmable script language.

Key Features

Expressive Network Data Model

NetMiner has data structure that can express various types of nodes, links, node attributes and link attributes. In this flexible data structure, almost any social events and research topics can be modelled for network analysis.

Large Scale Network Data Processing

With proper license, computing power, and some algorithmic limitation, NetMiner can handle over 1,000,000 nodes and 10,000,000 links.

Comprehensive Network Measures and Methods

In NetMiner, about 140 analysis modules are mounted. There are 26 modules for data transformation, 66 modules for network analysis, 19 modules for statistical analysis, and 37 modules for network visualization and chart.

Standardized Option and Output Process

All of the analysis sessions in NetMiner are composed of five common analysis components that support standardized analysis process: input, pre-process, main-process, post-process, and output.

Intuitive User Interface

NetMiner is driven by GUI and provides intuitive workflow management interface for complicated analysis.

NetMiner Script Environment

NetMiner provides a python-based script environment. This can access most functions of NetMiner including data management, network analysis, and visualization. Users can create his or her own custom plugins by NetMiner script.

Mix and Match of Numerical and Graphical Output

Exploratory analysis based on the results is available on the graphical output panel. With this visual exploration, intuitive and dynamic interpretation of network analysis results is available.

Integrated Environment for Statistical Analysis

NetMiner provides built-in methods for statistical analysis and visualization.

Screenshots

  • screenshot pictures will be attached here.

File Formats

NetMiner data file format

  • .NMF

Importable/Exportable formats

  • Plain text data: .TXT, .CSV
  • MS-Excel data: .XLS, .XLSX
  • NetMiner 2 data: .NTF
  • UCINet data: .DL, .DAT
  • Pajek data: .NET, .VEC, .CLU, .PER
  • StOCNET data file: .DAT
  • Graph Modelling Language data: .GML(importing only)

Data Structure

Overview: Hierarchy of NetMiner Data Structure

A DataSet, composed of individual data items, is a basic unit for analysis and visualization in NetMiner. A DataSet is used as an input data for all the analysis and visualization Modules in NetMiner. A DataSet is composed of four types of data items: Main Nodeset, Sub Nodeset, 1-mode Network data and 2-mode Network data. A DataSet contains only one Main Nodeset. But multiple 1-mode Network data can be contained in a DataSet. A DataSet contains multiple Sub Nodesets and multiple 2-mode Network data for each Sub Nodeset can be contained in a DataSet.

ProcessLogs which are generated in the analysis and visualization process can be managed with the DataSet that is used for these analysis and visualization processes in a Workfile.

A Project contains independent multiple Workfiles. Number of Main Nodes in each Workfile does not need to be the same. In this way, the hierarchy of NetMiner data structure is as follow:

Project > Workfile > Dataset > Data items

Even if you perform a data analysis in NetMiner using only one data item, you need to recognize the hierarchy of data structure in NetMiner. The basic data file extension ‘NMF’ represents the Project which is in the highest level in the hierarchy of data structure. If you ‘open’ a NMF file, you need to select a Workfile to use in the analysis and then select data item(s) for each analysis and visualization module. Each level in the hierarchy of data structure in NetMiner will be covered in the following topics.

[ Data Item ]

Meta Matrix

The data structure of NetMiner can be explained through a Meta matrix, a matrix whose rows and columns are nodesets. Given that there are nodesets A, B, C and D, the meta set can be represented as shown below. Each cell of a meta matrix represents the relationship between two nodesets. The matrix below shows all the relationships of each nodeset.

Let’s take a closer look by examining the row B in particular. Looking at the relationship among the nodesets, it can be seen that it is related with all nodesets, including itself, centered around B. In NetMiner, a row of a meta matrix is defined as a single workfile, and the nodeset that acts as the reference is defined as a main nodeset, while other remaining nodesets are defined as sub nodesets. In the figure below, B is the main nodeset, while A, C and D are sub nodesets.

A network can be classified into either a 1-mode network or a 2-mode network depending on the nodeset of which the network is composed. In a 1-mode network, the relationships are within the same nodeset, while in a 2-mode network the relationships are between two different nodesets. In the figure below, the cell at row B and column B is a 1-mode network while the rest - the cells at row B and columns A, C and D - are 2-mode networks.

In NetMiner, there can be multiple networks composed of the same nodeset but with different inter-node relationships. In NetMiner a group of such networks are called layers. Looking at it with respect to the concept of meta matrices, layers can be considered to be a group of cells at the same location with multiple meta matrices piled on top of one another.

Main Nodeset ans Sub Nodeset

Nodesets refer to a group of nodes that share some common attribute. In NetMiner, nodesets are broken up into two types: main nodesets and sub nodesets. If there are multiple nodesets in a single workfile, a single nodeset can be defined as the main nodeset; the rest will be defined as sub nodesets. The main nodeset is a standard nodeset. Relationships within a nodeset can be defined only for a main nodeset. For relationship between two nodesets, it is defined between a main nodeset and a sub nodeset. This is to simplify user selection and processing by expressing multiple nodesets in a single dataset and yet focusing on a single nodeset for analysis.

Attribute of Nodeset

In NetMiner, multiple attributes can be defined for a node. Attributes can be number, text, datetime, time, and even missing value.

Node attributes are used for network data analysis or visualization. Specific modules of NetMiner require attributes as input data when a network is analyzed.

Using the attributes, only links that are above a specific value can be extracted to reorganize the network. The attributes can also be considered to be groups, so that a network of attributes can be built as well.

1-mode Network and 2-modeNetwork

A network refers to a group of links of a specific nodeset or those of between nodesets.

A network is defined by using a specific nodeset as the reference, which is called the reference nodeset for a particular network. A 1-mode network or a 2-mode network can be explained according to the type of the reference nodeset.

In 1-mode network, the group is among nodes that are in the same nodeset, and only one nodeset is involved. In NetMiner, networks that are based on a main nodeset are defined as 1-mode networks.

2-mode networks are composed of a group of relationships among nodes that belong to two different nodesets. They are based on two different nodesets. In NetMiner, networks that are based on a main nodeset and a sub nodeset are defined as 2-mode networks.

Representing a Network

A network can be represented as a Matrix Type or as a Link List Type. In Matrix Type, all nodes in a network can be represented including isolates. In Link List Type, only the nodes with links are represented. So the isolate nodes cannot be represented in Link List Type. But for this respect, Link List Type is more efficient to represent a network. In Matrix Type, the weight for each link is given by a value filled in matrix cell. In Link List Type, a link is represented as a row, each of which is composed of source node, target node and the weight.

Multiple Links in NetMiner are links (represented as multiple rows in Link List Type) which have identical source node and target node as follows.

Multiple Links cannot be represented as a Matrix Type. In Matrix Type, a weight for a link from source node to target node can be represented only in a cell in the matrix. So, in NetMiner, a network data including Multiple Link can be represented only as a Link List Type. User can select whether or not to merge the weights of the Multiple Links in the importing process using some methods as “Sum, Average, Max and Min”. And this network data can be represented in both representation types. Even if you imported a network data including Multiple Links without merge option, you need to merge the weights of the Multiple Links while running analysis and visualization process.

Multiple attributes can be defined for a link besides weight. The attribute of link is given to each link not to the two nodes of the link. The link attribute cannot be assigned to any node of each link. The attribute of link must be assigned to each link itself.

Links can be set not only in terms of weight but multiple attributes, which include various types such as number, text, datetime, time, and even missing value. Arbitrary link attribute values of number type can be specified as weight.

[ Data Set ]

Multiple Sub Nodeset

In a NetMiner dataset, there can be multiple nodesets. Only a single nodeset can be defined as the main nodeset and the rest are defined as sub nodesets. This is to simplify user selection and processing by expressing multiple nodesets in a single dataset and yet focusing on a single nodeset for analysis.

Layer

In NetMiner datasets, multiple networks that are based on the same reference nodeset but have different relationships can abe defined. For example, in a nodeset composed of six nodes {A, B, C, D, E, F}, the following two 1-mode network data can be represented.

1-mode Network 1 (Fighting) = {(A,B), (A,C), (B,D), (C,E)}

1-mode Network 2 (Love) = {(A,C), (B,C), (D,E), (A,E)}

A group of networks that share a reference nodeset is called a network layer. There can be 1-mode network layers where the main nodeset is used as the reference nodeset and 2-mode network layers where the main nodeset and a specific sub nodeset are used as the reference nodeset. If the reference nodeset is different, it can’t be defined as a layer. Therefore supposing that there are multiple sub nodesets and 2-mode networks that are connected to each sub nodeset, the number of layers that can be defined is the number of sub nodesets. That is, you can’t group everything together and define as a layer just because it’s a 2-mode network.

Multiple network data in a nodeset can be integrated in the analysis and visualization process. For example, you can extract the common links from the two 1-mode Networks and perform analysis with only that links or merge the weights of each network for analysis.

[ Workfile]

Workfiles are composed of a dataset and a process log. NetMiner uses the concept of a workfile to manage a dataset as well as analysis process using the dataset simultaneously.

A process log shows traces of process execution, and is composed of Session, QuerySets, and Selections. For Session, when a module is selected, a session is created and added to the list. For QuerySets and Selections, when a query or a selection is saved, it is added to the list. Query and Select are used when extacting only the parts in the current dataset that meet the criteria for analysis.

[ Project ]

In NetMiner, a single data file (.nmf) corresponds to a single project, a 1-to-1 correspondence. In a single project, there can be many workfiles.

There are two reasons for grouping multiple workfiles in a single project file. First is that in NetMiner, there can be only one main nodeset in a workfile. In NetMiner, although a 1-mode network that represents relationships among the nodes can be defined for a main nodeset, the same is not possible for sub nodesets. In addition, when it comes to a main nodeset, although a 2-mode network that represents relationships with a sub nodeset can be defined, for a sub nodeset, the same is not possible. For sub nodesets, only 2-mode networks that represent relationships with a main nodeset can be defined. Because of this characteristic, for cases in which a 1-mode network that represents relationships of nodes in a specific sub nodeset is needed or when a 2-mode network that represents relationships between the sub nodeset and another sub nodeset is needed, a new workfile is created in which the sub nodeset is made as the main nodeset. As for the second reason, by managing multiple workfiles in a single project file, management of similar data is made easier.

Workfiles are managed in a tree structure. A new workfile is created when changes occur with a main nodeset or when a data item to change is linked with a previously executed process. To reflect changes with the dataset in a new workfile, by default it is set as a child of the current workfile. The relationships between the parents and the childs form the tree structure. The user can track history of data changes using the tree structure.


Analytical Metrics

[ Modules for data transformation ]

  • Direction: Symmetrize, Transpose
  • Value: Dichotomize, Reverse, Normalize, Recode, Missing, Diagonal
  • NodeSet
  • LinkSet
  • Matrix
  • Layer
  • Mode
  • Random

[ Modules for network analysis ]

  • Neighbor: Degree(Neighbor), Ego Networks, Structural Hole, Homophily, Assortativity, Equicentrality
  • Subgraph: Dyad Census, Triad Census, Triad Combination
  • Connection: Shortest Path, All Path Finding, Dependency, Node Connectivity, Link Connectivity, Maximum Flows, Pfnet, Minimum Cutset, Influence, Accessibility
  • Cohesion: Component, Bi-Component, Clique, n-Clique, n-Clan, k-Plex, k-Core, Lambda Set, Community, Betweenness, Modularity, Eigenvector, Label Propagation, Cohesive Block, s-Clique
  • Centrality: Degree, Coreness, Closeness, Node Betweenness, Link Betweenness, Flow Betweenness, Eigenvector, Status, Power, R.W.Betweenness, Information, Load, Effects, PageRank, HITS, Community
  • Equivalence: Structural(Profile, CONCOR), Regular(REGGE, CatRE), Role(Triad, Local), Sim Rank
  • Position: Blockmodel(Conventional), Brokerage, Bow-Tie Model
  • Properties: Multiple Network, Group, Modularity
  • Models: Dyadic Interaction(P1), ERGM(P), Blockmodel(Generalized), Influence Network(Effects, Sequence)
  • Two Mode: Degree, Eigenvector Centrality, Collaborative Filtering, Maximum Matching

[ Modules for statistic analysis ]

  • Frequency, Gini Coefficient, Power Law, Descriptives, Crosstabs, ANOVA, Correlation, Autocorrelation, Regression, Logistic Regression
  • Multivariate Analysis: MDS, Correspondence, Cluster, Decomposition(Eigenvector, Singular, Spectral), Covariance Matrix, Principal Component, Factor Analysis

[ Modules for network visualization ]

  • Spring Layouts(2D/3D, 1-mode/2-mode): Kamada & Kawai, Stress Majorization, Eades, Fruchterman & Reingold, GEM, HDE(High-Dimensional Embedding)
  • MDS (2D/3D): Classical MDS, Non-metric MDS, Kn-MDS
  • Clustered Layouts (2D/3D): Clustered Eades, Clustered-CoLa
  • Layered Layout (2D): Dig-Cola
  • Circular Laytouts (2D): Circumference, Concentric, Radial
  • Others (2D): Fixed, Random

[ Modules for charts ]

  • Chart

Embedded Tools

  • Matrix Calculator
  • NetMiner Script Workbench

Release History

The first version of NetMiner was released on Dec 21, 2001. And there have been four major updates from 2001 to 2012.

2001.12.21:Ver 1.0.0 Released

  • Network analysis modules and network visualization modules were integrated to one package
  • User interface for data analysis and management was added
  • Generic data structure for multi-layer network was introduced

2003.4.9: Ver 2.0.0 Released

  • Modules for importing external data were introduced
  • Some measures and methods for network analyses and statistical analyses are added and improved
  • Some algorithms for visualization processes are added and improved

2007.5.15: Ver 3.0.0 Released

  • Data structure was improved for huge network analysis
  • Analysis and visualization modules are integrated to support standardized analysis processes
  • Data import modules to access external DB such as Oracle and MS SQL were introduced
  • Environments for visualization and analysis were integrated to one

2011.5.10:Ver 4.0.0 Released

  • Python-based NetMiner Script was introduced
  • Encryption module for nmf format was added
  • Current(2012.6.25) version: 4.0.1.f.111222

About Cyram

  • 회사소개