Module – 3
The node–link diagram family of visual encoding idioms uses the connection channel, where marks
represent links rather than nodes.
The second major family of network encoding idioms are matrix views that directly show adjacency
relationships.
Tree structure can be shown with the containment channel, where enclosing link marks show
hierarchical relationships through nesting.
Connection: Link Marks
most common visual encoding idiom for tree and network data is with node–link diagrams, where
nodes are drawn as point marks and the links connecting them are drawn as line marks.
This idiom uses connection marks to indicate the relationships between items.
Basic Layouts:
Node-Link layout
Spline radial layout
Rectangular horizontal node-link
Bubble tree
Networks are also very commonly represented as node–link diagrams, using connection.
Nodes that are directly connected by a single link are perceived as having the tightest grouping,
while nodes with a long path of multiple hops between them are less closely grouped.
Hops: network-oriented way to measure distances.
Node–link diagrams suited for tasks understanding the network topology:
Force-Directed Placement: widely used idioms for node–link network
network elements are positioned according to a simulation of physical forces where
nodes push away from each other while links act like springs that draw their
endpoint nodes closer to each other.
Easy to implement, understand and explain
nondeterministic, meaning that they will look different each time the algorithm is
run,
problems:
scalability
tiny graph is yield readable layouts, 100 nodes – hairball (you are screwed.. fr)
SFDP
Network of more than 5000 nodes and 10000 edges is using multilevel scalable
force-directed placement algorithm where the edges are colored by the length
(basically gay FDP)
Matrix Views: Network data can also be encoded with a matrix view
adjacency matrix: view, where all of the nodes in the network are laid out along the vertical and
horizontal edges of a square region and links between two nodes are indicated by coloring an area
mark in the cell in the matrix that is the intersection between their row and column. (Fancy way of
describing adj matrix also instead of 1 for link between 2 nodes it is color indicated)
Containment marks effective to show information in hierarchy compared to connection marks
showing pairwise relationship
The idiom of treemaps is an alternative to node–link tree drawings, where the hierarchical
relationships are shown with containment rath ner than connection.(gas it out bish)
compound network, which is the combination of a network and tree;
GrouseFlocks system, users can investigate multiple possible hierarchies and they are shown
explicitly.
Color Theory
Color Vision
Color Space
Luminance, Saturation and Hue
Transparency
Color Vision:
Retina has 2 diff kind of receptors
Rods: actively contribute to vision only in low light black and white low resolution.
(the 144p of human eyes)
Cones: Main sensors – three types of cones each with peak sensitivity and different
Wavelength within visible spectrum
3 opponent color channels:
Red to green
Blue to yellow
Black to white
Color blindness X Color deficiency v/
affects 8% of men affects red to green
Color Spaces
Important consideration in data visualization how color appear in different device/context
Rgb color space: device dependent, digital display most common
CMYK color space 0-100%,printing,
HSL color space H:360* S,L: 0-100%
Lab color Space: r/g axis b/y axis
Luminance Saturation and Hue
Luminance refers to the amount of light emitted or reflected by a color, and is often described as the
perceived brightness of the color.
Saturation refers to the purity or intensity of a color, and is often described as the richness or
vividness of the color.
Hue refers to the actual color of the light, Hue is often represented as an angle around a color wheel,
Transparency(gass it broo lessggooo)
strongly related to the other three color channels
information can be encoded by decreasing the opacity or fully opaque
fully transparent marks cannot convey any information at all with the other three channels.
Single Document Visualization
Word clouds, also known as text clouds or tag clouds, are layouts of raw tokens, colored and sized
by their frequency within a single document
Text clouds and their variations, such as a Wordle, are examples of visualizations that use only term
frequency vectors and some layout algorithm to create the visualization.
Word Tree visualization is a visual representation of both term frequency as well as their context
Size is used to represent the term or phrase frequency
The root of the tree is a user-specified word or phrase of
interest, and the branches represent the various contexts in
which the word or phrase is used in the document.
Text Arc is visual representation of how terms relate to the lines of text in which they appear
Elipse structure
Words with higher frequencies are drawn within th ellipse pulled by its occurrences on the circle
Arc Diagram visualization focused on displaying repetition in text or any sequence.
Repeated subsequences are identified and connected bysemicircular arcs.
thickness of the arcs represents the length of the
subsequence, and the height of the arcs represents the
distance between the subsequences.
Many classes of interaction techniques exist, including:
Navigation
Navigation (also sometimes referred to as exploration) is used to search for a
subset of data to be viewed, the orientation of this view, and the level of detail (LOD).
The subset in question may be one that is recognized by some visual pattern or
one on which further or more detailed exploration is desired.
Selection
In selection, the user isolates a subset of the display components, which will then
be subjected to some other operation, such as highlighting, deleting, masking, or moving to
the center of focus.
Selection can be articulated in many different ways.
Filtering
Filtering, as the name implies, reduces the volume of data to be visualized by
setting constraints specifying the data to be preserved or removed.
filtering is often done prior to viewing the data, to avoid overloading the data display.
Reconfiguring
Reconfiguring the data within a particular visualization can often be used to expose
features or cope with complexity or scale.
Encoding
encoding operations include those that modify the color map used, the size of
graphical entities, and their shape.
Connecting
A frequent use for selection operations is to link the selected data in one view to
the corresponding data in other views.
When the selection data is allowed to be interactively changed, the operator is called
brushing, in which case the user is continuously changing the selection in one view, and the
corresponding linked data in one or more other views is highlighted.
Abstracting/elaborating
it is often desirable to focus in on a subset of the data to acquire details
(elaboration) while reducing the level of detail (abstraction) on other parts of the data set.
One of the most popular techniques of this type is using distortion operators.
Hybrid
Document Collection and Visualizations
most cases of document collection visualizations, the goal is to place similar documents close to each
other and dissimilar ones far apart.
compute the similarity between all pairs of documents and determine a layout.
common approaches are clustering (k-means, hierarchical, expectation maximization (EM), support
vector), and self-organizing maps.
several document collection visualizations, such as self organizing maps, cluster maps, and
themescapes.
Self Organizing Map:
A self-organizing map (SOM) is an unsupervised learning algorithm using a collection of typically 2D
nodes,
Each node has an associated vector of the same dimensionality as the input vectors
We initialize the SOM nodes, typically with random weights.
As we iterate through the input vectors, the radius gets smaller.
Themescapes
Themescapes are summaries of corpora using abstract 3D landscapes in which height and color are
used to represent density of similar documents.
The taller mountains represent frequent themes in the document corpus
Document Cards
Document cards are a compact visualization that represents the document’s key semantics as a
mixture of images and important key terms,
key terms are extracted using an advanced text-mining approach
images and their captions are extracted using a graphical heuristic, semi=semantic image weighting