To Do

  1. Build input for Nauvoo data, similar to http://drc.daacs.org/ceramic-artifact.html
    • Straight HTML page for the site in drc.iath.virginia.edu
  2. Pick the AQ subset and look at the matriarchal and patriarchal lineage differences

Notes

  • Metrics have usefulness in two different areas
    • CS interest
    • Semantic interest (humanities side)
  • Female-oriented chords are not marriages, they are more of a social structure
    • From the male-oriented point of view, we say a woman joins the marriage, however
    • From a female-oriented point of view, we aren’t saying that, ie, Brigham Young joins Zina Huntington’s marriage. There is something fundamentally different in these two views.
  • Really need to look at studying the differences in the matriarchal flow vs the patriarchal flow
    • Measures about these different flows
    • Nodes change fundamentally between the two (Zina’s POV vs Henry’s POV – the nodes mean something completely different)
    • Not really a dual of each other; the dots are sliced and diced between the two different views
      • Sociologically different
      • Parts of the male marriage are scattered across the matriarchal view
  • Look at similar diagrams
    • Zina’s levels = 1 compared with one of her sisters’ levels = 1 would be different
      • Some overlap in parents and siblings, and perhaps some of the husbands
      • Differences in children, husbands, etc
  • Pick the AQ subset and look at the matriarchal and patriarchal lineage differences

To Do

  1. Finish the abstract.
  2. Investigate the connected component claim of the 2003 paper that finding connected components in an evolving graph is NP Complete
    • Is it NPC? What part of the temporal aspect makes it NPC?
      • Would it still be NPC in our different temporal domains? Clearly there would be variation from a single point in time to an interval to multiple points in time.
    • What is the definition of a connected component in a temporal graph?
      • How would that change if we use the different notions of temporal domain (now, interval, disjoint interval, all time)?
    • Would it have different meanings at different time points?
  3. Revisit the 2012 Temporal clustering paper
    • What do they mean by cluster? temporal cluster?
  4. Look at the wikipedia datasets. Does an article node change over time? If the article is merged with another, is that reflected? Or if an article is split?
    • Are there document-document edges along with editor-document edges?

Notes

  1. We have these different notions of time that we want to investigate. Different “places” to do our temporal measures:
    • Now (single point in time)
    • Contiguous Interval (interval with start and end points, inclusive)
    • Non-contiguous Interval (set of intervals, with some points in time not included, possibly disjoint)
    • All time (every point in time)
  2. For disjoint intervals, how do we look at the gaps and include edges or nodes?
    • Do we include edges that are extant completely across the gap(s)?
      • Edges or nodes that exist in all intervals included?
      • Edges or nodes that exist throughout the gaps are included in the interval? That is, for an edge to be considered in a disjoint interval, it must continue to exist from the end of one interval through the beginning of the next to be considered.
  3. We need to look back at how vertices change over time
    • We originally had a notion that connectedness is important based on some inherent characteristics of the node
    • So, for example, we only want to consider circuses that have more than 3 travelling groups
      • When a circus gains a third or loses its third group, the node changes importance to us. It changes whether or not we consider it, or how we rank it.
    • Or, another example, we only want to consider wikipedia articles that have $x$ or more words from a specific list in them.
      • When an article is edited and the $x$th word is added by the editor, that document is now more important to us, or important enough to include as a node in our graph
      • When the $x$th word is removed so the article only has $x-1$ special words, we then treat that document differently
    • Filtering vertices out that don’t have the properties we’re looking for.
      • Alternatively treating them differently

Suggestions

  1. Would minimum spanning tree algorithms be helpful or a useful exercies?
  2. Look into all pairs shortest path, since single source has been done and use DFS and BFS.

Committee Selection

Research advisor, a chair, a minor representative (outside the student’s department and major curriculum study area), and at least one other person.

Committee Members:

  • Gabe Robins
  • Alf Weaver
  • Worthy Martin (Adviser)
  • Glenn Wasson (Outside UVA Contact)

Possible Committee Members:

  • Dave Evans - willing, but would like a domain expert on the committee

Alternate Members:

  • Jim Cahoon

Minor Representative Possibilities:

  • Jeff Holt (Statistics)
  • Dan Keenan (Statistics)
  • Matt Gerber (Systems and Information Engineering)

To Do

Dave’s Suggestions

  • Look up publication venues and see if these types of work are the norm for those venues or the exception.
  • Look up which professors at UVA are publishing in those venues and who would be a good fit to know about this area.
  • Talk with Hongning Wang

Notes

From DHF

  • Other uses for Evolving Graphs
    • Library taxonomies
    • SNAC as archival collections connected by people referenced in them.
    • Historical applications (Ex: protestant relations/connections across the revolution, from Kate here at DHF2014—she should email me soon)
  • Prof to look into
    • Jeffrey Heer

Make sure to include in the presentation:

  1. List a selection of publications that have already been done
  2. List the upcoming and hopeful publications for the future:
    • In the list of future plan items, say where they are hoped to be published and when they are anticipated to be done
  3. Include lots of related work (outside UVA)

Other Notes:

  • Be ready to field detailed and in-depth questions about what you’re doing, what you plan to do, and what others are doing

Questions

  • What are some unexpected things you’ve run into? Unexpected challenges?

Other applications

  1. Electricity coming into a home looks like a DAG (power leaving grid, multiple large panel boxes per building, and then smaller panel boxes, then at the end the individual appliances hanging off them). This DAG will change over time as new items are plugged in and as new buildings are brought into the grid.

To Do

  1. Update the data entry sketch to include the name dropdown for names used at each point (Sealing, Rite, Marriage).
  2. Update the database to add a FK to TempleRites, PersonMarriage, and NonMaritalSealings with a link to the Name table as Name (Action Taken) As