Chapter 2 Data Import

The first key step in using Venndir is to convert data into a recognized format. The two native formats are the list of sets (setlist) and the list of signed sets (signed setlist).

Several input formats are recognized, beginning with the two native formats:

  1. List of sets, referred to as a setlist
    • a list of character item vectors.
    • set names are stored in names(setlist).
  2. List of signed sets, referred to as a signed setlist
    • a list of signed numeric vectors.
    • signed vectors use values -1 or 1, and store items in names().
    • set names are stored in names(setlist).
  3. Incidence matrix
    • a numeric matrix with items as rownames(), sets as colnames().
    • items are indicated by the value 1 in the appropriate row and column.
  4. Signed incidence matrix
    • a numeric matrix with items as rownames(), sets as colnames().
    • items are indicated by -1 or 1 in the appropriate row and column.
  5. Overlap counts
    • a numeric vector with the number of items in each overlap.
    • the overlap name is stored in names(), in format: 'A', 'B', 'A&B'.
  6. Signed overlap counts
    • a list of numeric vectors, named by overlap set.
    • each overlap is also a list named by the directionality.
  7. Other formats
  8. Specialized formats
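
The first five formats can be sketched by hand in base R, which may help clarify how they relate to one another. This is a minimal illustration, not Venndir's own import code: the item names, the object names, and the use of 0 for "item not in set" are assumptions made for this example only.

```r
# 1. List of sets (setlist): a named list of character item vectors.
setlist <- list(
  A = c("item1", "item2", "item3"),
  B = c("item2", "item3", "item4"))

# 2. Signed setlist: a named list of numeric vectors with values -1 or 1,
#    with the items stored in names().
signed_setlist <- list(
  A = c(item1 = 1, item2 = -1, item3 = 1),
  B = c(item2 = -1, item3 = -1, item4 = 1))

# 3. Incidence matrix: items as rownames(), sets as colnames(),
#    with 1 marking membership (0 for absence is assumed here).
items <- unique(unlist(setlist))
im <- sapply(setlist, function(s) as.numeric(items %in% s))
rownames(im) <- items

# 4. Signed incidence matrix: same layout, with -1 or 1 marking membership.
signed_im <- sapply(signed_setlist, function(s) {
  v <- s[items]        # look up each item by name; absent items give NA
  v[is.na(v)] <- 0     # assumed convention: 0 means "not in this set"
  unname(v)
})
rownames(signed_im) <- items

# 5. Overlap counts: a numeric vector named by overlap, e.g. 'A', 'B', 'A&B'.
overlap_names <- apply(im, 1, function(r) {
  paste(colnames(im)[r == 1], collapse = "&")
})
overlap_counts <- lengths(split(items, overlap_names))
```

Here names(setlist) carries the set names in both list formats, and the overlap counts are derived from the incidence matrix by labelling each item with the combination of sets that contain it.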