Title: | Replication Timing Profiling using DNA Copy Number |
---|---|
Description: | Create, Plot and Compare Replication Timing Profiles. The method is described in Muller et al., (2014) <doi: 10.1093/nar/gkt878>. |
Authors: | Dzmitry G Batrakou |
Maintainer: | Dzmitry G Batrakou <[email protected]> |
License: | GPL-3 |
Version: | 1.1.1 |
Built: | 2024-10-29 03:45:58 UTC |
Source: | https://github.com/cran/Repliscope |
A function to calculate Trep values from a sync-seq experiment calcTrep function fits a Boltzman sigmoid function into relative copy number datapoints for every genomic bin of the provided sync-seq merged dataframe. It then extracts time at which half of the cells have this genomic bin replicated (Trep). The output of the function is a dataframe containing Trep and TrepErr data for every genomic bin in a BED-like format.
calcTrep(ratioDFs, times)
calcTrep(ratioDFs, times)
ratioDFs |
A merged ratios dataframe containing sync-seq samples (dataframe). |
times |
Time series data in the same order as in the ratioDFs (numeric vector). |
TrepDF <- calcTrep(subset(syncSeq[["data"]],chrom=="chrI"),times=c(25,30,35,40,45,50,90))
TrepDF <- calcTrep(subset(syncSeq[["data"]],chrom=="chrI"),times=c(25,30,35,40,45,50,90))
compareRatios takes two ratio dataframes that were binned the same way and uses z-score statistics to find p-values of their differences. The function outputs a combined dataframe containing the two input ratio dataframes in a long format with added 'p.value' column.
compareRatios(ratio1, ratio2)
compareRatios(ratio1, ratio2)
ratio1 |
Ratio dataframe, or a string containing name of a ratio dataframe (dataframe or string). |
ratio2 |
Ratio dataframe, or a string containing name of a ratio dataframe (dataframe or string). |
ratioDFs <- compareRatios(W303norm,Dbf4myc)
ratioDFs <- compareRatios(W303norm,Dbf4myc)
Sequence read coverage ratios for Dbf4-9myc sample (T9394 strain). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.402 to put the values onto biologically relevant relative copy number scale from 1 to 2.
data(Dbf4myc)
data(Dbf4myc)
data frame with 11356 rows and 7 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
replicating sample name
non-replicating sample name
ratio value in the current bin
adjustment factor used for the current ratio
S phase sample: SRA; G2 sample: SRA
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(Dbf4myc)
data(Dbf4myc)
Guide dataframe for plotting smoothed sortSeq data
data(guide)
data(guide)
Dataframe with 2 rows and 6 variables:
Order to plot data in
Name of replicating sample
Name of non-replicating sample
Should raw data be plotted?
Should smooth data be plotted?
Color to plot the profile in
data(guide)
data(guide)
The function reads BED formatted files. The BED file format defined by UCSC: http://genome.ucsc.edu/FAQ/FAQformat. First three columns ("chrom", "chromStart", "chromEnd") are mandatory. The file fields may be separated by tabs, spaces or commas. If the BED file contains a header, it will be ignored. If a genome mask dataframe is provided, only data intersected with the mask will be retained. Resulting data is ordered by "chromStart" columns.
loadBed(file, genome = NULL, name = NULL)
loadBed(file, genome = NULL, name = NULL)
file |
Path to the BED file (string, mandatory) |
genome |
A mask dataframe to exclude data from the BED file (dataframe, optional). The genome dataframe must contain "chrom" column and may further contain "chromStart" and "chromEnd" columns in this order. |
name |
A string to replace the 'name' column of the loaded BED file with (string, optional). |
W303_G2 <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"), name='W303_G2') W303_G2_chrI <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"), name='W303_G2',genome=sacCer3[["genome"]])
W303_G2 <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"), name='W303_G2') W303_G2_chrI <- loadBed(system.file("extdata/W303_G2.bed",package="Repliscope"), name='W303_G2',genome=sacCer3[["genome"]])
makeGenome is called by plotGenome() and plotCoverage() functions if a genome dataframe is not provided. It creates a BED-like dataframe containing unique chromosome names, their start coordinates (assumed 0), their ends (highest value in the corresponding 'chromEnd' of the BED dataframe) and 'midY' column containing half the max value of the "score" or "ratio" columns per chromosome. This later serves as y coordinate to add chromosome names. Axis name is passed via comment to the output dataframe. Extract it with 'attributes(genome)$axisName'.
makeGenome(DF, region = FALSE)
makeGenome(DF, region = FALSE)
DF |
A BED or ratio dataframe containing either 'score' or 'ratio' column (dataframe). |
region |
String in the format 'chrI:1000-3000' (string, optional). |
genomeDF <- makeGenome(W303_G2)
genomeDF <- makeGenome(W303_G2)
makeLabels is called by plotGenome() and plotCoverage() functions. It creates a dataframe containing two columns: 'ticks' and 'labels'. 'Ticks' contains axis ticks coordinates, 'labels' will contain human readable lables for the ticks (using prefixes and optional units).
makeLabels(theMin, theMax, unit = "")
makeLabels(theMin, theMax, unit = "")
theMin |
Minimum value for the scale (double). |
theMax |
Maximum value for the scale (double). |
unit |
Unit to use for the labels (string, optional) |
labels <- makeLabels(0,1200000,"b")
labels <- makeLabels(0,1200000,"b")
A function to calculate 'score' ratio between two bed dataframes makeRatio merges two supplied bed dataframes, calculates ratio of their "score" values normalises the ratio by the 'score' sums.
makeRatio(bedRep, bedNonRep)
makeRatio(bedRep, bedNonRep)
bedRep |
Bed dataframe containing read counts from a replicating sample (dataframe). The bed dataframe must contain "chrom","chromStart", "chromEnd" and "score" columns. |
bedNonRep |
Bed dataframe containing read counts from a non-replicating sample (dataframe). The bed dataframe must contain "chrom","chromStart", "chromEnd" and "score" columns. |
ratioDF <- makeRatio(W303_S,W303_G2)
ratioDF <- makeRatio(W303_S,W303_G2)
Replication profile for H.volcanii wild isolate DS2. Genomic DNA for deep sequencing was isolated from 100 ml culture in stationary phase (A650 > 1, DS2_stat sample) or 1 litre in exponential phase (A650 0.1, DS2_exp sample). Unique reads for the two samples were calculated in 1 kb genomic bins using ASM2568v1 genome assembly. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.12 to put the values onto biologically relevant relative copy number scale from 1 to 2.
data(MFAseq)
data(MFAseq)
data frame with 3887 rows and 7 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
replicating sample name
non-replicating sample name
ratio value in the current bin
adjustment factor used for the current ratio
DS2_exp exponential phase sample: SRA; DS2_stat stationary sample: SRA
Hawkins et al. (2013) Nature 503(7477):544-547 (PubMed)
data(MFAseq)
data(MFAseq)
A function to normalise ratio values from 'ratio' column of the provided dataframe to fit biologically-relevant scale. It scales values either using supplied 'rFactor' value or automatically to best fit 1 to 2 scale (the upper limit of the scale may be adjusted with the upperLimit parameter). Normalisation factor used is stored in 'ratioFactor' column and also passed as the dataframe comment. To extract it, use 'attributes(mergedBed)$comment'
normaliseRatio(ratioDF, rFactor = NULL, upperLimit = 2, replace = TRUE)
normaliseRatio(ratioDF, rFactor = NULL, upperLimit = 2, replace = TRUE)
ratioDF |
A ratio dataframe containing 'ratio' column (dataframe). |
rFactor |
Value to normalise by, related to replication progression (numeric, optional). |
upperLimit |
Top value for the scale, defaults to 2 (numeric, optional, defaults). |
replace |
Should the existing 'ratio' values be overwritten or stored in a new column (boolean, defaults to TRUE). |
ratioDF <- normaliseRatio(W303) ## scales to 1 to 2 range, replaces original values. ratioDF <- normaliseRatio(W303,rFactor=1.41,replace=FALSE) # (multiplies score values by 1.41 and keeps the original values)
ratioDF <- normaliseRatio(W303) ## scales to 1 to 2 range, replaces original values. ratioDF <- normaliseRatio(W303,rFactor=1.41,replace=FALSE) # (multiplies score values by 1.41 and keeps the original values)
A function to boxplot 'score' column of a BED dataframe, per unique chromosome name in the 'chrom' column. The resulting plot also highlights outliers based on the inter quartile range (IQR). The genome wide median is plotted as a pink line through the boxplots.
plotBed(bed, plotting = TRUE)
plotBed(bed, plotting = TRUE)
bed |
A dataframe containing 'score' and 'chrom' columns (dataframe). |
plotting |
Should the plot object be sent to the default device? (boolean, defaults to TRUE). |
plotBed(W303_S) plotObject <- plotBed(W303_G2,plotting=FALSE)
plotBed(W303_S) plotObject <- plotBed(W303_G2,plotting=FALSE)
A function to scatterplot 'score' column of a BED dataframe plotCoverage function plots values in the 'score' column of the supplied bed dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.
plotCoverage(bed, region = FALSE, plotting = TRUE)
plotCoverage(bed, region = FALSE, plotting = TRUE)
bed |
A dataframe containing 'score','chrom','chromStart' and 'chromEnd' columns (dataframe). |
region |
Only plot for the provided region in the format 'chrI:1000-3000' (string, optional). |
plotting |
Should the plot object be sent to the default device? (boolean, defaults to TRUE). |
plotCoverage(W303_G2) plotObject <- plotCoverage(W303_S,plotting=FALSE)
plotCoverage(W303_G2) plotObject <- plotCoverage(W303_S,plotting=FALSE)
plotGenome plots scatterplot/barplot/polygon of 'score' and/or 'splineSmooth' columns values by genomic coordinates, either for the whole genome or a region. It also allows annotation with additional BED-formatted data. Ggplot2 package is used for plotting.
plotGenome( ratioDFs, geom = "geom_point", ylims = c(1, 2), plotting = TRUE, genome = NULL, region = FALSE, guide = NULL, lines = NULL, circles = NULL, rectangles = NULL, pointers = NULL, colourLines = "#00FF00", colourCircles = "#FFFFFF", colourRectangles = "#FF0000", colourPointers = "#FF7F00" )
plotGenome( ratioDFs, geom = "geom_point", ylims = c(1, 2), plotting = TRUE, genome = NULL, region = FALSE, guide = NULL, lines = NULL, circles = NULL, rectangles = NULL, pointers = NULL, colourLines = "#00FF00", colourCircles = "#FFFFFF", colourRectangles = "#FF0000", colourPointers = "#FF7F00" )
ratioDFs |
A ratio dataframe or combined ratios dataframe containing 'ratio' column (dataframe). |
geom |
ggplot2 geom to use for plotting: "geom_point","geom_ribbon" or "geom_segment" (string, defaults to "geom_point"). |
ylims |
A vector of two values for y axis limits - first is lowest, second is highest (numeric vector, defaults to 1 and 2) |
plotting |
Should the plot object be sent to the default device? (boolean, defaults to TRUE). |
genome |
A mask dataframe to exclude data from the ratio dataframe (dataframe, optional). The genome dataframe must contain "chrom","chromStart" and "chromEnd" columns. |
region |
Only plot for the provided region in the format 'chrI:1000-3000' (string, optional). |
guide |
A dataframe guiding the plotGenome function how to plot the data (dataframe, optional). The guide dataframe must contain the following columns: 'order' (integer) - order to plot data in, 'name.rep' (character) - replicating sample name that matches the one in the ratioDFs dataframe, 'name.nonRep' (character) - non-replicating sample name that matches the one in the ratioDFs dataframe, 'raw' (logical) - should the raw raw data be plotted? 'smooth' (logical) - should the smoothed data be plotted? 'color'(character) - R color to plot the current sample with, both raw and smoothed data. |
lines |
Additionally plot vertical lines from a BED formatted dataframe (dataframe, optional). |
circles |
Additionally plot circles on the chromosome line from a BED formatted dataframe (dataframe, optional). |
rectangles |
Additionally plot rectangles on the chromosome line from a BED formatted dataframe (dataframe, optional). |
pointers |
Additionally plot downward pointing triangles from a BED formatted dataframe (dataframe, optional). |
colourLines |
Colour for 'lines' data (string, defaults to green). |
colourCircles |
Colour for 'circles' data (string, defaults to white). |
colourRectangles |
Colour for 'rectangles' data (string, defaults to red). |
colourPointers |
Colour for 'pointers' data (string, defaults to orange). |
plotGenome(sortSeq,geom="geom_ribbon",guide=guide,region="chrIX:250000-439885", lines=sacCer3[["cen"]],circles=sacCer3[["ori"]]) # plot data as polygon for the specified region of chromosome 9 plotGenome(syncSeq[["data"]],geom="geom_segment",guide=syncSeq[["guide"]], region="chrVII:0-1090944",genome=sacCer3[["genome"]],lines=sacCer3[["cen"]], circles=sacCer3[["ori"]],colourLines="black") plotGenome(MFAseq,region='chr1:0-2848000') # plot marker frequency analysis for H.volcanii isolate DS2
plotGenome(sortSeq,geom="geom_ribbon",guide=guide,region="chrIX:250000-439885", lines=sacCer3[["cen"]],circles=sacCer3[["ori"]]) # plot data as polygon for the specified region of chromosome 9 plotGenome(syncSeq[["data"]],geom="geom_segment",guide=syncSeq[["guide"]], region="chrVII:0-1090944",genome=sacCer3[["genome"]],lines=sacCer3[["cen"]], circles=sacCer3[["ori"]],colourLines="black") plotGenome(MFAseq,region='chr1:0-2848000') # plot marker frequency analysis for H.volcanii isolate DS2
A function to plot a histogram of supplied ratio vector plotRatio plots histogram of values in a supplied vector using ggplot2 and highlights interval between 1 and 2 in green.
plotRatio(ratio, plotting = TRUE)
plotRatio(ratio, plotting = TRUE)
ratio |
A numeric vector containing raw or smoothed ratio values (vector). |
plotting |
Should the plot object be sent to the default device? (boolean, defaults to TRUE). |
plotRatio(W303$ratio) plotObject <- plotRatio(W303$ratio,plotting=FALSE)
plotRatio(W303$ratio) plotObject <- plotRatio(W303$ratio,plotting=FALSE)
A function to scatterplot 'Trep' column of a Trep dataframe plotTrep function plots values in the 'Trep' column of the supplied dataframe as a function of chromosome coordinates. The genome wide median is plotted as a pink line.
plotTrep(TrepDF, region = FALSE, plotting = TRUE)
plotTrep(TrepDF, region = FALSE, plotting = TRUE)
TrepDF |
A dataframe containing 'chrom','chromStart', 'chromEnd' and 'Trep' columns (dataframe). |
region |
Only plot for the provided region in the format 'chrI:1000-3000' (string, optional). |
plotting |
Should the plot object be sent to the default device? (boolean, defaults to TRUE). |
plotTrep(TrepDF,region="chrI")
plotTrep(TrepDF,region="chrI")
A function to remove single chromosome data from a bed dataframe
rmChr(bed, chr)
rmChr(bed, chr)
bed |
A bed dataframe containing 'chrom' column (dataframe, required). |
chr |
Chromosome to remove (string, required). |
bedDF <- rmChr(W303_S,"chrM") ## removes mitochondria
bedDF <- rmChr(W303_S,"chrM") ## removes mitochondria
A function to remove outliers from the "score" column of a supplied bed dataframe There are three methods: max, IQR and median. Max is used to remove 1 or more maximum values; IQR uses interquartile range to detect outliers, while median method can be used to remove data based on genome-wide median.
rmOutliers(bed, method, n = 1, range = 3, loLim = 0.25, hiLim = NULL)
rmOutliers(bed, method, n = 1, range = 3, loLim = 0.25, hiLim = NULL)
bed |
A dataframe containing 'score' column (dataframe, required). |
method |
Method to detect outliers: "max", "IQR" or "median" (string). |
n |
Number of max values to remove (integer,defaults to 1). Use with "max" method. |
range |
Number of IQR above the 3rd or below the 1st IQR to set the threshold (double, defaults to 3). Use with "IQR" method. |
loLim |
Low limit for the median method (double, defaults to 0.25). |
hiLim |
High limit for the median method (double). |
bedDF <- rmOutliers(W303_S,method="max",n=2) ## removes 2 rows of data containing 3 top values bedDF <- rmOutliers(W303_S,method="IQR",range=3) ## removes datapoints outside 3 x IQR above the 3rd # and below the 1st IQR. bedDF <- rmOutliers(W303_S,method="median",loLim=0.25,hiLim=2) # removes datapoints that are lower # than 0.25 x genome median or above 2 x genome median.
bedDF <- rmOutliers(W303_S,method="max",n=2) ## removes 2 rows of data containing 3 top values bedDF <- rmOutliers(W303_S,method="IQR",range=3) ## removes datapoints outside 3 x IQR above the 3rd # and below the 1st IQR. bedDF <- rmOutliers(W303_S,method="median",loLim=0.25,hiLim=2) # removes datapoints that are lower # than 0.25 x genome median or above 2 x genome median.
A function to launch Repliscope in interactive mode (Shiny app).
runGUI()
runGUI()
sacCer3 meta information: chromosome sizes, centromere and replication origin positions.
data(sacCer3)
data(sacCer3)
List containing three dataframes
Chromosome information dataframe
Centromere information dataframe
Replication origin information dataframe
Replication origin information: (OriDB)
Siow et al. (2011) NAR 40(Database issue):D682-6 (PubMed)
data(sacCer3)
data(sacCer3)
A function to smooth ratio values using cubic smoothing spline smoothRatio function splits values from 'ratio' column by chromosome and based the supplied 'groupMin' and 'split' parameters and then applies smooth.spline() function from R stats package. The supplied dataframe may contain multiple ratios, i.e. ratios produced using multiple replicating samples and/or multiple non-replicating samples. This must be reflected in 'name.rep' and 'name.nonRep' columns. In other words, different ratio dataframes may be merged using rbind() function before calling smoothRatio() function.
smoothRatio(ratioDF, groupMin = 5, splitNum = 5)
smoothRatio(ratioDF, groupMin = 5, splitNum = 5)
ratioDF |
A ratio dataframe or combined ratios dataframe containing 'ratio' column (dataframe). |
groupMin |
Minimum number of values required to make a group (integer, defaults to 5). |
splitNum |
Minimum number of adjacent bins with missing values to close current group (integer, defaults to 5). |
ratioDF <- smoothRatio(W303norm)
ratioDF <- smoothRatio(W303norm)
Replication profiles for wild type and Dbf4-9myc samples (T7107 and T9394 strains). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.41 and 1.402 for wild type and Dbf4-9myc samples, respectively, to put the values onto biologically relevant relative copy number scale from 1 to 2. The relative copy number values were smoothed using cubic spline and compared using z score statistics.
data(sortSeq)
data(sortSeq)
data frame with 22696 rows and 10 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
replicating sample name
non-replicating sample name
ratio value in the current bin
adjustment factor used for the current ratio
Group number of the current bin
Smoothed ratio value
Significance of ratio difference between Dbf4myc and W303 samples
Dbf4myc S phase sample: SRA; Dbf4myc G2 sample: SRA; W303 S sample: SRA; W303 G2 sample: SRA
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(sortSeq)
data(sortSeq)
Replication profiles of wild type S. cerevisiae arrest-release samples (AUY077 strain). The cells were arrested in G1 with alpha-factor followed by release using pronase. The samples were collected before the release (aFactor) and various time intervals after the release (25min,30min,35min,40min,45min,50min and 90min) Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (post-release) and non-replicating (aFactor) samples were calculated in 1 kb genomic bins. The ratios were created by dividing 'score' values from replicating samples by non-replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted based on bulk genome replication (as determined by flow cytometry), to put the values onto biologically relevant relative copy number scale from 1 to 2. The relative copy number values were smoothed using cubic spline.
data(syncSeq)
data(syncSeq)
List containing two data frames
syncSeq replication profiles data. Columns: chrom (short chromosome name), chromStart (left chromosome coordinate), chromEnd (right chromosome coordinate), name.rep (replicating sample name), name.nonRep (non-replicating sample name), ratio (ratio value in the current bin), ratioFactor (adjustment factor used for the current ratio), group (Group number of the current bin), splineSmooth (Smoothed ratio value)
Guide dataframe for plotting the syncSeq data order (Order to plot data in), name.rep (Name of replicating sample), name.nonRep (Name of non-replicating sample), raw (Should raw data be plotted?), smooth (Should smooth data be plotted?), color (Color to plot the profile in)
Müller et al. (2014) NAR 42(1):e3 (PubMed)
data(syncSeq)
data(syncSeq)
Trep is median replication time, expressed in minutes after release G1 arrest. It is calculated from multiple relative copy number datapoints across timeseries of a cell cycle experiment. For every genomic bin, a sigmoid function is fitted and its midpoint is reported.
data(TrepDF)
data(TrepDF)
data frame with 11341 rows and 5 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
calculated Trep value
error from sigmoid function fitting
Müller et al. (2014) NAR 42(1):e3 (PubMed)
data(TrepDF)
data(TrepDF)
A function to remove outliers from the "ratio" column of a supplied ratio dataframe trimRatio is applied to the calculated ratio of read counts from a replicating to a non-replicating samples.
trimRatio(ratioDF, loLim, hiLim)
trimRatio(ratioDF, loLim, hiLim)
ratioDF |
A ratio dataframe containing 'ratio' column (dataframe, required). |
loLim |
Low limit threshold (double, required). |
hiLim |
High limit threshold (double, required). |
W303 <- trimRatio(W303,0.5,1.5)
W303 <- trimRatio(W303,0.5,1.5)
Sequence read coverage ratios for wild type sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads.
data(W303)
data(W303)
data frame with 11350 rows and 7 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
replicating sample name
non-replicating sample name
ratio value in the current bin
adjustment factor used for the current ratio
S phase sample: SRA; G2 sample: SRA
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(W303)
data(W303)
Sequence read coverage for wild type non-replicating sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into G2/M phase fraction. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads were calculated in 1 kb genomic bins.
data(W303_G2)
data(W303_G2)
data frame with 11350 rows and 5 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
sample name
read number in current bin
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(W303_G2)
data(W303_G2)
Sequence read coverage for wild type replicating sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into S phase fraction. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads were calculated in 1 kb genomic bins.
data(W303_S)
data(W303_S)
data frame with 11820 rows and 5 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
sample name
read number in current bin
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(W303_S)
data(W303_S)
Sequence read coverage ratios for wild type sample (T7107 strain). The cells were stained with DNA dye and sorted based on DNA content into S or G2/M phase fractions. Extracted DNA was sequenced and mapped to sacCer3 genome. Unique reads for replicating (S) and non-replicating (G2/M) samples were calculated in 1 kb genomic bins. The ratio was created by dividing 'score' values from replicating sample by non- replicating sample 'score' values, adjusted by total number of reads. The ratio values were further adjusted by multiplying them by 1.41 to put the values onto biologically relevant relative copy number scale from 1 to 2.
data(W303norm)
data(W303norm)
data frame with 11340 rows and 7 variables:
short chromosome name
left chromosome coordinate
right chromosome coordinate
replicating sample name
non-replicating sample name
ratio value in the current bin
adjustment factor used for the current ratio
S phase sample: SRA; G2 sample: SRA
Natsume et al. (2013) Mol Cell 50(5):661-74 (PubMed)
data(W303norm)
data(W303norm)