Table 1 Sample set and sequencing data summary

From: Genome-wide somatic mutation analysis of sinonasal adenocarcinoma with and without wood dust exposure

| Sample | Wood dust | Exposure level | Exposure probability | Exposure type | Tobacco | Subtype | Tumor cell % | No. of libraries | Coverage (avg) | Coverage (sd) | Coverage (median) | Duplicate % |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNC12 | exposed | medium | probable | S, H | NA | ITAC | 50 | 1 | 30.3 | 69.2 | 29 | 13.8 |
| SNC19 | exposed | medium | definite | S | smoker | ITAC | 20 | 1 | 34.3 | 69.9 | 33 | 16.3 |
| SNC41 | non-exposed | no exposure | no exposure | no exposure | non-smoker | ITAC | 40 | 2 | 31 | 72.7 | 30 | 11.4 |
| SNC48 | non-exposed | no exposure | no exposure | no exposure | NA | non-ITAC | 90 | 4 | 67.2 | 154.2 | 64 | 10.6 |
| SNC72 | non-exposed | no exposure | no exposure | no exposure | smoker | non-ITAC | 25 | 2 | 33.8 | 96.2 | 32 | 11.3 |
| SNC78 | exposed | medium | probable | S, H | NA | ITAC | 40 | 1 | 34.3 | 66 | 31 | 15 |
| SNC105 | exposed | medium | definite | S, H | smoker | ITAC | 30 | 1 | 29.4 | 62.3 | 27 | 17.4 |
| SNC131 | exposed | low | definite | S, H | NA | non-ITAC | 60 | 5 | 54.2 | 131.4 | 47 | 10.2 |
| SNC142 | exposed | medium | definite | S, H | NA | ITAC | 30 | 3 | 82.9 | 164.1 | 79 | 10.2 |
| SNC176 | exposed | medium | definite | S, H | non-smoker | non-ITAC | 80 | 1 | 36.5 | 66.4 | 33 | 14.6 |
| SNC186 | non-exposed | low | possible | no exposure | non-smoker | non-ITAC | 90 | 4 | 53.6 | 121 | 48 | 10.1 |
| SNC214 | exposed | medium | definite | S, H | smoker | ITAC | 70 | 5 | 84.4 | 196.7 | 78 | 7.5 |
| SNC215 | non-exposed | no exposure | no exposure | no exposure | smoker | non-ITAC | 80 | 2 | 47.1 | 129.4 | 42 | 20.6 |
| SNC229 | exposed | medium | probable | S, H | smoker | ITAC | 70 | 4 | 62.4 | 147.6 | 58 | 11.3 |
| SNC232 | non-exposed | no exposure | no exposure | no exposure | non-smoker | non-ITAC | 80 | 1 | 28.6 | 63.5 | 27 | 12.4 |
| SNC233 | exposed | medium | definite | S | smoker | non-ITAC | 70 | 1 | 30.7 | 72.6 | 29 | 11.9 |

Tumor cell percentage was determined by visual inspection of hematoxylin-eosin-stained tissue slides. Multiple libraries were prepared per sample when sample availability permitted. Coverage indicates the number of sample sequence reads aligning to reference sequence bases, presented as averages (avg), standard deviations (sd) and medians over the genome. Duplicate percentage indicates the fraction of sequence reads identified as duplicates by Picard’s MarkDuplicates tool, and is calculated as the average of all libraries when applicable. S = softwood exposure, H = hardwood exposure, ITAC = intestinal-type adenocarcinoma.
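
The summary statistics described in the footnote can be illustrated with a minimal sketch. It assumes per-base read depths are already available as a list of numbers and that each library has its own duplicate percentage. The function names and the toy values are hypothetical, and the use of the population standard deviation is an assumption, since the source does not state which form was used.

```python
from statistics import mean, median, pstdev


def coverage_summary(depths):
    """Return (avg, sd, median) of per-base read depth.

    Assumption: population standard deviation (pstdev); the source
    does not say whether the population or sample form was used.
    """
    return mean(depths), pstdev(depths), median(depths)


def duplicate_percentage(per_library_percent):
    """Average the duplicate percentage over all libraries of one sample."""
    return mean(per_library_percent)


# Hypothetical toy values for illustration only, not data from the study.
toy_depths = [28, 30, 31, 29, 32]
avg, sd, med = coverage_summary(toy_depths)

# A sample with two libraries; a single-library sample would pass one value.
dup = duplicate_percentage([10.0, 12.0])

print(f"avg={avg} sd={sd:.2f} median={med} duplicate%={dup}")
```

Reporting the median alongside the average is useful here because a few very deep regions can pull the average and standard deviation upward while leaving the median nearly unchanged.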