GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:39:57 Sequence gi568815595r:185818363_186026735 : 208373 bp : 42.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 2085 2080 6 1.05 1.02 Term - 6229 6141 89 2 2 107 50 39 0.379 -1.16 1.01 Init - 6595 6421 175 1 1 64 88 305 0.890 27.66 1.00 Prom - 22982 22943 40 -5.05 2.03 PlyA - 23728 23723 6 1.05 2.02 Term - 29970 29665 306 0 0 10 50 223 0.625 4.83 2.01 Init - 30898 30851 48 1 0 52 91 37 0.829 1.40 2.00 Prom - 40464 40425 40 -5.15 3.00 Prom + 43704 43743 40 -4.35 3.01 Init + 47291 47366 76 1 1 41 91 41 0.423 1.00 3.02 Term + 51508 51635 128 2 2 76 46 112 0.697 3.26 3.03 PlyA + 51752 51757 6 1.05 4.11 PlyA - 51869 51864 6 1.05 4.10 Term - 73257 73082 176 1 2 64 54 82 0.149 -0.66 4.09 Intr - 82148 81963 186 0 0 25 70 164 0.471 7.04 4.08 Intr - 100076 100003 74 1 2 26 99 85 0.617 1.53 4.07 Intr - 102825 102742 84 2 0 72 111 146 0.968 13.32 4.06 Intr - 103764 103649 116 0 2 40 94 53 0.976 -0.57 4.05 Intr - 105622 105434 189 1 0 93 100 143 0.997 14.86 4.04 Intr - 107264 107102 163 1 1 113 94 163 0.999 18.46 4.03 Intr - 108372 108239 134 0 2 20 82 111 0.877 2.22 4.02 Intr - 119561 119463 99 2 0 -15 87 196 0.037 8.59 4.01 Init - 141678 141592 87 0 0 47 85 95 0.120 5.79 4.00 Prom - 143287 143248 40 -5.35 5.00 Prom + 143606 143645 40 -7.35 5.01 Init + 149975 150169 195 1 0 85 36 81 0.453 1.59 5.02 Intr + 150221 150355 135 1 0 -1 42 191 0.506 5.44 5.03 Intr + 153253 153502 250 0 1 71 33 407 0.690 29.59 5.04 Intr + 156804 156994 191 1 2 66 113 176 0.737 16.28 5.05 Term + 161296 161475 180 0 0 67 43 201 0.992 10.13 5.06 PlyA + 162487 162492 6 1.05 6.00 Prom + 163466 163505 40 -9.85 6.01 Init + 165734 165980 247 1 1 26 38 281 0.547 12.71 6.02 Intr + 168001 168243 243 2 0 32 9 236 0.775 6.65 6.03 Intr + 168679 169023 345 2 0 64 30 178 0.636 4.04 6.04 Intr + 174462 174665 204 1 0 34 78 120 0.629 3.85 6.05 Term + 174847 175004 158 2 2 44 42 159 0.639 4.01 6.06 PlyA + 177718 177723 6 1.05 7.03 PlyA - 179090 179085 6 1.05 7.02 Term - 180828 180696 133 2 1 44 46 102 0.132 -1.82 7.01 Init - 188682 187556 1127 0 2 60 53 217 0.611 9.10 7.00 Prom - 190046 190007 40 -4.85 8.02 PlyA - 190613 190608 6 1.05 8.01 Term - 207337 207194 144 1 0 103 37 101 0.570 3.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 125197 125375 179 1 2 71 66 160 0.842 10.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_1|87_aa MNKLYIGNLSPAVTADDLRQLFGDRKLPLAGQVLLKSGYAFVDYPDQNWAIRAIETLSAL DSALALGAAALGRPPCRSEGSRGSPAP >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_1|264_bp atgaacaagctttacatcgggaacctgagccccgccgtcaccgccgacgacctccggcag ctctttggggacaggaagctgcccctggcgggacaggtcctgctgaagtccggctacgcc ttcgtggactaccccgaccagaactgggccatccgcgccatcgagaccctctcggctctg gactctgccctcgccctcggggccgcagccttggggcgccccccgtgccgctccgaaggc tcccgaggctcgccggctccttag >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_2|117_aa MLTESKLQMVPKGALQDKNPEKEASRGAQRAGSLQTLHHWGNPNGSRAPRDPSCLIGNPT KPSADSGEPALRRASLSPGSSFSEAVFVLQCVHTVPYSSFNKPPLKQHVSVFKFTKM >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_2|354_bp atgcttacagaaagtaaactacaaatggttcctaaaggagcccttcaggacaaaaacccg gaaaaggaagctagcagaggggctcagagggcaggcagcctgcagacgctccatcactgg ggcaatccaaatggcagccgggctccccgggacccttcatgtctgattggaaatcctacc aaaccctctgctgattcaggagagcctgcgttgcggagggcctctctgagccctggctcc agcttttctgaggctgtgttcgttttgcagtgtgtccatacggttccttactcctctttc aataaacccccattgaaacagcacgttagtgtattcaagttcactaagatgtag >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_3|67_aa MSYNRTTGYNAAIKMLILKAKKNKHDIPSRSPLITRRLERPYGNTPADRQLQLSQPHTRH MGDETPR >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_3|204_bp atgtcctacaataggacaacaggatataatgcagccattaaaatgttaattttgaaggct aagaaaaataagcatgatattccctctaggagcccactcatcacacgtcgtttggagagg ccctacggaaacacaccagctgacaggcagcttcagttatcccagccccacaccagacat atgggggatgaaacccccagatga >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_4|435_aa MLSGKERVSGVTELSLEQRHQPKCCVSSLEGARGWQLRLKHIDRRQQPGVMSDSGEQNYG ERESRSASRSGSAHGSGKSARHTPARSRSKEDSRRSRSKSRSRSESRSRSRRSSRRHYTR SRSRSRSHRRSRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCLGVFGLSLYTT ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERANGMELDGRRIR VDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGYDDRDYYSRSYRRRSPSPYYS RGGYRSRSRSRSYSPPESRRVSSESLPDRKLLTPGTSRLLTQAKGSSKPSSPSVHESSQQ PLEQGQAYKWPFMPLTKSLLPHPPYIRQQISTGLPSSEKVYLCQSVGRHKLPYLKDPPAQ EFCSHPPSAVAFGGY >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_4|1308_bp atgttatcaggaaaagagagagtcagtggagtaacagaactttccctggagcagcgacac cagccgaaatgctgtgtttcttctctggaaggtgcaagaggttggcagcttcgattgaag cacatcgaccggcgacagcagccaggagtcatgagcgacagcggcgagcagaactacggc gagcgggaatcccgttctgcttccagaagtggaagtgctcacggatcggggaaatctgca aggcatacccctgcaaggtctcgctccaaggaagattccaggcgttccagatcaaagtcc aggtcccgatctgaatctaggtctagatccagaagaagctcccgaaggcattatacccgg tcacggtctcgctcccgctcccatagacgatcacgtagcaggtcttacagtcgagattat cgtagacggcacagccacagccattctcccatgtctactcgcaggcgtcatgttgggaat cgggcaaatcctgatcctaactgttgtcttggagtatttgggctgagcttgtacaccaca gaaagagatctaagagaagtgttctctaaatatggtcccattgccgatgtgtctattgta tatgaccagcagtctaggcgttcaagaggatttgcctttgtatattttgaaaatgtagat gatgccaaggaagctaaagaacgtgccaatggaatggagcttgatgggcgtaggatcaga gttgatttctctataacaaaaagaccacatacgccaacaccaggaatttacatggggaga cctacctatggcagctctcgccgtcgggattactatgacagaggatatgatcggggctat gatgatcgggactactatagcagatcatacagaaggcggtcaccttctccttactatagt cgtggaggatacagatcacgttccagatctcgatcatactcacctccagaaagtaggagg gtgtccagtgagtcactccctgaccggaaactgctgacaccaggcaccagcaggttgctg actcaggccaaggggagctccaagccctcatcaccctcagtgcatgaatcttcacagcag ccattggagcagggccaggcttacaaatggcctttcatgccacttaccaagagtttgctt ccacaccctccatatatcaggcagcaaatatcaacagggctgccatcttcagaaaaggtc tatctatgtcagagtgttgggcggcataagctaccctaccttaaggatcctcctgcccag gaattttgcagccatcctccttcagctgtagccttcgggggctactga >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_5|316_aa MGICEGPCSTPCSRYCSSSTLNYEISDLLSVNILPDPSCVSSSLDPAGAQGGSVARAILE SKKFARLGAEVVKGDLNDKASVDSALKGVYGAFLVTNFWDPLNQDKEVCRGKLVADSAKH LGLKHVVYSGLENVKRLTDGKLEVPHFDSKGEVEEYFWSIGIPMTSVRVAAYFENFLAAW RPVKASDGDYYTLAVPMGDVPMDGISVADIGAAVSSIFNSPEEFLGKAVGLSAEALTIQQ YADVLSKVLGKEVRDAKITPEAFEKLGFPAAKEIANMCRFYEMKPDRDVNLTHQLNPKVK SFSQFISENQGAFKGM >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_5|951_bp atggggatatgtgaagggccatgtagcaccccctgcagtaggtactgcagcagctctact ctgaattatgagatttctgaccttctttcagtgaacattcttcctgatccttcctgtgtt tcttcttctcttgaccctgcaggagctcaaggtggctctgtggccagggcaattttggag agcaaaaaatttgcacgccttggagctgaggtggtcaaaggtgacctgaatgataaagca tcggtggacagtgccttaaaaggtgtctatggggccttcttggtgaccaacttctgggac cctctcaaccaagataaggaagtgtgtcgggggaagctggtggcagactccgccaagcac ctgggtctgaagcacgtggtgtacagcggcctggagaacgtcaagcgactgacggatggc aagctggaggtgccgcactttgacagcaagggcgaggtggaggagtacttctggtccatt ggcatccccatgaccagtgtccgcgtggcggcctactttgaaaactttctcgcggcgtgg cggcccgtgaaagcctctgatggagattactacaccttggctgtaccgatgggagatgta ccaatggatggtatctctgttgctgatattggagcagccgtctctagcatttttaattct ccagaggaatttttaggcaaggccgtggggctcagtgcagaagcactaacaatacagcaa tatgctgatgttttgtccaaggttttggggaaagaagtccgagatgcaaagattaccccg gaagctttcgagaagctgggattccctgcagcaaaggaaatagccaatatgtgtcgtttc tatgaaatgaagccagaccgagatgtcaatctcacccaccaactaaatcccaaagtcaaa agcttcagccagtttatctcagagaaccagggagccttcaagggcatgtag >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_6|398_aa MLVLLLLASPLATLLVILSPTQGTFATLKSHQDYTRGRDNSLKGNWGTVRKCVLNGPQAL SILPSNRAEHSGNVVLTWTPHRKRECGATVSSGYRQGRGSAWLTGGKTGVTGLSVEALRG NRESPQEVLGDQTQRFPRAGLVDEGNKPDVAQKGVVATVPTGDRLIPEPSDFVEGLGLGE AVGIVSDWPDHEGKEHPQNARVLLFGSGELLGVIIAAIITLISTVFLCPFYREVVGVGCG VKGLRSLSAGLTPICVITNKAFPHCFWQMFGHGEKKEAKKYFPCSSDCVIHMSHHQQPKA SSEAAEHFSSKTALCDPRSACSAEITCVWHRPSLLILAGFQGVKKLGFNSQGGKKDVLGS GGIGAMWVVLEGFGMPNRLGSLRFHPHNISQAMRRPLL >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_6|1197_bp atgctggtgctcctgctgctggcttctccgctggccacactgcttgtcatcctctcccca acccaagggacctttgcaactctgaaatcccatcaagactacacaaggggcagagacaat tcactaaaaggaaactggggcactgttaggaagtgcgtattgaacggccctcaggcactg tccattctcccaagtaacagagcagaacactctggcaatgtagtcctgacttggactcct cataggaaaagagaatgcggagccacggtgtcctcaggctaccggcaaggccgtggatct gcgtggctaacggggggcaagacaggggtcacggggctgagtgtggaagccctgcgtgga aatcgagagagccctcaggaggtcttgggggatcaaacccaaaggtttccaagagccggg ctggtagatgaaggaaacaagccagatgtagcacagaagggagtggtggcgactgtaccg acaggagaccgattaattcctgaaccctctgattttgtggaagggctgggacttggtgaa gctgtaggtatcgtgagtgactggcctgatcacgaggggaaggagcacccacagaatgcc agggtcctcctctttggcagtggggaactattgggtgttattatagcagctatcatcacc ttaatcagtacagtatttttatgccccttttatagagaagtagtgggtgttggctgtggg gtgaaggggttgaggagcttgtcagcaggcctcactcctatctgtgttattacaaataaa gcattccctcattgcttctggcaaatgtttggccatggggaaaagaaagaagcaaaaaaa tactttccatgctcctctgactgtgtcatccacatgagccaccaccagcagcccaaagcc agttccgaagcagcagaacatttttcgagcaaaacagcactctgtgaccctcggagtgct tgttcagctgaaattacttgtgtctggcacagaccttctcttttgattttggcaggattt cagggagtgaaaaagctagggtttaacagccaaggagggaagaaggatgtgctggggagc gggggaataggggccatgtgggtagtgctggaaggattcgggatgccgaacaggcttggc tccttgcggttccacccccacaacatctctcaagccatgcgccgtcctctcctctag >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_7|419_aa MSELPFTIASKRIKYLGIQLTRDVKDLCKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMSFFTELEKTTLKFIWHQKRAHITKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDLWNRTEPSEITPHIYNYLIFDKPEKNKQWGKHSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLHVRPKTIKTIEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIELKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRI YNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQTKTTMRYH LTPVRMAIIKKSGNNRPKTAKVVGNLALSHIAGGDVNWYKLFGRNLAECIKNYKKVCIL >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_7|1260_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggacgtgaaggacctctgcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgagtttcttcacagaattggaaaaaactactttaaagttcatatggcaccaaaaaaga gcccacatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtattggtaccaaaac agagatatagatctatggaacagaacagagccctcagaaataacaccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaagcattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttacatgttagacctaaaacc ataaaaaccatagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatagacaaatgggatcta attgaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaaaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctcaccatcactggctatcagagaaatgcaaaccaaaaccacaatgagataccat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagacccaaaactgcc aaggttgtgggaaacctggcactctcacacattgctggtggcgatgtaaattggtacaaa ctttttggaagaaatctggcagaatgtatcaagaattataaaaaagtgtgtatcttgtga >gi568815595r:185818363_186026735|GENSCAN_predicted_peptide_8|47_aa DVSLKFPVVVNQGKYCPQKKWQMCGGLFNSLKTRGCNCHLVDEDQRC >gi568815595r:185818363_186026735|GENSCAN_predicted_CDS_8|144_bp gatgtatcactgaagttcccagtggttgtcaaccaggggaagtactgcccccaaaagaaa tggcagatgtgtgggggcctttttaatagccttaagacaaggggctgcaactgccattta gtggatgaggaccagagatgttaa