GENSCAN 1.0 Date run: 20-Jan-117 Time: 09:30:03 Sequence gi568815595r:9402214_9652679 : 250466 bp : 43.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 26726 26796 71 1 2 79 89 28 0.618 2.56 1.02 Intr + 31632 31737 106 0 1 65 68 55 0.658 1.42 1.03 Intr + 32121 32272 152 2 2 108 110 97 0.897 12.86 1.04 Intr + 32611 32669 59 1 2 88 99 5 0.936 0.13 1.05 Intr + 33515 33693 179 0 2 65 91 109 0.627 8.54 1.06 Intr + 34637 34675 39 1 0 45 116 70 0.938 3.92 1.07 Intr + 38243 38485 243 1 0 107 115 142 0.999 16.49 1.08 Intr + 39380 39528 149 1 2 72 69 120 0.634 7.53 1.09 Intr + 39915 40032 118 0 1 39 94 89 0.974 5.07 1.10 Intr + 41095 41204 110 0 2 30 99 129 0.948 7.28 1.11 Intr + 42940 43087 148 1 1 28 74 139 0.839 6.74 1.12 Intr + 43444 43527 84 0 0 63 96 111 0.998 9.42 1.13 Intr + 44837 45094 258 1 0 73 78 171 0.996 12.26 1.14 Intr + 45473 45793 321 1 0 62 115 216 0.999 17.76 1.15 Intr + 46175 46417 243 1 0 80 95 70 0.879 4.59 1.16 Intr + 51526 51655 130 0 1 85 98 14 0.350 2.37 1.17 Intr + 62320 62459 140 2 2 97 45 81 0.572 4.88 1.18 Intr + 68246 68716 471 1 0 100 106 149 0.938 11.18 1.19 Intr + 71023 71324 302 0 2 128 103 23 0.384 3.33 1.20 Intr + 72236 72369 134 2 2 104 99 107 0.453 13.69 1.21 Intr + 92063 92155 93 0 0 84 67 69 0.491 4.34 1.22 Term + 94866 94909 44 1 2 110 39 40 0.034 -1.38 1.23 PlyA + 96029 96034 6 1.05 2.06 PlyA - 96173 96168 6 1.05 2.05 Term - 98367 98348 20 1 2 95 46 17 0.291 -3.32 2.04 Intr - 99425 99306 120 0 0 87 71 90 0.588 7.67 2.03 Intr - 103990 103709 282 2 0 60 -6 513 0.464 36.49 2.02 Intr - 150634 150061 574 1 1 125 91 904 0.345 86.71 2.01 Init - 181560 181393 168 0 0 66 77 80 0.345 4.23 2.00 Prom - 184506 184467 40 -4.96 3.00 Prom + 184962 185001 40 -7.96 3.01 Sngl + 200099 200473 375 1 0 49 48 238 0.641 12.05 3.02 PlyA + 200830 200835 6 1.05 4.00 Prom + 218586 218625 40 -3.66 4.01 Init + 247371 247529 159 2 0 92 100 315 0.936 30.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 160168 160149 20 2 2 151 45 30 0.829 3.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:9402214_9652679|GENSCAN_predicted_peptide_1|1197_aa MSIAIPLGVTTSDTSYSDMAAGSDPESVEASPAVNEKSVYSTHNYGTTQRHGCRGLPYAT IIPRSDLNGLPSPVEERCGDSPNSEGETVPTWCPCGLSQDGFLLNCDKCRGMSRGKVIRL HRRKQDNISGGDSSATESWDEELSPSTVLYTATQHTPTSITLTVRRTKPKKRKKSPEKGR AAPKTKKIKAFREGSRKSLRMKNSPSEAQNLDENTTEGWENRIRLWTDQYEEAFTNQYSA DVQNALEQHLHSSKEFVGKPTILDTINKTELACNNTVIGSQMQLQLGRVTRVQKHRKILR AARDLALDTLIIEYRGKVMLRQQFEVNGHFFKKPYPFVLFYSKFNGVEMCVDARTFGNDA RFIRRSCTPNAEVRHMIADGMIHLCIYAVSAITKDAEVTIAFDYEYSNCLPTIGAETRRR KARRKELEMEQQNEASEENNDQQSQEVPEKVTVSSDHEEVDNPEEKPEEEKEEVIDDQEN LAHSRRTREDRKVEAIMHAFENLEKRKKRRDQPLEQSNSDVEITTTTSETPVGEETKTEA PESEVSNSVSNVTIPSTPQSVGVNTRRSSQAGDIAAEKLVPKPPPAKPSRPRPKSRISRY RTSSAQRLKRQKQANAQQAELSQAALEEGGSNSLVTPTEAGSLDSSGENRPLTGSDPTVV SITGSHVNRAASKYPKTKKYLVTEWLNDKAEKQECPVECPLRITTDPTVLATTLNMLPGL IHSPLICTTPKHYIRFGSPFIPERRRRPLLPDGTFSSCKKRWIKQALEEGMTQTSSVPQE TRTQHLYQSNENSSSSSICKDNAGSKSPQLATPGSSHPGEEECRNGYSLMFSPVTSLTTA SRCNTPLQFELCHRKDLDLAKVGYLDSNTNSCADRPSLLNSGHSDLAPHPSLGPTSETGF PSRSGDGHQTLVRNSDQAFRTEFNLMYAYSPLNAMPRADGLYRGSPLVGDRKPLHLDGGY CSPAEGFSSRYEHGLMKDLSRGSLSPGGERACEGVPSAPQNPPQRKKVSLLEYRKRKQEA KENSAGGGGDSAQSKSKSAGAGQGSSNSVSDTGAHGVQGSSARTPSSPHKKFSPSHSSMS HLEAVSPSDSRGTSSSHCRPQENISSRWMVPTSVERLREGGSIPKVLRSSVRVAQKGEPS PTWESNITEKDSVNNLVSCFTEKMDAIGQERTELLPCPPPRKRPRPPPTLPTSPKAI >gi568815595r:9402214_9652679|GENSCAN_predicted_CDS_1|3594_bp atgagcattgcaatccctctgggagtcaccacatcagatacatcctactcagatatggct gctggatcagaccctgaatctgtggaggctagtccagcagttaatgagaagagcgtgtat tccactcataattatgggaccactcagaggcatgggtgtcgaggactgccttatgctacg atcatccctcgttctgacctgaatggcctgccgtcgcctgtagaggaacgctgtggagac agcccgaactctgaaggagaaactgtacctacctggtgtccttgtggtctttctcaggat ggcttccttctcaactgtgacaagtgcaggggaatgagcagggggaaggttattagactt catcggcggaagcaggacaacatatcaggtggggatagcagtgcaacagaaagctgggat gaggagctttctccttccactgtgttgtatacagcaacacagcacacacctacaagcatc accttaactgttagaagaaccaaacccaagaagcggaaaaagagtccagaaaagggtcgt gcagcaccaaagacgaagaaaatcaaggcatttcgagagggatcccggaagtccttgcgg atgaagaattctccctctgaagcacagaatttagatgagaatacaactgagggctgggaa aatcggataagactatggactgaccagtatgaagaagctttcactaatcagtacagtgca gatgtacagaacgcgcttgaacaacacctacattctagcaaggaatttgtgggcaaacct actattttagacactattaataagactgaattggcctgtaataacacagttattggttcc caaatgcagttacagctgggaagagtcactcgtgttcaaaagcaccggaagatcctgagg gctgcaagagatttggctttggacactcttataatagagtatcgtgggaaagtcatgtta cgacagcaatttgaggtcaatgggcatttcttcaaaaaaccatacccctttgtgctcttc tactcaaaattcaatggtgtagagatgtgtgtggatgcccgtactttcggtaatgatgct cggttcatcagaagatcatgtacaccaaatgcagaggtgcgacacatgattgcagatggg atgattcacctgtgcatctatgctgtgtctgccatcaccaaggatgctgaggtcaccata gcatttgattatgagtatagtaactgcctacccaccattggagcagagactagacgtaga aaagcacgacggaaagagctagagatggagcagcagaatgaggcttcagaggagaataat gaccagcaatcacaagaagttccagaaaaagtaactgtatccagtgatcatgaggaagta gacaatccagaagaaaaaccagaagaagagaaagaagaggttatagatgaccaggagaac ctagctcatagcaggaggaccagggaagatagaaaggtagaagccatcatgcatgctttt gaaaacttagagaaaagaaagaagcggcgggatcagcccttggaacagagcaactctgat gtagagattactacaaccacctcagagactcctgttggtgaagagacaaaaactgaagcc cctgaatctgaagttagcaactctgtttcaaatgttaccatcccaagcaccccacagagt gttggtgtgaatacccggaggtcttcccaagcaggggatattgctgcagaaaaactagtc cccaagccacctccagcaaagccttctaggccccggccgaagagtcgaatttctcggtac aggaccagttcagcccaaagactaaagcgtcagaagcaggccaatgcacagcaggcagaa ttgtcacaagctgccttggaagagggaggaagtaacagtttagtaactcctactgaagct ggaagtctagacagttcaggagaaaacaggccattaacagggtctgacccaactgtggtg tcaattactggatcccatgtcaaccgtgctgcatctaaataccccaaaaccaaaaagtat ctagttacagaatggttgaatgacaaagcagagaagcaagagtgccctgttgagtgccct ttacgtatcacaacggatccaactgtactggcaacgaccctaaacatgttaccaggtctt atccattccccgttaatttgcaccacccccaaacactacattcgctttggctcacccttt atccctgagagacgtcgaaggccccttctgcctgatggcacattcagctcctgtaagaag cgctggataaaacaagccttagaagaagggatgactcaaacatcatctgtaccccaagag actagaactcagcacctataccaaagcaatgagaatagtagctcttctagtatctgcaaa gacaatgcaggctcaaagagtccccagctggccacacctggctcatctcacccaggagaa gaggagtgtcgaaatggatacagcctcatgttttcaccagtcacatctcttactactgct agtcgctgcaacactcctctacagtttgagctttgtcaccgaaaagacctggatttggca aaagtaggataccttgactccaacactaacagctgtgctgatagaccttccctactcaac tcaggtcattctgacctggctcctcatccctccctcggacccacttctgagactggtttc ccaagcagaagtggagatggacatcagaccctcgtgagaaactcagaccaggcatttcgg acagagttcaacttgatgtatgcctactcccctttgaatgctatgcctcgagcagatgga ctgtatcgaggatctcctctagtgggggataggaagcctttacatttggatgggggatat tgttcccctgcagaaggattttccagcagatatgaacatggcttaatgaaagacctctct cgtggatccttgtcacctggtggtgaaagggcctgtgaaggagtcccatctgccccccag aacccaccacagaggaaaaaagtatccctgctggagtaccgaaaacggaaacaagaagct aaggaaaattctgctggtgggggaggtgactctgcacagagcaaaagcaagtctgcagga gctgggcaaggcagcagtaactccgtttccgacactggtgcccatggtgtgcagggatcc tcagcccgaactccatcttcccctcacaaaaaattctccccatctcattcctctatgtcc catttggaggcggtaagcccatcagattccagaggcacttcttcatctcactgcagacct caagagaatatcagcagtaggtggatggttcccacatcagtagaacgactccgagaagga gggagcatccccaaggtcctccgaagcagcgtgagggtggcccaaaagggagagccctct cccacatgggagagtaacatcacagagaaagactcagtaaacaaccttgtttcctgcttc actgagaagatggatgctatcgggcaagaacgcactgaacttttgccctgccctccaccc cgaaagagaccaagacctcctcctacgctgcccacgtcaccaaaagccatttaa >gi568815595r:9402214_9652679|GENSCAN_predicted_peptide_2|387_aa MDASWSEGPHKELTHERMQFPHPGDFLSLAPINQQPQFSNPLPSTIPLKAPAQNSLQLPD APSSAPGPALAGSLTQGRFSRASPAPARQPGAPSRAGGPRQPPPPPAPPPGTMLPSQEAS KLYHEHYMRNSRAIGVLWAIFTICFAIINVVVFIQPYWVGDSVSTPKPGYFGLFHYCVGS GLAGRELTCRGSFTDFSTIPSSAFKAAAFFVLLSMVLILGCITCFSLFFFCNTATVYKIC AWMQLLAALCLVLGCMIFPDGWDAETIRDMCGAKTGKYSLGDCSVRWAYILAIIGILNAL ILSFLAFVLGNRQTDLLQEELKPENKGECAVGWGAAAGARPAHPEGELHRGMHRALLSAA DRGPDQPRVRKDFQALEVAGGGLGPES >gi568815595r:9402214_9652679|GENSCAN_predicted_CDS_2|1164_bp atggatgccagctggtctgaaggaccccacaaggagctaactcatgaaagaatgcagttt ccacatcctggtgatttcctctcccttgccccgatcaaccaacaaccccaattttccaac cctttaccttccacgatccccttaaaagccccagcccagaactccttgcagctcccagac gcccccagctctgccccaggccccgccctcgcgggttccttgactcagggccgcttctcg cgtgcgtccccagctcccgcccggcagcccggcgctccgtcccgggccggcggcccccgc cagccgccgccgccgccggccccgcctccgggcaccatgctgccctcgcaggaggcctcc aagctctaccacgagcactacatgcggaactcgcgggccatcggcgtgctgtgggccatc ttcaccatctgcttcgccatcatcaacgtggtggtcttcatccagccctactgggtgggc gacagcgtgagcacccccaagcctggctacttcggcctcttccactactgcgtgggcagc gggctggcgggccgcgagctcacctgccggggctccttcaccgacttcagcaccatcccg tccagcgccttcaaggcggccgccttcttcgtgctgctctccatggtgctgatcctcggc tgcatcacctgcttttcgcttttcttcttctgcaacacggctacggtctacaagatctgc gcctggatgcagctcttggcagctctgtgcctcgtcctgggctgcatgatctttcctgat ggctgggatgccgagaccatccgggacatgtgtggggccaagacggggaagtactccctg ggggactgttcagtgcgctgggcatacatcctggccatcatcggcatcctcaacgccctc atcctctccttcctcgccttcgtgctgggcaaccggcaaacagacctgctgcaggaggag ctcaagccggagaacaaaggcgagtgtgctgtgggctggggggcagcggcgggagccagg cctgctcaccctgagggagagctgcaccgagggatgcacagggccctgctgtcagctgca gaccggggccctgatcagcccagagtcagaaaagactttcaggctctggaggtagcggga gggggacttggacctgagagttag >gi568815595r:9402214_9652679|GENSCAN_predicted_peptide_3|124_aa MDFIGCVRGKGGKVLVHCEAGISHSPTSCKSSPMKTRQLCLKETFNYIKQRRSMIWSNFG FMDQLLQKESEIPPSTPTPSLPLAKRGSRLFTEGHLQILSPGLQDAYCTFPSPVLALVPT HRTV >gi568815595r:9402214_9652679|GENSCAN_predicted_CDS_3|375_bp atggacttcattggctgtgtcaggggaaaaggaggcaaggtcctggtccactgtgaggct ggaatctcccattcacccaccagctgcaagtcttcccccatgaagaccagacagctctgc ctgaaggagaccttcaattacatcaagcagaggaggagcatgatctggtccaattttggc ttcatggaccagctcctacagaaggaatctgagatcccaccctccacgcccacccccagc ctccctcttgccaagcgaggcagcaggctcttcactgaaggtcatttgcagatactgagc ccaggcctgcaggatgcctactgcacattcccttccccggtgctggcactggtgcccacc caccggacagtctaa >gi568815595r:9402214_9652679|GENSCAN_predicted_peptide_4|53_aa MAGARAAAAAASAGSSASSGNQPPQELGLGELLEEFSRTQYRAKDGSGTGGSK >gi568815595r:9402214_9652679|GENSCAN_predicted_CDS_4|159_bp atggccggcgctcgggccgccgccgccgctgcctcggcggggtcctcggcctcttcaggc aaccagccgcctcaggagctggggcttggggagctgctggaggagttctcccggactcag taccgggccaaggatggcagcgggaccggcggctctaag