GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:00:52 Sequence gi568815589f:116597743_116799701 : 201959 bp : 41.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 5702 5875 174 1 0 77 49 224 0.993 12.04 1.02 PlyA + 7290 7295 6 1.05 2.07 PlyA - 8041 8036 6 1.05 2.06 Term - 8560 8478 83 2 2 114 34 84 0.075 2.48 2.05 Intr - 20730 20582 149 2 2 87 96 173 0.726 16.96 2.04 Intr - 22701 22568 134 0 2 105 40 120 0.694 7.42 2.03 Intr - 28949 28803 147 2 0 110 86 -8 0.574 0.71 2.02 Intr - 54051 53786 266 1 2 94 83 272 0.503 23.61 2.01 Init - 55247 55160 88 2 1 81 44 47 0.700 0.45 2.00 Prom - 68896 68857 40 -5.75 3.00 Prom + 71519 71558 40 -3.75 3.01 Init + 72566 72617 52 1 1 51 60 58 0.239 0.68 3.02 Intr + 77269 77445 177 2 0 75 36 110 0.052 3.47 3.03 Term + 81938 83094 1157 0 2 15 42 535 0.113 32.65 3.04 PlyA + 83103 83108 6 1.05 4.00 Prom + 84812 84851 40 -4.05 4.01 Sngl + 100001 101962 1962 1 0 78 50 2005 0.998 189.39 4.02 PlyA + 104189 104194 6 1.05 5.00 Prom + 111968 112007 40 -4.45 5.01 Init + 121931 122147 217 1 1 81 48 199 0.492 14.10 5.02 Term + 123937 124037 101 2 2 52 41 105 0.338 -0.49 5.03 PlyA + 124516 124521 6 -0.45 6.09 PlyA - 124583 124578 6 1.05 6.08 Term - 124797 124658 140 1 2 87 38 49 0.198 -3.06 6.07 Intr - 127659 127375 285 1 0 93 102 42 0.453 2.49 6.06 Intr - 128208 128029 180 1 0 114 121 246 0.995 29.42 6.05 Intr - 131354 131250 105 0 0 121 94 126 0.996 15.77 6.04 Intr - 134142 134044 99 1 0 53 84 96 0.920 4.86 6.03 Intr - 135835 135657 179 0 2 85 99 252 0.833 24.64 6.02 Intr - 150319 150097 223 2 1 15 78 138 0.135 1.76 6.01 Init - 156273 156213 61 0 1 64 70 98 0.326 5.06 6.00 Prom - 156553 156514 40 -3.65 7.03 PlyA - 158884 158879 6 1.05 7.02 Term - 165767 165697 71 0 2 102 48 71 0.570 1.62 7.01 Init - 170576 170531 46 2 1 85 103 40 0.540 6.10 7.00 Prom - 172693 172654 40 -2.25 8.03 PlyA - 173302 173297 6 1.05 8.02 Term - 179452 179315 138 1 0 84 38 107 0.301 2.28 8.01 Intr - 191463 191376 88 2 1 89 95 104 0.683 10.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 29205 29167 39 0 0 65 76 41 0.865 0.86 S.002 Term - 115573 115467 107 2 2 82 45 117 0.806 4.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_1|57_aa MVTGDEQIKEVRDWIVGQKIHKNPDLTYCDGKNPDEEKAANQELKILKEPVSVEARG >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_1|174_bp atggtcactggagatgagcagatcaaagaagtgagagactggattgttggacagaaaatc cacaaaaaccctgaccttacctattgtgatggcaagaacccagatgaagagaaagctgca aaccaggaacttaaaatacttaaagaaccagtttcagttgaggcaagaggatga >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_2|288_aa MKNDSLKTVTLEWSIDRRDVERQRGKAAQKTTELGSKKELKSMPFITYLSGLLTAQMLSD DQLISGVEIRCEEKGRCPSTCHLCRRPGKEQLSPTPVLLEINRVVPLYTLIQDNGTKEAL EASYFISSKTSGQKRYQPHCPPSCDHNCLQKGLYLHSPPQFPFNGFKAFKSALMSSYWCS GKGDVIDDWCRCDLSAFDANGLPNCSPLLQPVLRLSPTVEPSSTVVSLEWVDVQPAIGTK VSDYILQHKKVDEYTDTDLYTGLAPSLSPSEKPSLGQFRDAVRKAKHK >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_2|867_bp atgaagaatgactccctaaagacagtaacattggaatggagcattgatagaagagatgtg gagaggcagaggggaaaagcagcacagaagaccacagagctgggcagcaagaaggagctc aagtccatgcccttcatcacctacctctcaggtttgctgacagcccagatgctgtcagat gaccagctcatttcaggtgtggagattcgctgtgaggagaaggggcgctgtccatctacc tgtcacctttgccgccggccaggcaaggagcagctgagccccacaccagtgctgctggaa atcaaccgtgtggtgccactttataccctcatccaagacaatggcacaaaggaggcctta gaagcttcttattttatatcttcaaaaacttctggacaaaaaaggtatcaaccccactgt cctccaagctgtgatcacaattgtctccaaaagggactctacctccatagcccaccccaa tttcctttcaatggcttcaaggccttcaagagtgcactgatgagttcctactggtgctca gggaaaggggatgtgatcgatgactggtgcaggtgtgacctcagcgcctttgatgccaat gggctccccaactgcagcccccttctgcagccggtgctgcggctgtccccaacagtggag ccctccagtactgtggtctccttggagtgggtggatgttcagccagctattgggaccaag gtctccgactatattctgcagcataagaaagtggatgaatacacagacactgacctgtac acaggtttggcgccctctttgtctccttcagagaagccatcgctcggtcagttccgggac gctgttaggaaagccaagcataagtag >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_3|461_aa MMIMMTIPPDKVSTRAGVGKVNPLGSYKFRGLSRIALVATCSWFSAPPRSSNGSRGQPKW LPSSFGLGLTLVLYWWEHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKM FFETNENKDTTYQNLWDTFKAVCRGKFIAINAHKRKQERSKIDTLTSQLKELKKQEQTHS KASRRQEITKIRAEVKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDA IKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTIPRLNQEEVESLNRP ITGSEIVAIINSLPTKKSPGPDGFTAEFYQGYKEELVSFLLKLFQSTEKEGILPNSFYEA SIILIPKPGRDTTKKENFRSISLINIDAKILNKILANRIQQHIKKRIHHDQVGFIPGMQG WFNICKSINAIQHINRTKDKNHMIISIDAEKAFDKIQQPSC >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_3|1386_bp atgatgatcatgatgacaatccccccggacaaagtgtccacaagggcaggagtgggcaag gtgaaccccctgggcagttacaaattcaggggcttgtccaggattgcccttgtggctacc tgctcatggttcagtgcccccccacgctccagcaatggatccagaggccagcccaagtgg ctgcctagttcttttggactggggctgactctggtactctattggtgggaacacagtgca atcaaacttgaactcaggattaagaaactcactcaaaaccgctcaactacatggaaactg aacaacctgctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatg ttctttgaaaccaatgagaacaaagacacaacataccagaatctctgggacacattcaaa gcagtgtgtagagggaaatttatagcaataaatgcccataagagaaagcaggaaagatct aaaattgacaccctaacatcacaattaaaagaactaaaaaagcaagagcaaacacattca aaagctagcagaaggcaagaaataactaaaatcagagcagaagtgaaggaaatagagaca caaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaaaggatcaacaaa attgatagaccactagcaagactcataaagaagaaaagagagaagaatcaaatagacgca ataaaaaatgataaaggggatatcaccaccgatcccacggaaatacaaactaccatcaga gaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaattc ctggacacatacaccatcccaagactaaaccaggaagaagttgaatctctgaatagacca ataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaaaagagtccagga ccagatggattcacagccgaattctaccaggggtacaaggaggaactggtatcattcctt ctgaaactattccaatcaacagaaaaagagggaatcctccctaactcattttatgaggcc agcatcatcctgataccaaagccgggcagagacacaaccaaaaaggagaattttagatca atatccttgataaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccag cagcacatcaaaaagcgtatccaccatgatcaagtgggcttcatccctgggatgcaaggc tggttcaatatatgcaaatcaataaatgcaatccagcatataaacagaaccaaagacaaa aatcacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaaccttca tgctaa >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_4|653_aa MAAAAASHLNLDALREVLECPICMESFTEEQLRPKLLHCGHTICRQCLEKLLASSINGVR CPFCSKITRITSLTQLTDNLTVLKIIDTAGLSEAVGLLMCRSCGRRLPRQFCRSCGLVLC EPCREADHQPPGHCTLPVKEAAEERRRDFGEKLTRLRELMGELQRRKAALEGVSKDLQAR YKAVLQEYGHEERRVQDELARSRKFFTGSLAEVEKSNSQVVEEQSYLLNIAEVQAVSRCD YFLAKIKQADVALLEETADEEEPELTASLPRELTLQDVELLKVGHVGPLQIGQAVKKPRT VNVEDSWAMEATASAASTSVTFREMDMSPEEVVASPRASPAKQRGPEAASNIQQCLFLKK MGAKGSTPGMFNLPVSLYVTSQGEVLVADRGNYRIQVFTRKGFLKEIRRSPSGIDSFVLS FLGADLPNLTPLSVAMNCQGLIGVTDSYDNSLKVYTLDGHCVACHRSQLSKPWGITALPS GQFVVTDVEGGKLWCFTVDRGSGVVKYSCLCSAVRPKFVTCDAEGTVYFTQGLGLNLENR QNEHHLEGGFSIGSVGPDGQLGRQISHFFSENEDFRCIAGMCVDARGDLIVADSSRKEIL HFPKGGGYSVLIREGLTCPVGIALTPKGQLLVLDCWDHCIKIYSYHLRRYSTP >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_4|1962_bp atggctgcagcagcagcttctcacctgaacctggatgccctccgggaagtgctagaatgc cccatctgcatggagtccttcacagaagagcagctgcgtcccaagcttctgcactgtggc cataccatctgccgccagtgcctggagaagctattggccagtagcatcaatggtgtccgc tgtcccttttgcagcaagattacccgcataaccagcttgacccagctgacagacaatctg acagtgctaaagatcattgatacagctgggctcagcgaggctgtggggctgctcatgtgt cggtcctgtgggcggcgtctgccccggcaattctgccggagctgtggtttggtgttatgt gagccctgccgggaggcagaccatcagcctcctggccactgtacactccctgtcaaagaa gcagctgaggagcggcgtcgggactttggagagaagttaactcgtctgcgggaacttatg ggggagctgcagcggcggaaggcagccttggaaggtgtctccaaggaccttcaggcaagg tataaagcagttctccaggagtatgggcatgaggagcgcagggtccaggatgagctggct cgctctcggaagttcttcacaggctctttggctgaagttgagaagtccaatagtcaagtg gtagaggagcagagttacctgcttaacattgcagaggtgcaggctgtgtctcgctgtgac tacttcctggccaagatcaagcaggcagatgtagcactactggaggagacagctgatgag gaggagccagagctcactgccagcttgcctcgggagctcaccctgcaagatgtggagctc cttaaggtaggtcatgttggccccctccaaattggacaagctgttaagaagccccggaca gttaacgtggaagattcctgggccatggaggccacagcgtctgctgcctctacctctgtt acttttagagagatggacatgagcccggaggaagtggttgccagccctagggcctcacct gctaaacagcggggtcctgaggcagcctccaatatccagcagtgcctctttctcaagaag atgggggccaaaggcagcactccaggaatgttcaatcttccagtcagtctctacgtgacc agtcaaggtgaagtactagtcgctgaccgtggtaactatcgtatacaagtctttacccgc aaaggctttttgaaggaaatccgccgcagccccagtggcattgatagctttgtgctaagc ttccttggggcagatctacccaacctcactcctctctcagtggcaatgaactgccagggg ctgattggtgtgactgacagctatgataactccctcaaggtatataccttggatggccac tgcgtggcctgtcacaggagccagctgagcaaaccatggggtatcacagccttgccatct ggccagtttgtagtaaccgatgtggaaggtggaaagctttggtgtttcacagttgatcga ggatcaggggtggtcaaatacagctgcctatgtagtgctgtgcggcccaaatttgtcacc tgtgatgctgagggcaccgtctacttcacccagggcttaggcctcaatctggagaatcgg cagaatgagcaccacctggagggtggcttttccattggctctgtaggccctgatgggcag ctgggtcgccagattagccacttcttctcggagaatgaggatttccgctgcattgctggc atgtgtgtggatgctcgtggtgatctcatcgtggctgacagtagtcgcaaggaaattctc cattttcctaagggtgggggctatagtgtccttattcgagagggacttacctgtccggtg ggcatagccctaactcctaaggggcagctgctggtcttggactgttgggatcattgcatc aagatctacagctaccatctgagaagatattccaccccatag >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_5|105_aa MTIETSLERGVHKTGSCETSLTNVAEEDEPAEETEMQQSRGEVENHEDLVSPKQTQKVPH TGSRQLCQSLLKGPVLPALDSTLLTCSVFQSPTDYWKAQNSHLVI >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_5|318_bp atgaccatagaaacaagtctagaaagaggggtccataagacaggctcctgtgaaacttca cttactaatgttgcagaagaggatgaacctgcagaggagacagagatgcaacagtccaga ggggaagtagaaaatcatgaagatctagtatcaccgaaacaaacacagaaagtgcctcac acaggaagcagacaactttgtcaaagcctactgaaaggcccagttcttcctgcattagat tctaccttattgacctgttcggtgttccagtcccctacggactactggaaagctcagaat tcccacctggtgatataa >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_6|423_aa MDAAGAGVGAAGITQQGLLQVEGKVLEEALVTAAQLVLSVMTGQGIPVLIHTGKGDGGRS GQGEDNVPLACNFFQFQGSQLPGQELVTAMPGMLNPQQSPLPRFGSLSAPLHRENNFIKD FPQLADGLLVIPLPVEEQCRGVLSEPLPDLQLLTVMGSLLPPKADMSTVCEEVLCFNGKA PEPVIDPGDIRYDEAMGYPMVQQWRVRSNLYRVKLSTITLAAGFTNVLKILTKESSREEL LSFIQHYGSHYIAEALYGSELTCIIHFPSKKVQQQLWLQYQKGFLGEPKKILGSSLPRRT QFLWKNNHLITKQTVSILSCGEAFILRKGQQISFSSDSQLLSELEFLRPGAGDQSLPTHL VKVIPFPPQFSLPVKWVPLVSLDIRNKCPSLPRFHNDGTLATFMIWGNHWKLLESQGRWG ECA >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_6|1272_bp atggacgcagcaggggcgggggtgggggcagcgggcatcacacagcagggcctgttgcaa gtggaggggaaagttctggaagaggccctggtgacagcagctcagcttgttctcagtgtt atgacaggacaagggatcccagtgctaatacacacaggcaaaggagatggagggcgaagt ggccaaggagaagataatgttcctctagcctgcaatttctttcaatttcagggatcgcaa ttaccagggcaggagttagtaacagccatgcctggaatgcttaatcctcagcagtctcca cttcctcgatttggcagtctctctgctcctctccacagggagaacaacttcatcaaggac tttccccagctggccgatgggctgttggtgatcccgctgccggtggaggagcagtgccgg ggggtcctctccgagccccttccagacctccaactgctcactgtgatggggagtttacta cctcctaaagcagacatgtccacggtctgtgaggaggtgctgtgctttaatggaaaggcc cctgaacctgtcattgatcctggagatatcaggtatgatgaggccatgggttaccccatg gtgcagcagtggcgggtccggagcaacctctaccgtgtgaagctcagcaccatcaccctc gcagcaggcttcactaatgttctcaagatcctgaccaaggagagcagtcgggaggagctg ctgtccttcatccagcactatggctcccactacatcgcagaggccctctatggctcagag ctcacctgcatcatccactttcccagcaagaaggtccagcagcagctgtggctccagtat cagaaaggcttcttgggagaacccaagaagatcttgggttcttctctcccaagaagaact caattcctttggaaaaacaatcacctcataaccaagcagacggtttccattctgtcctgt ggtgaggcttttatcctcaggaagggccagcagatttcattttcctctgactctcagcta ttatcagaattggaattcctcagaccaggtgctggagaccagtctctgcccactcacctt gtgaaggttattccttttccacctcaattttctcttcctgtgaaatgggtacctctggtg tccctggatattagaaacaaatgccctagccttcccaggtttcataatgatgggaccctg gccacttttatgatctggggaaaccactggaaattactggaaagccaagggagatggggt gaatgtgcctga >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_7|38_aa METSSFGKLEVINNHGSLSDGFGLEKKFWVFEDGFRGR >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_7|117_bp atggagaccagcagttttggcaaattggaggtcatcaataaccacggaagtctgagtgat gggtttggcctggagaagaagttttgggtatttgaagatggttttcgggggcgctga >gi568815589f:116597743_116799701|GENSCAN_predicted_peptide_8|75_aa XVGGKKDDEVIVLTCHAVVQRTKVALKDIPTRERGESLWPLPLEQYHASAPIGSIVQPDL VTTQIHTAQLESGAV >gi568815589f:116597743_116799701|GENSCAN_predicted_CDS_8|228_bp nntgtaggaggaaagaaagacgatgaagtgattgttcttacttgtcatgccgtggttcag agaactaaagttgctctaaaggatatccctaccagagagaggggtgagagtctctggccc ttacccttggagcagtaccatgcatcagcacctattggctccattgtccaaccagatttg gtaacaacacaaattcacacagcacagcttgaatcaggggcagtttga