GENSCAN 1.0	Date run: 5-Nov-116	Time: 05:46:26

Sequence gi568815594f:77058533_77265921 : 207389 bp : 42.38% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    317    312    6                               1.05
 1.07 Term -   7873   7766  108  1  0   72   50    82 0.297   0.33
 1.06 Intr -  17079  16940  140  1  2  113  100    62 0.816   9.16
 1.05 Intr -  17745  17568  178  0  1   27   29   156 0.402   2.17
 1.04 Intr -  18098  18036   63  2  0   61  107    35 0.495   0.70
 1.03 Intr -  24099  23858  242  1  2   16   50   251 0.641  10.45
 1.02 Intr -  24562  24429  134  0  2   19   -2   169 0.219   0.37
 1.01 Init -  31398  31346   53  0  2   77   64    34 0.064   0.59
 1.00 Prom -  36509  36470   40                              -6.25

 2.02 PlyA -  37729  37724    6                               1.05
 2.01 Sngl -  54911  54033  879  2  0   81   39   886 0.979  78.74
 2.00 Prom -  55740  55701   40                              -8.25

 3.00 Prom +  56091  56130   40                              -4.45
 3.01 Init +  57473  57561   89  1  2   70   90     5 0.442  -0.93
 3.02 Intr +  61913  62094  182  2  2  -18   99   156 0.437   4.69
 3.03 Term +  69378  69565  188  1  2   84   53   171 0.792   9.87
 3.04 PlyA +  73171  73176    6                               1.05

 4.00 Prom +  89165  89204   40                              -6.15
 4.01 Init +  92387  92433   47  1  2   96   37    88 0.266   4.71
 4.02 Intr +  95434  95630  197  1  2   29   70    91 0.135  -0.46
 4.03 Intr +  99969 100138  170  1  2  104  107   104 0.917  12.64
 4.04 Intr + 100835 100972  138  1  0   95   63    86 0.867   6.54
 4.05 Intr + 102189 102439  251  2  2   84   86    42 0.885  -1.09
 4.06 Intr + 102948 103026   79  0  1   60  106    83 0.966   5.93
 4.07 Intr + 103117 103215   99  0  0  105   86    12 0.809   2.09
 4.08 Intr + 105742 105947  206  0  2   91  102    97 0.922   8.48
 4.09 Intr + 119531 119592   62  2  2   92   48    21 0.001  -4.04
 4.10 Term + 125123 125307  185  0  2   92   48   128 0.823   5.92
 4.11 PlyA + 125403 125408    6                               1.05

 5.00 Prom + 126238 126277   40                              -5.65
 5.01 Init + 128328 128387   60  2  0   88   81    35 0.777   4.11
 5.02 Intr + 132129 132156   28  2  1  120  110     8 0.650   2.97
 5.03 Term + 135602 135750  149  0  2   94   41   113 0.866   4.28
 5.04 PlyA + 136454 136459    6                               1.05

 6.00 Prom + 136995 137034   40                              -2.35
 6.01 Init + 150114 150161   48  2  0   95   84     7 0.062   2.08
 6.02 Term + 154499 154702  204  1  0   85   32   134 0.135   3.89
 6.03 PlyA + 162168 162173    6                               1.05

 7.11 PlyA - 162899 162894    6                               1.05
 7.10 Term - 165728 165535  194  0  2   68   33   194 0.887   8.50
 7.09 Intr - 169192 169147   46  1  1   58   67    57 0.207  -2.34
 7.08 Intr - 172598 172383  216  2  0   33   81   119 0.131   3.48
 7.07 Intr - 176939 176811  129  2  0   50   43    89 0.074   0.57
 7.06 Intr - 177832 177808   25  0  1  131   93    14 0.277   3.31
 7.05 Intr - 184971 184745  227  0  2   20   53   165 0.320   2.06
 7.04 Intr - 185185 185053  133  0  1   43   70    81 0.446   1.53
 7.03 Intr - 189271 188999  273  0  0   74   56   145 0.102   5.63
 7.02 Intr - 202996 202898   99  0  0   29  103    90 0.166   2.91
 7.01 Intr - 203587 203285  303  0  0   21   77   167 0.320   3.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  37100  37036   65  2  2   71  111   121 0.988   9.30
S.002 Init -  38006  37935   72  2  0   65   81    60 0.954   4.12
S.003 Sngl +  89462  89749  288  1  0   29   46   211 0.912   6.34
S.004 Init + 123335 123464  130  1  1   40   79   108 0.894   3.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_1|305_aa
MSRAGERLPQQECQATIRKSPEKGEGLSSPQSHPGKAGTFGKEGAKEKSSGLSDWNHSNM
IPENTVIEEHLGKFGVICLEDLIHETAFPGKHFQESSWFLCPFHHSVARHATKNRVGFLK
EMGTAGYQGECINQLIRQLNQTQNLPPPTQCGSEKNRNSIYMSTQPFRSFKENESTDLRL
FLATRKRPPSFHPSANDFPSPSPFGRTSLYFRWTGKAAGRNRKVGRVVPRAPPPRAYENE
VVTGPRMTPRHHCEAYSSAGEEEEEEEEEKVATASWVAGRSKGYHEVSRAFGKPEIVFPV
GKGNH

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_1|918_bp
atgagcagagcaggagagaggctcccacaacaggaatgtcaggcaaccatcagaaaatct
cctgaaaaaggtgaaggcttatcaagccctcaaagccacccaggcaaagcaggcactttt
ggcaaagaaggagcaaaggaaaagagctcaggtttaagcgactggaatcattccaacatg
attcctgaaaacacagtgattgaggagcacctggggaagtttggtgtcatttgcttggaa
gacctcattcatgaaactgccttcccagggaagcatttccaggagagctcatggttcttg
tgccctttccaccactcagtggcccgtcatgctaccaaaaatagagtgggcttcctcaag
gagatgggcacagctggctatcagggtgaatgcatcaatcagctcatccgccagctgaac
cagacccagaatctccctccacccacacaatgtggcagtgagaaaaaccgcaattctatt
tacatgtcaacgcaacccttccgctccttcaaggaaaatgagtccacggaccttcggctt
ttcttagccacgagaaaacgtcctccctccttccatcctagtgccaacgacttccccagc
ccgtctcctttcggccgcacttctctgtactttcgttggaccgggaaggcggcggggagg
aacaggaaggttggccgagtcgtcccgcgcgcaccgcctccgcgcgcctatgagaatgag
gtggtaacgggcccccggatgaccccgcgtcaccactgtgaggcctacagctctgccggg
gaggaggaggaggaggaagaggaggagaaggtagctacagcaagctgggtagcaggcaga
tccaaaggatatcatgaagtttccagggcctttggaaaaccagagattgtctttcctgtt
ggaaaaggcaatcactag

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_2|292_aa
MAAAGAPDGMEEPGMDTEAKTVATEALARPLNCLEAEATAGAGAGTVAEDSGTARGSLQP
AAGPSPGPWRPRVPGFCQQRRRLGRQRRQRAGGPEDHLEQDQARREVPPGQHRLRTETED
PLITGLPPAIQKVMYKGLVPEDKTLREIKVTSGAKIMVVGSTINDVLAVNTPKDAAQQDA
KAKDNKKEPLCRQKQHRKALDKGKPEDVMPSVKGAQEHLQKVPLPSMYNKSGGKVRFTLK
LEQHRLWIGSKERTEKLPMGSIKNVVSEHMEGHEDYHMMGFSWAPRKPLTTD

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_2|879_bp
atggcagcggccggggccccggatggcatggaggaacctggcatggacaccgaagctaag
accgtggcgaccgaggcgctggcacggcccctcaactgcttggaggccgaagccacggca
ggagcgggggcagggacggtggccgaggactccggcaccgcgcgaggcagcctgcagcct
gcagccggccccagcccaggcccctggagaccccgtgtcccaggcttctgtcagcaacgg
cgaagactcgggcggcagcgcagacagcgagctggtggacctgaggatcatctggaacaa
gaccaagcacgacgtgaagttccccctggacagcacaggctccgaactgaaacagaagat
ccactgattacaggtctcccgcctgccatacagaaagtcatgtataagggactcgtccct
gaggataagacattgagagaaataaaagtgaccagtggggccaagatcatggtggttggc
tccaccatcaatgatgttttagcagtaaacacacccaaagatgctgcgcagcaggatgca
aaggccaaagacaacaagaaggagcctctctgcaggcagaaacaacacaggaaagcgttg
gataaaggaaaacccgaagatgtgatgccatctgttaagggggcccaggagcacctacaa
aaggtacccctgcccagcatgtacaataagtccggaggaaaagtgagattcaccttgaag
ttagaacaacaccgactgtggattggcagtaaagagcggactgagaaattgcccatggga
tccataaaaaatgtggtcagtgaacatatggaaggacatgaagactaccacatgatgggt
ttcagttgggccccacgaaagcctcttactactgactaa

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_3|152_aa
MGTHTLLGPSLIGFGLNYLGSEQPNRREQGSKKPGLINWTKLVFYTINLPSEADGDLVSS
SSSIQHLGESSPGIHFVYPISLSYFFKGAIGTRVAYFTQSAVHERLSVNSSVKGQGDSLL
HLPFLTPEFLFGVQEESGHTNCVKDDECGRPY

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_3|459_bp
atggggacacacaccttgctgggccccagtctaataggttttggattgaattacttggga
agtgaacaaccaaatagaagggaacaagggagcaaaaagccagggctgatcaactggacc
aagttggtgttctacaccatcaacctgcccagtgaagcagatggagacctggtgtcttca
agtagcagcatccagcatttgggtgaatcaagtccaggaattcactttgtttatcctatt
tcactgagttatttttttaaaggggcaattggaactagagtggcttatttcactcagtct
gcagtccatgaacgtttaagtgttaacagctcagtgaagggtcagggtgacagcctcttg
cacctgccatttttgacacccgagttcttgttcggtgtccaggaagaatcaggtcacacg
aactgtgtgaaggatgatgaatgtggaagaccttattga

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_4|477_aa
MPALKMEEEPQAKERSMPGTQCVFTKFRLNKKLGFIVKNHIFEEKDDIWKIVAILMPYEV
QGSLSHLGKHCNCPMRVPYFQDYILCVVSLLQMKDLGAEHLAGHEGVQLLGLLNVYLEQE
ERFQPREKGLSLIEATPENDNTLCPGLRNAKVEDLRSLANFFGSCTETFVLAVNILDRFL
ALMKVKPKHLSCIGVCSFLLAARIVEEDCNIPSTHDVIRISQCKCTASDIKRMEKIISEK
LHYELEATTALNFLHLYHTIILCHTSERKEILSLDKLEAQLKACNCRLIFSKAKPSVLAL
CLLNLEVETLKSVELLEILLLVKKHSKINDTEFFYWRELVSKCLAEYSSPECCKPDLKKL
VWIVSRRTAQNLHNSYYSVPELPTIPEGGCFDESERIHGKLNGNQKVHAWAAWPQNRLPI
GPTRSQGAQEPTGVVHELSFLGGKHNGEEQRVGLKDKEKTLCPFPCLSSFRDGSGQL

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_4|1434_bp
atgccggctttaaagatggaggaagagccacaagccaaggaacgcagcatgcctggcact
cagtgtgtattcactaaatttcgattgaataaaaagctaggatttattgtcaaaaaccat
atctttgaggagaaagatgatatttggaaaattgtagcaattttgatgccttatgaagtt
cagggaagtctaagtcatttagggaagcactgcaactgtcccatgagagtgccctatttt
caagattacattctctgtgtggtgtctttactgcagatgaaggatttgggggcagagcac
ttggcaggtcatgaaggggtccaacttctcgggttgttgaacgtctacctggaacaagaa
gagagattccaacctcgagaaaaagggctgagtttgattgaggctaccccggagaatgat
aacactttgtgtccaggattgagaaatgccaaagttgaagatttaaggagtttagccaac
ttttttggatcttgcactgaaacttttgtcctggctgtcaatattttggacaggttcttg
gctcttatgaaggtgaaacctaaacatttgtcttgcattggagtctgttcttttttgctg
gctgctagaatagttgaagaagactgcaatattccatccactcatgatgtgatccggatt
agtcagtgtaaatgtactgcttctgacataaaacggatggaaaaaataatttcagaaaaa
ttgcactatgaattggaagctactactgccttaaactttttgcacttataccatactatt
atactttgtcatacttcagaaaggaaagaaatactgagccttgataaactagaagctcag
ctgaaagcttgcaactgccgactcatcttttcaaaagcaaaaccatctgtattagccttg
tgccttctcaatttggaagtggaaactttgaaatctgttgaattactggaaattctcttg
ctagttaaaaaacattccaagattaatgacactgagttcttctactggagagagttggtt
tctaaatgcctagccgagtattcttctcctgaatgttgcaaaccagatcttaagaagttg
gtttggatcgtttcaaggcgcacagcccagaacctccacaacagctactatagtgttcct
gagctgccaacgatacctgaggggggttgttttgatgaaagtgaaagaatacatgggaaa
ttgaacggtaatcagaaggtccatgcttgggcagcatggccacagaacaggctccccatt
ggcccaactagaagccagggagcacaagagcctactggtgtagttcatgaactcagtttc
ctgggtggaaagcacaatggagaagagcagagagtgggtctgaaggacaaagagaagaca
ctgtgtcccttcccttgtttgtcctctttcagagatggttctggacaactctag

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_5|78_aa
MTHADCITGSLPFGFWMVWPAVLCSVLTTDFSKATLIANLLKLRKLQEKYTGLEVLRLKQ
AQQSLELPFSYLESQSLY

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_5|237_bp
atgacccacgctgactgcatcactggctccttgccttttggcttctggatggtttggcca
gcagtgctgtgtagtgtgctgactacagactttagcaaagccactcttatagctaactta
ctgaagttacgtaaactccaagagaaatacacgggtttagaggttctacgtttaaagcaa
gcgcagcaatctttagaacttccttttagctatttggaatcccaaagtctctattaa

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_6|83_aa
MEPLLWVGACPGVSFGVMAAAVCQFTEHQDHSPSNPLSCRHANSCCHLVTAMQTHHDRAN
YYQQSRPVLSLLDYWFKESKMIA

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_6|252_bp
atggagcccctgctttgggttggtgcttgccctggtgtgtcctttggggtcatggctgct
gcagtttgccaattcactgagcaccaagaccactctccttcaaatcccctgagttgtaga
catgccaattcctgctgccacctagtaacagcaatgcaaactcaccatgatagggctaat
tactatcagcaaagcagacctgtgctgtccctgctggactactggtttaaggagtccaaa
atgatagcctaa

>gi568815594f:77058533_77265921|GENSCAN_predicted_peptide_7|548_aa
XYILEETRIYTIKREKPRFQVKLGLYPERRLISSPHGWVQPRHGTQVVSLPEQRAKGNTQ
GEAHQYFHASAPSPPGPHHPGLSLQGLAAFSTERGQMCPAARQAVRDFYIRGEKSKTSLD
AFYCRPYVELTGTSWLKIWVLPPGKREPSAEITAEAKGNLAVDIGGERMRIRCDPRSHYS
HRSCILSQHLEFTQERRPEGHGVAALRACVARSLCAAQGLTWQPLSLGYDHWQCENGLVN
LSLLYTKSQIKADIGPGAHTLSGRNIQIMTFHVYTKGSLILKYGTELSTRADSSNSSGRS
KGNHRKSLAVSFPSFFYLDSFQGKTIIPSKGKASSRTEPLDTDGQGCGAQDHSLPLHDFR
QQHLQPFCSTALGFLDPFACVRTHMQPESPIADLPNQGIQAYPEDPQQETKELVTSCYFS
TLHQGCLSNLMPKPCIYISSPKTQDNTNGFANLAQATQLDDKLLFTWILGSLGKAASEIG
EGPQGLSECLSEEGKDGMALGQEILKLQATFDQGREKSFRWSKPFRLTLNYLDLYMLRSS
DAERAFNS

>gi568815594f:77058533_77265921|GENSCAN_predicted_CDS_7|1647_bp
nnctacatcttagaggaaactagaatatatacaatcaagagggaaaagccaagattccaa
gtcaagctgggcctttacccggaaaggagactcatctcctcacctcacggatgggtccag
ccaagacatgggactcaggtagtaagcctccctgagcagagagcaaaagggaatactcaa
ggagaagcacatcagtacttccatgcttctgctccatctccaccaggaccacaccatcct
ggcctgtcactacaaggcctagctgccttttctacagaaaggggacagatgtgtcctgca
gctagacaagcagttcgagatttctacatccggggtgaaaaaagtaagacctccctggat
gcattctattgcagaccctacgtggagttgactggcacttcatggctgaagatctgggtc
ctaccacccggaaagcgtgaaccttcagctgagatcacagctgaggccaaagggaatttg
gcagtggatattggaggagagaggatgaggattcgttgtgacccaagatcccactacagc
cacaggagctgcattttgtcacagcatcttgagtttactcaggaaaggaggcctgaggga
catggagttgctgctctgagagcctgtgtagccaggtccctctgtgcagcacaggggctg
acttggcagccattaagtctcgggtatgaccattggcagtgtgaaaacggactagtaaac
ctatctctgctttacaccaaatcacaaatcaaagctgacataggcccaggagcccatact
ttatctggaagaaatatccagattatgacattccatgtctacaccaaagggtccctgata
ctaaaatatgggacagaactatccacaagggctgacagcagcaatagctcaggcagaagt
aaaggaaaccacagaaagtctttggctgtgtcttttccttcattcttctatctagactct
ttccaaggaaaaaccatcataccctctaagggcaaagcatcaagtcgtactgagccatta
gacactgacggccaagggtgtggagcccaagaccactccctgcctctgcatgacttcaga
cagcagcacctgcagccattttgcagcactgcacttgggttcttagacccgtttgcctgt
gtgcgcacacatatgcagccagaatcacccattgcagaccttccaaaccagggaatccag
gcctacccagaggacccacagcaggaaacaaaagagctagttacttcctgctacttttct
acacttcaccagggctgcttgagtaacctcatgcccaagccctgtatttacatctctagc
ccaaagacccaagacaatacaaatggctttgctaatcttgcccaggccacacaattggat
gacaagctgctgtttacctggattctggggtccttggggaaggcagccagtgaaattgga
gaggggccacaaggtctttcagagtgtctgagtgaagagggtaaagatggaatggcctta
ggacaagagatcttgaagcttcaggctacctttgaccaaggcagagaaaagtctttcagg
tggtccaagccatttagacttacattgaactaccttgatttatacatgctacgatcatca
gatgcagaaagggcgtttaacagttaa