GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:48:42

Sequence gi568815593r:14551941_14753326 : 201386 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  15294  15471  178  2  1   68   44    85 0.006   1.59
 1.02 Intr +  25208  25319  112  0  1   83   48    70 0.000   1.94
 1.03 Intr +  29075  29198  124  2  1   15  105    55 0.001   0.49
 1.04 Intr +  29859  30018  160  2  1   52   99    86 0.108   5.66
 1.05 Intr +  49025  49184  160  0  1   75   78   102 0.384   6.95
 1.06 Intr +  49273  49304   32  1  2  122   33     8 0.259  -3.53
 1.07 Intr +  49411  49502   92  2  2   56   93     9 0.193  -2.09
 1.08 Intr +  50243  50392  150  1  0   83   95    44 0.271   4.96
 1.09 Intr +  55390  55518  129  0  0   35  111    91 0.338   6.99
 1.10 Intr +  56808  57077  270  2  0   63   91   129 0.933   8.44
 1.11 Term +  58201  58374  174  0  0  117   41   143 0.998  10.26
 1.12 PlyA +  58427  58432    6                                1.05

 2.00 Prom +  62351  62390   40                               -2.46
 2.01 Init +  62640  62825  186  2  0   84   30   149 0.973   7.76
 2.02 Term +  63414  63569  156  2  0   70   35   108 0.664   1.63
 2.03 PlyA +  63605  63610    6                                1.05

 3.04 PlyA -  64738  64733    6                                1.05
 3.03 Term -  71992  71989    4  0  1  117   45     0 0.003  -5.02
 3.02 Intr -  80851  80803   49  2  1  118   97    13 0.069   2.94
 3.01 Init -  87827  87629  199  2  1   77   99   150 0.310  14.07
 3.00 Prom -  88038  87999   40                               -4.06

 4.02 PlyA -  88262  88257    6                               -0.45
 4.01 Sngl -  88878  88348  531  0  0   50   42   334 0.473  21.07
 4.00 Prom -  95482  95443   40                               -4.86

 5.02 PlyA -  95987  95982    6                                1.05
 5.01 Sngl - 101386  99998 1389  1  0   86   42  1458 0.999 137.08
 5.00 Prom - 102591 102552   40                               -4.96

 6.00 Prom + 108915 108954   40                               -6.16
 6.01 Init + 112901 113037  137  1  2   73   73   245 0.941  19.12
 6.02 Intr + 124690 124741   52  1  1   64   71     7 0.013  -4.49
 6.03 Intr + 129524 129667  144  1  0   81   80   173 0.863  16.28
 6.04 Intr + 135581 135706  126  1  0   69   42   127 0.977   7.08
 6.05 Intr + 138099 138368  270  2  0   55  111   154 0.935  12.14
 6.06 Intr + 140914 141088  175  0  1   95   28   293 0.626  23.51
 6.07 Intr + 145559 145601   43  0  1   73   76    67 0.025   1.30
 6.08 Intr + 153671 153779  109  2  1   54   74    81 0.112   3.59
 6.09 Intr + 155990 156190  201  1  0   45   95    71 0.757   2.98
 6.10 Term + 156624 156806  183  2  0   63   48   131 0.985   4.14
 6.11 PlyA + 157313 157318    6                               -0.45

 7.14 PlyA - 157717 157712    6                                1.05
 7.13 Term - 159370 159257  114  1  0   53   48   141 0.985   5.17
 7.12 Intr - 161033 160934  100  1  1   92   94   135 0.999  14.61
 7.11 Intr - 161727 161604  124  1  1   82   63   195 0.967  16.04
 7.10 Intr - 164895 164766  130  0  1  120  103   131 0.991  18.07
 7.09 Intr - 166000 165920   81  1  0   88   33    72 0.738   1.53
 7.08 Intr - 166448 166367   82  1  1   82  111     8 0.549   2.14
 7.07 Intr - 169506 169430   77  0  2   75   37   123 0.880   4.21
 7.06 Intr - 175770 175576  195  0  0   92   87    47 0.842   4.61
 7.05 Intr - 181998 181849  150  0  0   28   40   105 0.054   0.06
 7.04 Intr - 189982 189887   96  1  0  115  103   139 0.981  18.31
 7.03 Intr - 194022 193930   93  0  0  130  100    90 0.998  14.56
 7.02 Intr - 197366 197232  135  2  0  100   64    60 0.865   5.56
 7.01 Intr - 199299 199129  171  0  0  141   71   193 0.901  23.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_1|526_aa FRSRFHIFGYLFGNAPLYWYRFTVLVHFHAADKDLPETGKKKRLNWTYSSTWLGRPQNHA LVPTLAALEEAFSLPLHCGSPSLGWQRPELTPSACRSQFVSGTVYIGEKGSVQQDFHFLH FSSGKGPSDRQSAGENNKGSEPGRRRAAVASDRPLQTTGRGPAPVRSAAGMAATRSPTRA RERERSGAPAAGSDQVHSWMLATSQALDTVWRMAKGFVMLAVSFLVAAICYFRRLHLYSG HKLKWWIGYLQRKFKRNLSVEAEVDLLSYCAREWKGETPRNKLMRKAYEELFWRHHIKCV RQVRRDNYDALRSVLFQIFSQGISFPSWMKEKDIVKLPEKLLFSQGCNWIQQYSFGPEKY TGSNVFGKLRKYVELLKTQWTEFNGIRDYHKRGSMCNTLFSDAILEYKLYEALKFIMLYQ VTEVYEQMKTKKVIPSLFRLLFSRETSSDPLSFMMNHLNSVGDTCGLEQIDMFILGYSLE VKIKVFRLFKFNSRDFEVCYPEEPLRDWPEISLLTENDRHYHIPVF >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_1|1581_bp ttccgaagtcgattccacattttcggatatctttttggcaatgccccactctactggtac cgatttactgtattagtccattttcatgctgctgataaagacttacctgagactgggaag aaaaagaggcttaattggacttacagttccacatggctggggaggcctcagaatcatgcc ttggtgcccactctggccgcgcttgaggaggccttcagcctgccgctgcactgtgggagt ccctctctgggctggcagaggccggagctgactccttctgcttgccggagtcagtttgtc tctggaactgtgtatattggagaaaaaggaagtgttcaacaggattttcattttcttcat ttttcttctggaaaaggcccatctgaccggcagagtgcgggggagaacaacaagggaagc gagcccgggcgccggcgggcggccgtcgcgtctgacagaccactgcagaccacgggccga ggcccagcgcccgtccgcagcgcggccggcatggcggcgacaaggagccccacgcgggca agggagcgggagcggtctggcgctcccgccgcaggaagtgaccaagttcactcctggatg ctagctacaagccaagccttagacactgtctggagaatggcaaaaggctttgtgatgttg gcagtttcatttctggtggctgccatctgctacttccggaggctacatttatattcaggg cacaagctgaaatggtggattggatatctgcagagaaaattcaaaaggaacctcagtgtg gaggcagaggttgatttactcagttattgtgcaagagaatggaaaggagagacaccccgt aacaagctgatgaggaaggcttatgaggagctattttggcggcatcacattaaatgtgtt cgacaagtaaggagagataactatgatgctctcagatcagtgttatttcagatattcagc cagggcatctcttttccatcatggatgaaagaaaaggacattgttaagcttcctgaaaaa ctgctgttttcacaaggttgtaattggattcagcagtacagttttggtcctgagaagtat acaggctcgaatgtgtttggaaaactacggaaatatgtggaattattgaaaacacagtgg 
actgaatttaatggcattagagattatcacaagagaggaagtatgtgcaacacccttttt tcagatgccattctggaatataaactttatgaagctttaaagttcatcatgctgtatcaa gtcactgaagtttatgaacaaatgaagactaaaaaggtcattcccagtctttttagactc ctgttttccagggagacatcctctgatcctttgagcttcatgatgaatcacctgaattct gtaggcgacacatgtggactagagcagattgatatgtttatacttggatactcccttgaa gtaaagataaaagtgttcagactgttcaagtttaactccagagactttgaagtctgctac ccagaggagcctctcagggactggccggagatctccctgctgaccgagaacgaccgccac taccacattccagtcttttaa >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_2|113_aa MVYKWTEKHSLTYSATYGLECSVEQPEVYTMSLHLKLMECVGGDSSSMWEQCGPRASMEE GCHVPGLPLGAGFSAGLRFMERTPGQGEEKQLVQTDDEGEDQKSCLGVVEVSV >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_2|342_bp atggtttataagtggacagaaaaacactcactcacgtattcagccacgtatggtctggag tgctctgtagaacagcccgaagtgtacaccatgtctctgcacttgaagctcatggaatgt gttggaggagactcaagctccatgtgggaacagtgtggcccaagagccagcatggaggag ggctgtcatgtacctggacttcctttgggtgccggcttttctgctggactaagattcatg gaaagaaccccagggcaaggtgaggagaagcagctggtacagactgatgacgaaggagag gaccagaaaagctgcttgggtgtggtggaagtttcagtgtaa >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_3|83_aa MAEALIKYKPSVKGRAQLGVQAFADVLLIIPKVLDQNSGFDLQETLVKIQAECSESGQLV GVDLDTGVIIIASEFPKSFVKSR >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_3|252_bp atggcagaagccctgattaaatataagcccagtgtaaagggcagggcacagcttggagtc caagcatttgctgatgtgttgctcattattccaaaggttcttgatcagaactctggtttt gaccttcaggaaacattagttaaaattcaagcagaatgttcagaatcaggtcagcttgtg ggtgtggacctggatacaggtgtcatcataattgctagtgagtttcccaaaagctttgtg aagtccagatga >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_4|176_aa MQIQHTTASLIAKVATAQDDITGDSTTSNVLIIEKLLKQEDLYISEGLHLRIITEGFEAA KEKALRFLEEVKIRKEMDRETLINVARTSLHTKVHAELADALTEAVVDSILAIKRQDEPI DLFMVVIMEMKHKSETDTSLIRGLVLDHGAWHPDMKKRVEDVYILKCNVSLEYEKT >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_4|531_bp atgcaaattcaacacacaacagcttccttaatagcaaaggtagcaacagcccaggatgat ataactggtgatagtacaacttccaatgtcctaatcattgaaaagctactgaaacaggag 
gatctctacatttctgaaggccttcatctcagaataattactgaaggatttgaagctgca aaggaaaaggcccttcggtttttggaagaagtcaaaataaggaaagagatggacagggaa acacttataaatgtggccagaacatctcttcatactaaagttcatgctgaacttgcagat gccttaacagaggctgtagtggactccattttggccattaaaagacaagatgaacctatt gatctcttcatggttgtgatcatggagatgaaacataaatctgaaactgatacaagctta atcagagggcttgttttggaccacggagcatggcatcctgatatgaagaaaagggtggag gatgtatacattctcaagtgtaacgtgtcattagaatatgaaaaaacataa >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_5|462_aa MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCSSIDKRTIEKFEKETAEMGKGSFKYGWVL DKLKAERERGITIDISLWKFETSKYYVTITDAPGHRDFIKNMITGTSQADCAVPIVAAGV GEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKK IGYNPDTVAFVPISGWNGNNMLEPSANMPWFKGWKVTRKDGNASGTMLLEALDCILPPTR PTDKPLRLPLQDVYKIGGIGSVPVGRVETGILKPGMVVTFAPVNVTTEVKSVEMHHEALG EALPGDSVGFHVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISAGYAPV LDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCAESFSDYPP LGRFAVRDMRHTVAVGVIKAVDKKAAGAGKVTNSAQKAQMAK >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_5|1389_bp atgggaaaggaaaagactcatatcaacattgtcgtcattggacacgtagattcgggcaag tccaccactactggccatctgatctataaatgcagtagcatcgacaaaagaaccattgaa aaatttgagaaggagactgctgagatgggaaagggctctttcaagtatggctgggtcttg gataaactgaaagctgagcgtgaacgtggtatcaccattgatatctccttgtggaaattt gagaccagcaagtactatgtgactatcactgatgctccaggacacagagacttcatcaaa aacatgattacagggacatctcaggctgactgtgctgtcccgattgttgctgctggtgtt ggtgaatttgaagctggtatctctaagaatgggcagacccgagagcatgcccttctggct tacacactgggtgtgaaacaactaattgttggtgttaacaaaatggattccactgagcca ccctacagccagaagagatatgaggaaattgttaaggaagtcagcacttacattaagaaa attggctacaaccccgacacagtagcatttgtgccaatttctggttggaatggtaacaac atgctggagccaagtgctaacatgccttggttcaagggatggaaagtcacccgtaaggat ggcaatgccagtggaaccatgctgcttgaggctctggactgcatcctaccaccaactcgt ccaactgacaagcccttgcgcctgcctctccaggatgtctacaaaattggtggtattggt agtgttcctgttggccgagtggagactggtattctcaaacccggtatggtggtcaccttt gctccagtcaacgttacaacagaagtaaaatctgtcgaaatgcaccatgaagctttgggt gaagctcttcctggggacagtgtgggcttccatgtcaagaatgtgtctgtcaaggatgtt 
cgtcgtggcaacgttgctggtgacagcaaaaatgacccaccaatggaagcagctggcttc actgctcaggtgattatcctgaaccatccaggccaaataagcgctggctatgcccctgta ttggattgccacacggctcatattgcatgcaagtttgctgagctgaaggaaaagattgat cgccgttctggtaaaaagctggaagatggccctaaattcttgaagtccggtgatgctgcc attgttgatatggttcctggcaagcccatgtgtgctgagagcttctcagactatccacct ttgggtcgctttgctgttcgtgatatgagacacacagttgcggtgggtgtcatcaaagca gtggacaagaaggctgctggagctggcaaggtcaccaactctgcccagaaagctcagatg gctaaatga >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_6|479_aa MPQPEAWPGASCAETPAREAAATARDGGKAAASGQPRPEMQCPAEQSCLGVHRNEDCREE SRKGYEEVSQKFTSIRRVRGDNYCALRATLFQAMSQAVGLPPWLQDPELMLLPEKLISKY NWIKQWKLGLKFDGKNEDLVDKIKESLTLLRKKWAGLAEMRTAEARQIACDELFTNEAEE YSLYEAVKFLMLNRAIELYNDKEKGKEVPFFSVLLFARDTSNDPGQLLRNHLNQVGHTGG LEQVEMFLLAYAVRHTIQVYRLSKYNTEEFITVYPTDPPKDWPVVTLIAEDDRHYNIPVR VCYIEIYYQECRNGFCFRQFPDQYIIKQPKYLIKSKEQRICHCKACGCGSPGKCKQVAFC APFKESQTGEFPANVQIRQYSLSHVAGRSNPTVTFQAQASSSSFCSVVVSLTPFATEIPL DDGTRLEETQENMPVLSFASKLLGCDLSSPSEENVRRRITCFKVKGLNKSCQISSIVVA >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_6|1440_bp atgccccagcccgaagcgtggccaggcgcgagctgcgccgagacgccggcgcgggaggcg gcggccacggcgcgggacggcgggaaggcggcggccagcgggcagccgcggcccgagatg cagtgcccggccgagcagtcttgtcttggtgtgcacaggaatgaggactgcagggaggag agcaggaagggctatgaagaggtttctcagaagttcacctccatacggcgagtccgtggt gataattactgtgcactgagggccacgctgttccaggccatgagccaggctgtggggctg ccgccctggctgcaggacccggagctcatgctgttaccagaaaaactcataagcaaatac aactggatcaagcaatggaaacttggactgaaatttgatgggaagaatgaggacctggtt gataaaattaaagagtcccttactctgctgaggaagaagtgggcaggcttggctgaaatg agaactgctgaagcaagacagatagcttgtgatgaactattcacaaatgaggcggaggaa tatagcctctatgaagctgtaaaatttctaatgctaaacagagccattgaactatataat gataaagagaaaggaaaggaagtaccatttttctctgtgcttctgtttgctcgggacaca tcaaatgacccaggacagcttctgaggaaccacctcaaccaggtgggacacactggtggt cttgaacaggttgaaatgttccttcttgcctatgctgtgcgccacaccatccaggtgtac cggctctccaagtacaacacggaagaattcatcacagtctaccccaccgacccacccaag gactggccagtggtaacgctcattgctgaggacgatcggcactataacatccccgtcaga 
gtgtgttatattgaaatctactaccaggaatgtcggaatgggttttgctttagacagttc cctgatcagtacatcattaagcaaccaaaatatctcattaaatccaaggagcagcgtatt tgccactgcaaggcttgtggatgtggcagccctgggaaatgcaaacaagtggccttctgt gccccatttaaagaatcccaaacgggagagtttcctgccaatgtgcaaatccgtcaatac tccctgagccatgtggccggcaggtccaatcctaccgtgactttccaagcgcaagcctcg tctagttctttttgctcagttgttgtctcactgacgccctttgccactgagatccctctg gatgatgggactcgattagaagaaacgcaggagaatatgccagttctcagttttgcctca aagttgttaggctgtgacttaagcagcccaagtgaagaaaatgtacgaagaaggatcacg tgctttaaggtgaaaggtttgaacaagtcctgccaaatcagctccattgtggtggcctga >gi568815593r:14551941_14753326|GENSCAN_predicted_peptide_7|515_aa VVFVAILLHSHLECREPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDA TIRKMLSFWWPLALILATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGW LTEIRAVYPAFDKNNPSNKLVSTSNTVTAAHIKKFTFVCMALSLTLKDSVQKPDISLTGR LVQTLPTRMRHQRGESKDVAPLASWLSEPSTSSEASQTSSKLTINSQGEGKAKQKLECGT LSIVLRSCEKNQERIKAAEKRNRSWTTLCGLGAWRPLLFELPVIVQTPDQTNRFQFRYPA KTQSGLCSFFHSFDLLTLEAFVKVWFPGCLLILIEYASSKNRKSKMLQNLKLLSADMTLK LCFVMFWTPNVSEKILIDIIGVDFAFAELCVVPLRIFSFFPVPVTVRAHLTGWLMTLKKT FVLAPSSVLRIIVLIASLVVLPYLGVHGATLGVGSLLAGFVGESTMVAIAACYVYRKQKK KMENESATEGEDSAMTDMPPTEEVTDIVEMREENE >gi568815593r:14551941_14753326|GENSCAN_predicted_CDS_7|1548_bp gttgtttttgtagccattttgcttcacagtcacctggaatgccgggagcccctgctcatc ccgatcctctccttgtacatgggcgcacttgtgcgctgcaccaccctgtgcctgggctac tacaagaacattcacgacatcatccctgacagaagtggcccggagctggggggagatgca acaataagaaagatgctgagcttctggtggcctttggctctaattctggccacacagaga atcagtcggcctattgtcaacctctttgtttcccgggaccttggtggcagttctgcagcc acagaggcagtggcgattttgacagccacataccctgtgggtcacatgccatacggctgg ttgacggaaatccgtgctgtgtatcctgctttcgacaagaataaccccagcaacaaactg gtgagcacgagcaacacagtcacggcagcccacatcaagaagttcaccttcgtctgcatg gctctgtcactcacgcttaaagacagtgtacagaaaccagacatcagcctgacggggcgc ttagtccagacactcccgacacgcatgcgccatcaacgtggagaaagcaaagatgtggcg cctctggcctcttggctttccgagcccagcaccagctcggaggcttcccaaacaagttcc aagttgactattaatagtcaaggagaaggaaaggctaaacagaagctggagtgtggaact ttatcaatagtcctcagatcctgtgagaaaaaccaggaaaggattaaagctgcagagaaa 
aggaataggagttggacaaccctctgcgggcttggcgcatggcgccccctgctgtttgag ttacctgtgattgttcaaaccccagatcagactaaccgcttccagttccgataccccgcg aagacccagtctgggctttgtagttttttccatagttttgacctcctcacattggaggcc tttgtcaaagtctggtttcctggttgcctgctcattttgattgagtatgcctcatccaaa aatcgaaaatccaaaatgctccaaaatctgaaacttttgagcgctgacatgactctcaaa ctctgtttcgtgatgttttggacacccaacgtgtctgagaaaatcttgatagacatcatc ggagtggactttgcctttgcagaactctgtgttgttcctttgcggatcttctccttcttc ccagttccagtcacagtgagggcgcatctcaccgggtggctgatgacactgaagaaaacc ttcgtccttgcccccagctctgtgctgcggatcatcgtcctcatcgccagcctcgtggtc ctaccctacctgggggtgcacggtgcgaccctgggcgtgggctccctcctggcgggcttt gtgggagaatccaccatggtcgccatcgctgcgtgctatgtctaccggaagcagaaaaag aagatggagaatgagtcggccacggagggggaagactctgccatgacagacatgcctccg acagaggaggtgacagacatcgtggaaatgagagaggagaatgaataa