GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:04:38 Sequence gi568815593f:172883861_173120328 : 236468 bp : 45.98% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3049 3056 8 0 2 70 109 4 0.369 0.91 1.02 Intr + 4839 4900 62 0 2 143 64 117 0.085 13.18 1.03 Intr + 13142 13214 73 0 1 81 84 84 0.051 5.76 1.04 Intr + 17041 17230 190 1 1 91 72 95 0.256 7.79 1.05 Intr + 22262 22320 59 1 2 126 69 -12 0.190 -1.62 1.06 Intr + 25807 25901 95 1 2 70 65 219 0.847 17.41 1.07 Intr + 30854 30978 125 0 2 98 86 233 0.998 24.40 1.08 Intr + 31658 31830 173 1 2 81 66 71 0.783 2.94 1.09 Intr + 36504 36624 121 0 1 76 60 58 0.614 2.30 1.10 Intr + 40145 40249 105 1 0 104 99 124 0.833 15.51 1.11 Intr + 42649 42709 61 0 1 98 50 48 0.789 0.31 1.12 Intr + 48576 48676 101 1 2 127 91 255 0.999 29.53 1.13 Intr + 51328 51450 123 0 0 101 113 282 0.985 32.78 1.14 Term + 66849 66956 108 2 0 128 48 170 0.778 15.51 1.15 PlyA + 69734 69739 6 1.05 2.00 Prom + 72154 72193 40 -6.16 2.01 Init + 76014 76181 168 2 0 104 98 311 0.788 33.23 2.02 Intr + 84599 84739 141 1 0 87 86 225 0.998 22.75 2.03 Term + 85553 85681 129 1 0 99 36 190 0.999 13.08 2.04 PlyA + 85837 85842 6 1.05 3.00 Prom + 89478 89517 40 -7.16 3.01 Init + 91651 92225 575 0 2 103 70 554 0.602 49.69 3.02 Term + 92315 92519 205 2 1 -15 39 192 0.629 0.84 3.03 PlyA + 92813 92818 6 1.05 4.00 Prom + 96839 96878 40 -4.46 4.01 Init + 100001 100104 104 1 2 120 98 88 0.976 12.93 4.02 Term + 103190 103343 154 2 1 41 43 102 0.105 -1.61 4.03 PlyA + 105191 105196 6 1.05 5.03 PlyA - 105653 105648 6 1.05 5.02 Term - 150491 150349 143 0 2 61 41 79 0.574 -1.41 5.01 Init - 152661 152544 118 0 1 82 49 93 0.795 5.22 5.00 Prom - 165692 165653 40 -2.56 6.06 PlyA - 165701 165696 6 1.05 6.05 Term - 171979 171875 105 1 0 43 42 160 0.856 5.31 6.04 Intr - 173001 172835 167 1 2 127 23 59 0.718 2.98 6.03 Intr - 195227 195133 95 1 2 52 86 77 0.320 3.51 6.02 Intr - 202045 201909 137 1 2 24 91 132 0.351 6.47 6.01 Init - 204350 204333 18 2 0 62 89 31 0.405 0.65 6.00 Prom - 204687 204648 40 -9.85 7.00 Prom + 204899 204938 40 -4.96 7.01 Init + 206464 207541 1078 0 1 69 67 688 0.091 59.36 7.02 Intr + 224764 224958 195 2 0 33 95 129 0.942 7.49 7.03 Intr + 226662 226851 190 1 1 92 99 86 0.425 8.74 7.04 Term + 228446 228545 100 2 1 79 38 84 0.440 0.10 7.05 PlyA + 230203 230208 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 4839 4908 70 0 1 143 42 126 0.914 10.91 S.002 Term + 42852 43024 173 1 2 75 44 128 0.834 5.09 S.003 Init + 223077 223125 49 2 1 77 58 66 0.879 1.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_1|467_aa MDRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVYRSCPKGLRRMS PKTYRAIERRKHHIFGVFQLSEACGPQEALQVWEQLKVTVLVFWVAGPLWQLLCSWWPLI GANTTSKVKRSVNELYVDDPDKDSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHI DNSMKIPLNNGAGCRFEGQFSINKNQELQILSQVGERSALGRAEAPGGSWQGFAQHVPAF LEARYCHSGASQQRCEAAPFDRKLTSSQMSEFACFEQQRQPHQRPHGDLVSSASPAGGGQ DGVPGNFHVSTHSATAQPQNPDMTHVIHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLA SHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRIIPAIWFRYDLSPITVKYT ERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKKIQLGKMH >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_1|1404_bp atggacaggtttgacatctacaggaaggtgcccaaggaccttacgcagccaacgtacacc ggggccattatctccatctgctgctgcctcttcatcctcttcctcttcctctcggagctc accggatttataacgacagaagtgtataggagttgcccgaagggcctccgcaggatgagc cctaagacttacagagcaatcgagaggagaaagcatcacatctttggggtcttccagttg tcagaagcttgtggcccccaggaagccctacaggtttgggaacaactgaaggtcacagtt ctcgtcttctgggtggctggccctctgtggcagctcctttgctcttggtggcctctgatt ggggccaacactacaagcaaggtcaagcgcagtgtgaacgagctctatgtcgatgaccca gacaaggacagcggtggcaagatcgacgtcagtctgaacatcagtttacccaatctgcac tgcgagttggttgggcttgacattcaggatgagatgggcaggcacgaagtgggccacatc gacaactccatgaagatcccgctgaacaatggggcaggctgccgcttcgaggggcagttc agcatcaacaagaaccaggagcttcagatcctaagtcaagtgggtgaacgcagtgccctt gggagggccgaggcacccggtggcagctggcagggttttgctcagcacgtgccggccttc ctcgaagctcggtactgtcacagtggagcctctcaacaacgctgtgaggcagcaccattt gacaggaagttgaccagcagccagatgtcagagtttgcgtgttttgagcagcagcggcag cctcaccagagaccccacggtgacctcgtttcatcagccagccccgcaggaggaggacag gatggggtccccggcaacttccacgtgtccacacacagtgccacagcccagccacagaac ccagacatgacgcatgtcatccacaagctctcctttggggacacgctacaggtccagaac atccacggagctttcaatgctctcgggggagcagacagactcacctccaaccccctggcc tcccacgactacatcctgaagattgtgcccacggtttatgaggacaagagtggcaagcag cggtactcctaccagtacacggtggccaacaaggaatacgtcgcctacagccacacgggc cgcatcatccctgcaatctggttccgctacgacctcagccccatcacggtcaagtacaca gagagacggcagccgctgtacagattcatcaccacgatctgtgccatcattggcgggacc ttcaccgtcgccggcatcctggactcatgcatcttcacagcctctgaggcctggaagaag atccagctgggcaagatgcattga >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_2|145_aa MKFNPFVTSDRSKNRKRHFNAPSHVRRKIMSSPLSKELRQKYNVRSMPIRKDDEVQVVRG HYKGQQIGKVVQVYRKKYVIYIERVQREKANGTTVHVGIHPSKVVITRLKLDKDRKKILE RKAKSRQVGKEKGKYKEELIEKMQE >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_2|438_bp atgaagttcaatcccttcgttacctcggaccgcagtaaaaaccgcaaacgtcacttcaat gccccctcacacgtgcgcaggaagatcatgtcatccccgctctccaaggagctgcggcag aagtacaatgtccgctccatgcccatccgcaaggacgacgaggtccaggtagttcgagga cactacaaaggtcagcaaattggcaaggtagtccaggtgtacagaaagaaatatgtcatc tacatcgagcgggtgcagcgtgagaaggccaacggcacaactgtccacgtgggcattcac ccaagcaaggtggttatcaccaggctaaaactggacaaggatcggaaaaaaattcttgaa cgcaaagccaagtctcgacaagttggaaaagagaaaggcaaatataaagaagaacttatt gagaaaatgcaggaataa >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_3|259_aa MSGFSTEERATPFSLKYRVFLKNEKGQYMSPFQDIPIYADKDVFHVVVEVPLWSNAKMEV ATKDPLNPIKQDVKKRKLRYVANLFPYKGYVWNYGAIPQTWEDPGHNDKHTGCCGDNDPI DVCEIGSKVCARGEIIGVKVLGILAMIDEGETDWKVIAINMDDPDAGNYTDISDVKRLKP GYLEATVDWFRSTHDHWKALVTKKMNGKGISCMNKTVSESLFKCDPDAARAIVDALPPPC ESACTVPTDVDTWFHHQKN >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_3|780_bp atgagcggcttcagcactgaggagcgcgccacgcccttctccctgaagtaccgagtcttc ctcaaaaatgagaaaggacaatatatgtctccatttcaagatattccaatttatgcagat aaggatgtgttccacgtggtagttgaagtaccactgtggtctaatgcaaaaatggaggtt gctacaaaggaccctttaaaccccattaaacaagatgtgaaaaaaagaaaacttcgctat gttgcgaatttgttcccgtataaaggatatgtctggaactatggtgccatccctcagact tgggaagacccagggcacaatgataaacatactggctgttgtggtgacaatgacccaatt gatgtgtgtgaaattggaagcaaggtatgtgcaagaggtgaaataattggtgtgaaagtt ctaggcatattggctatgattgacgaaggggaaaccgactggaaagtcattgccattaat atggatgatcctgatgcaggcaattatactgatatcagtgatgtcaaacggctgaaaccc ggctacttagaggctactgtggactggtttagaagcactcatgaccattggaaagcatta gtgactaagaaaatgaatggaaaaggaatcagctgcatgaacaaaactgtgtctgagagc ctgttcaagtgtgatcctgatgctgccagagccattgtggatgctttaccaccaccctgt gaatctgcctgcacagtaccaacagacgtggatacgtggttccatcaccagaagaactaa >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_4|85_aa MAYHGLTVPLIVMSVFWGFVGFLVPWFIPKGPNRGARITGMSHHARLEDTFFKQKEEQKK LEELKVKAMGKGPLVIGGIKKSGKK >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_4|258_bp atggcgtatcacggcctcactgtgcctctcattgtgatgagcgtgttctggggcttcgtc ggcttcttggtgccttggttcatccctaagggtcctaaccggggtgctagaattacaggc atgagccaccacgcccggctggaagatacattttttaaacaaaaagaggagcagaagaaa ctggaggagctgaaagtgaaggccatggggaaggggcccctggtcataggtggaattaag aaatccggcaaaaaataa >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_5|86_aa MTFAWKGSGAQEHFRYPLNGLNGGPSSLGPDKHWMECGNGRQKVNIEKVNQANTLKCTDA QQNMKHRIINLVQVWPGFAGLWQKVQ >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_5|261_bp atgacgttcgcctggaagggctctggagcgcaggaacattttcggtatcctctgaacggg ctaaatggaggtccttccagtctgggccctgacaaacactggatggagtgtggcaacgga aggcaaaaagttaacatcgagaaagttaaccaagccaacactttaaaatgcacagatgct caacaaaatatgaaacacaggataatcaacttggtgcaagtttggccaggttttgcaggg ctgtggcaaaaggtccagtaa >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_6|173_aa MEEELQTPFLLVGTQTDLRDDPSTIEKLAKNKQKPITPETAKKLTHDLKAISLSLCDLMP SSYNWYVVIIIALILSPDSYEKEGRLPAAAAPGSEAAGKGRAARQDGRPALPIPGLAYCP PASPIYRRPALLCNRPGPGHGFRGTGINQTACTFEERDPYTTVQPECHYYRSA >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_6|522_bp atggaagaagagctgcagactcctttcttgcttgttgggacccaaactgatctcagagat gacccctctactattgagaaacttgccaagaacaaacagaagcctatcactccagagact gctaaaaagctgacccatgacctgaaggccatcagtctctccttatgtgacctcatgccg tcgtcctacaactggtatgtagtcataatcattgccctaattctctcccctgattcctat gagaaagaaggccgactgccggcagcagcggccccgggctcggaggcagcggggaagggc cgggcggcccggcaggacggacgcccggcgctgcccatccccggcctagcctactgcccg cccgcgagtcccatctaccgccgccccgcgcttttatgtaaccgtcccgggccggggcac ggattccgaggcacagggatcaatcagactgcgtgcacctttgaagagagagacccgtat acgactgtacaaccagaatgccactactacagaagcgcctag >gi568815593f:172883861_173120328|GENSCAN_predicted_peptide_7|520_aa MNYQQNPRDNFLSLEDCKDIENLESFTDVLDNEGALTSNWEQWDTYCEDLTKYTKLTSCD IWGTKEVDYLGLDDFSSPYQDEEVISKTPTLAQLNSEDSQSVSDSLYYPDSLFSVKQNPL PSSFPGKKITSRAAAPVCSSKTLQAEVPLSDCVQKASKPTSSTQIMVKTNMYHNEKVNFH VECKDYVKKAKVKINPVQQSRPLLSQIHTDAAKENTCYCGAVAKRQEKKGMEPLQGHATP ALPFKETQELLLSPLPQEGPGSLAAGESSSLSASTSVSDSSQKKEEHNYSLFVSDNLGEQ PTKCSPEEDEEDEEDVDDEDHDEGFGSEHELSENEEEEEEEEDYEDDKDDDISDTFSEPG YENDSVEDLKEVTSISSRKRGKRRYFWEYSEQLTPSQQERMLRPSEWNRDTLPSNMYQKN GLHHGKYAVKKSRRTDVEDLTPNPKKLLQIGNELRKLNKVISDLTPVSELPLTARPRSRK EKNKLASRACRLKKKAQYEANKVKLWGLNTEYGKPKVLRS >gi568815593f:172883861_173120328|GENSCAN_predicted_CDS_7|1563_bp atgaactaccaacagaatcctagagacaactttctttctttggaggactgcaaagacatt gaaaatctggagtctttcacagatgtcctggataatgagggtgctttaacctcaaactgg gaacagtgggatacatactgtgaagacctaacgaaatataccaaactaaccagctgtgac atctggggaacaaaagaagtggattacttgggtcttgatgacttttctagtccttaccaa gatgaagaggttataagtaaaactccaactttagctcaacttaatagtgaggactcacag tctgtttctgattccctttattaccccgattcacttttcagtgtcaaacaaaatccctta ccctcttcattccctggtaaaaagatcacaagcagagcagctgctcctgtgtgttcttct aagactctgcaggctgaggtccctttgtcagactgtgtccaaaaagcaagtaaacccact tcaagcacacaaatcatggtgaagaccaacatgtatcataatgaaaaggtgaactttcat gttgaatgtaaagactatgtaaaaaaggcaaaggtaaagatcaacccagtgcaacagagc cggcccttgttgagccagattcacacagatgcagcaaaggagaacacctgctactgtggt gcagtggcaaagagacaagagaaaaaagggatggagcctcttcaaggtcatgccactccc gctttgccttttaaagaaacccaggaactattactaagtcccctgccccaggaaggtcct gggtcacttgcagcaggagagagcagcagtctttctgccagtacatcagtctcagattca tcccagaaaaaagaagagcacaattattctctttttgtctccgacaacttgggtgaacag ccaactaaatgcagtcctgaagaagatgaggaggacgaggaggatgttgatgatgaggac catgatgaaggattcggcagtgagcatgaactgtctgaaaatgaggaggaggaagaagag gaagaggattatgaagatgacaaggatgatgatattagtgatactttctctgaaccaggc tatgaaaatgattctgtagaagacctgaaggaggtgacttcaatatcttcacggaagaga ggtaaaagaagatacttctgggagtatagtgaacaacttacaccatcacagcaagagagg atgctgagaccatctgagtggaaccgagatactttgccaagtaatatgtatcagaaaaat ggcttacatcatggaaaatatgcagtaaagaagtcacggagaactgatgtagaagacctg actccaaatcctaaaaaactcctccagataggcaatgaacttcggaaactgaataaggtg attagtgacctgactccagtcagtgagcttcccttaacagcccgaccaaggtcaaggaag gaaaaaaataagctggcttccagagcttgtcggttaaagaagaaagcccagtatgaagct aataaagtgaaattatggggcctcaacacagaatatggtaaacctaaagttttaagaagt tga