GENSCAN 1.0 Date run: 5-Nov-116 Time: 23:13:26 Sequence gi568815597r:58476188_58677093 : 200906 bp : 39.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 1378 1240 139 1 1 93 100 141 0.971 16.15 1.00 Prom - 2008 1969 40 -5.95 2.00 Prom + 4449 4488 40 -4.25 2.01 Init + 14083 14509 427 0 1 81 58 277 0.513 20.51 2.02 Term + 16416 17503 1088 1 2 46 37 452 0.750 27.17 2.03 PlyA + 17572 17577 6 1.05 3.00 Prom + 18760 18799 40 -3.65 3.01 Init + 36296 36494 199 1 1 84 68 136 0.844 10.31 3.02 Term + 41971 42335 365 2 2 43 42 132 0.131 -2.16 3.03 PlyA + 42603 42608 6 1.05 4.08 PlyA - 43059 43054 6 1.05 4.07 Term - 46701 46639 63 0 0 85 39 95 0.721 1.21 4.06 Intr - 51148 51074 75 1 0 59 98 48 0.574 1.69 4.05 Intr - 58144 57971 174 1 0 48 100 184 0.504 14.81 4.04 Intr - 71666 71548 119 0 2 33 91 110 0.002 5.06 4.03 Intr - 72818 72672 147 0 0 59 70 69 0.000 1.49 4.02 Intr - 74314 74230 85 1 1 -11 66 87 0.000 -4.93 4.01 Init - 85719 85462 258 0 0 35 108 185 0.479 12.38 4.00 Prom - 92923 92884 40 -6.85 5.02 PlyA - 94621 94616 6 1.05 5.01 Sngl - 100969 99998 972 1 0 111 43 1786 0.934 170.88 5.00 Prom - 101527 101488 40 -8.05 6.02 PlyA - 101889 101884 6 1.05 6.01 Sngl - 103501 103418 84 1 0 93 48 250 0.996 10.18 6.00 Prom - 106667 106628 40 -3.65 7.00 Prom + 108335 108374 40 -5.75 7.01 Init + 109920 109998 79 2 1 81 92 82 0.959 9.17 7.02 Intr + 110262 110522 261 1 0 65 58 137 0.602 4.94 7.03 Intr + 112800 112849 50 1 2 54 50 52 0.011 -4.52 7.04 Term + 117164 117295 132 1 0 127 49 69 0.432 4.01 7.05 PlyA + 118450 118455 6 1.05 8.04 PlyA - 120089 120084 6 1.05 8.03 Term - 122161 122003 159 1 0 78 55 102 0.097 2.86 8.02 Intr - 125316 125172 145 2 1 76 17 125 0.044 3.56 8.01 Init - 128021 127939 83 2 2 67 98 30 0.060 2.39 8.00 Prom - 131161 131122 40 -5.35 9.07 PlyA - 131887 131882 6 1.05 9.06 Term - 135421 135300 122 2 2 77 54 75 0.187 0.66 9.05 Intr - 143728 143575 154 1 1 125 107 26 0.581 6.92 9.04 Intr - 144068 144002 67 1 1 57 93 60 0.633 1.39 9.03 Intr - 147615 147459 157 1 1 41 85 95 0.280 2.65 9.02 Intr - 148475 148311 165 0 0 35 44 125 0.264 1.81 9.01 Init - 148604 148502 103 2 1 60 80 142 0.673 9.08 9.00 Prom - 151916 151877 40 -5.75 10.11 PlyA - 152033 152028 6 1.05 10.10 Term - 155277 154863 415 2 1 50 48 395 0.241 25.45 10.09 Intr - 164632 164555 78 0 0 71 76 57 0.000 0.55 10.08 Intr - 180049 179894 156 0 0 97 83 77 0.452 6.30 10.07 Intr - 185324 185219 106 0 1 105 77 48 0.983 3.75 10.06 Intr - 189444 189312 133 0 1 85 97 46 0.984 4.50 10.05 Intr - 191039 190851 189 2 0 43 67 190 0.996 11.26 10.04 Intr - 191734 191660 75 1 0 78 100 40 0.884 2.99 10.03 Intr - 192495 192445 51 0 0 93 87 19 0.578 0.49 10.02 Intr - 197463 197386 78 0 0 78 78 58 0.854 2.63 10.01 Intr - 199393 199290 104 2 2 36 103 94 0.845 4.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 70723 70772 50 0 2 94 97 32 0.911 3.25 S.002 Intr + 74414 74622 209 0 2 91 67 135 0.975 9.57 S.003 Term + 74708 74902 195 1 0 50 42 183 0.991 6.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_1|47_aa MAGDGIILIGLDLSGSNIQADDGYESTPPKQNGYCVLGEMQPRGLRX >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_1|141_bp atggcaggggatggcattatcttgattggcttggacctatcaggatccaacattcaagct gatgatgggtatgagtcaacaccacccaaacagaatggctactgcgtgttaggagagatg caaccacggggcttacgaann >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_2|504_aa MAQELRDECTSLSSRFDQLEEKVSVMEDEINEMKQEEKVREKRIKRNEQSLQEIWDYVKR PNLHLIGVPESDRENGTKLENTLQDIIQENFPNLARQANIQIQEIQRMPQRYSSRRATPR HIIVRFTKVEMKEKMLRAAREKEIQTTIREYYKHLYTNKLENLEEMDKFLDTYSLPRLNQ EEVESLKRPITGSEIGAIINSLPTKKSPGPDGFTAEFYQRYKKELVPFLLKLFQSTEKER ILPNSFYEASIILIPKPGRDTTKKENFRPISLMNINAKILNKILANQIQQHIEKLIHHDQ VGFIPGMQGWFNIRKSINVIQHINRTNEKNHMIISIDAEKAFDKIQQPFMLKTLNKLGID GTYLKIVRAIYDKLTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEK EIKGIQLGKEQVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLY TNNRQTESQIMSELPFTIASKRIK >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_2|1515_bp atggcacaagaactacgtgacgaatgcacaagcctcagtagccgattcgatcaactggaa gaaaaggtatcagtgatggaagatgaaattaatgaaatgaagcaagaagagaaggttaga gaaaaaagaataaaaagaaatgaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacatctgattggtgtacctgaaagtgacagggagaatggaaccaagttggaa aacactctgcaggatattatccaagagaacttccccaatctagcaaggcaggccaacatt caaattcaggaaatacagagaatgccacaaagatactcctcgagaagagcaactccaaga cacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagctaga gagaaagaaatacaaactaccatcagagaatactataaacacctctacacaaataaacta gaaaatctggaagaaatggataaattcctggacacatacagtctcccaagactaaaccag gaagaagttgaatctctgaagagaccaataacaggatctgaaattggggcaataattaat agcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccagagg tacaagaaggagctggtaccattccttctgaaactattccaatcaacagaaaaagagaga atcctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcagagac acaacaaaaaaagagaattttagaccaatatccctgatgaacatcaatgcaaaaatcctc aataaaatactggcaaaccaaatccagcagcacatcgaaaagcttatccaccatgatcaa gtgggcttcatccctgggatgcaaggctggttcaacatacgcaaatcaataaacgtaatc cagcatataaacagaaccaacgagaaaaaccacatgattatctcaatagatgcagaaaag gcctttgacaaaattcaacagcccttcatgctaaaaactctcaataaattaggtattgat gggacgtatctcaaaatagtaagagctatttatgacaaactcacagccaatatcatactg aatgggcaaaaattggaagcattccctttgaaaactggcacaagacagggatgccctctc tcaccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaagagaaa gaaataaagggtattcaattaggaaaagagcaagtcaaattgtccctgtttgcagatgac atgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatag >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_3|187_aa MWKQPECSSANEQIKKMWYKHTSKYYLALKKEILSYATIRINLEDIKLSEISQSQMDKYE TIPLTCAWATKVKFCQKEKRRGERGEGRGERGEGRGERGEGRGERRGEEKRREEKRREEK RREEKRREEKRREGKRREGKKRKEKRKENIIIAYWICFLLVAGWRMEWRGVGLESMKAIM SLLQKFR >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_3|564_bp atgtggaaacaacctgaatgttcatcagcgaatgaacagattaagaaaatgtggtataaa catacatcaaaatattatttggccttaaaaaaggaaatcctgtcatatgcaacaatacgg ataaaccttgaggacattaagctaagtgaaataagccagtcacaaatggacaaatacgag acgattccacttacatgtgcctgggcaacaaaagtgaaattctgtcagaaagaaaagagg agaggggagaggggagaggggagaggggagaggggagaggggagaggggagaggggagag gggagaggagagaggagaggagaagagaagagaagagaagagaagagaagagaagagaag agaagagaagagaagagaagagaagagaagagaagggaagggaagagaagagaagggaag aaaagaaaagaaaaaagaaaagagaatatcatcatagcttactggatctgtttcttactt gtggcagggtggagaatggaatggaggggagtaggactggagtcaatgaaagctatcatg agcctgctacagaagtttaggtaa >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_4|306_aa MSVMRKYRALREHPARRTDVSWSGSRNQFREDLLEKVLLKLRPEDKQELVMYKKRKGKCI AEATRCAITDYISKDYSAMNVSYSTQCLAQDSGPRVDFSDGDLGNWPSGGQCHLEPEGRQ RGMFHGRPRGCIIFYGHGECTGVRGTSITKKFREGSPLEGRADVQKELDQLEIPENITPI YFTNDIMLTRQDEQEVASMLETLWMEEFKNDMLTEKDARYLAVKEVLCHLIECNKDVPGI SQINWVIHVVDSPIINAFVLPYMFNRPYSRKLEAEADKIGLLLAAKSTQCEDKDGDIYDD PLPLNE >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_4|921_bp atgagtgttatgaggaaatacagggcgttacgagaacatccagcaaggagaactgacgta agctggagtggaagtaggaatcagtttagggaagatctgctcgagaaagttctgcttaag ctgagacctgaggacaaacaggagcttgtcatgtataagaagaggaaaggaaaatgtatt gcggaggcaactagatgtgcaataactgactatattagcaaggactattcagctatgaat gttagctattcaactcaatgtctggcacaagatagtggcccccgtgtggatttcagtgat ggcgacctgggaaactggcccagtggagggcaatgccatttagagcccgaaggcagacag aggggaatgttccatgggaggcccagaggatgcatcatattctatggccatggggaatgc actggcgtgaggggcactagcatcactaagaagttcagagaaggctctcctctggaggga agggctgacgtccaaaaggagctggatcaactggaaattccagagaacatcacacccatc tatttcaccaatgacattatgctaaccaggcaagatgagcaagaggtggctagcatgctg gaaactttgtggatggaagaatttaaaaatgatatgctaactgagaaagatgcccgatac ctggctgttaaagaagtgctttgtcatctaattgaatgcaataaagatgttccagggatc tctcagatcaattgggttattcatgtggttgattccccaattattaatgccttcgtgctt ccatatatgtttaatagaccatacagcagaaaattggaggccgaagctgacaaaattgga ctactgcttgctgcaaagtctactcagtgtgaagataaggatggagacatttatgatgat ccacttccacttaatgaatag >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_5|323_aa MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG GLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVITNR RKSGKYKKVEIKELGELRKEPSL >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_5|972_bp atggctcggggccccggcctcgcgccgccaccgctgcggctgccgctgctgctgctggtg ctggcggcggtgaccggccacacggccgcgcaggacaactgcacgtgtcccaccaacaag atgaccgtgtgcagccccgacggccccggcggccgctgccagtgccgcgcgctgggctcg ggcatggcggtcgactgctccacgctgacctccaagtgtctgctgctcaaggcgcgcatg agcgcccccaagaacgcccgcacgctggtgcggccgagtgagcacgcgctcgtggacaac gatggcctctacgaccccgactgcgaccccgagggccgcttcaaggcgcgccagtgcaac cagacgtcggtgtgctggtgcgtgaactcggtgggcgtgcgccgcacggacaagggcgac ctgagcctacgctgcgatgagctggtgcgcacccaccacatcctcattgacctgcgccac cgccccaccgccggcgccttcaaccactcagacctggacgccgagctgaggcggctcttc cgcgagcgctatcggctgcaccccaagttcgtggcggccgtgcactacgagcagcccacc atccagatcgagctgcggcagaacacgtctcagaaggccgccggtgacgtggatatcggc gatgccgcctactacttcgagagggacatcaagggcgagtctctattccagggccgcggc ggcctggacttgcgcgtgcgcggagaacccctgcaggtggagcgcacgctcatctattac ctggacgagattcccccgaagttctccatgaagcgcctcaccgccggcctcatcgccgtc atcgtggtggtcgtggtggccctcgtcgccggcatggccgtcctggtgatcaccaaccgg agaaagtcggggaagtacaagaaggtggagatcaaggaactgggggagttgagaaaggaa ccgagcttgtag >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_6|27_aa MDCRRGWQEHDDDDDDDDDDDDNDDDK >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_6|84_bp atggactgccggagaggttggcaggagcacgatgatgatgatgatgatgatgacgacgac gacgacaacgatgatgacaaatga >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_7|173_aa MASQENPSKDKALILSDNLLPLDGMDNISSHIGPGLSVLRGSTALLLSAVMSHKTHTRDG SRGEEEWERGKVQKSRKNEAGRKEETSLFSNSGMCGPKTKGKKVLVVPLTLVGLGKAANA LTVLREASSQKYLLRTYYVLVSVEVLEKKAVNNRDACPLGAYIPVMTGEFDLG >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_7|522_bp atggcctctcaagagaatccaagtaaggataaggctttgattttatccgacaacttacta cctttagatggtatggataacatctccagtcacatcggtccagggctgagtgtgctgagg ggctcaactgcactcctactctcagccgtcatgagtcacaagactcacaccagggatgga agtagaggcgaggaggagtgggagagaggaaaagttcagaagtctaggaagaatgaggca ggcaggaaagaggagacttcattgttctctaattctgggatgtgtgggcccaagacaaag ggcaagaaagtccttgttgttcctctgaccttagtgggattaggtaaagcagccaatgct ctaactgttctcagggaagcctcaagtcagaaatacttattaagaacctactatgtgcta gtatcagtcgaggtgctagagaagaaagcagtgaacaacagagatgcttgcccccttgga gcttatatcccagtgatgactggggaatttgacttgggatag >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_8|128_aa MAELVKIHQPSIHCLQETHLTHKDSNLRARKNNLKFHMEPKERPDIAKARLSKRKKSGGI TLLDFKLYYKAVVTKTGAGSQIGKLPLQQLSARIKNPPKPANSHNLETEIPRKPPKSRSA LWRPDFRP >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_8|387_bp atggcagaattggtaaaaatccaccaaccaagtatccactgtcttcaagagactcaccta acacataaggactcaaacttaagagccaggaaaaacaatctgaaatttcatatggaacca aaagaaagacccgacatagccaaagcaagactgagcaaaaggaaaaaatctggaggcatc acattgctggacttcaagctatactacaaggctgtggttaccaaaacaggagctggcagc caaattgggaagctgcccctgcagcagctctctgccaggatcaaaaacccaccaaaacca gcaaacagccacaacctagaaactgaaatacccaggaagccaccaaaatcaagaagtgcc ttgtggcggccagacttccgtccatga >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_9|255_aa MKLRTLAVSATALKVARLEFVPFDVRMCSEFLSSGVKLQTFAVSVTALNALRLELFVPPG GLMVSLASGVKLQIFTVSVTAHKSSVDPKTLGWSMGLGAVEQGAALIGEAWAAQEPMEGV GGSGMAGCRSRALPRGKAAKAWREVDSESWRFWCKVANSVPTYKVLFHPALECVPVLTTL ALVPPAACTIPTAPLILCQAPEILGFPSSPVSTSADNSGFMGSNESLDWPPSALCPPVFL TLLTAASTQLIKPKA >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_9|768_bp atgaagctgcggaccctcgcggtgagtgctacagctcttaaggtggcgcgtctggagttt gttccttttgatgttcggatgtgttcggagtttctttcttccggagtgaagctgcagacc ttcgcggtgagtgttacagctcttaatgcactgcgtctggagttgttcgttcctcccggt gggctcatggtctcgctggcttcaggagtgaagctgcagatcttcacggtgagtgttaca gctcataaaagcagtgtggacccaaagacccttgggtggtcgatgggactgggcgccgtg gagcagggggcggcgctcatcggggaggcttgggccgcacaggagcccatggagggggtg ggaggctcaggcatggcgggctgcaggtctcgagccctgccccgcggaaaggcagctaag gcctggagagaagtggattctgaaagctggcggttctggtgtaaagttgctaatagtgtg cccacttataaggttctcttccacccagcactggagtgtgtccctgtcctcaccacttta gccctggtccctcctgcagcatgcacaattcccactgctcctttgattctgtgccaggca cctgagattcttggcttcccttcttctcctgtctccacttcagctgataactcaggcttc atgggttcaaatgaaagtcttgactggcccccatctgccctttgtcctccagtctttctc accttattgacagcagcatccacacagctgatcaagccaaaagcctga >gi568815597r:58476188_58677093|GENSCAN_predicted_peptide_10|461_aa XQAVYNRPQTVDKVRIRDRKDAVEAYQLAQRLQSMRTRRRRVRDPWGNWCDAKDLEGQTF EEPFQVKVASEALLIMDLHAHVSMAEVIGLLGGRYSEVDKVVEVCAAEPCNSLSTGLQCE MDPVSQTQASETLAVRGFSVIGWYHSHPAFDPNPSLRDIDTQAKYQSYFSRGGAKFIGMI VSPYNRNNPLPYSQITCLVISEEISPDGSYRLPYKFEVQQMLEEPQWGLVFEKTRWIIEK YRLSHRSGVEVCDMQTVSTVRTSAALNPTEETVASTCQPEQVSSALPSSHFSSWVLMRLL QVIATCPKAPVAQMEGSMPSEYRRAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSELG KKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKEK LDSVIEFSIPDSLLIRRITGRLITPRVAVPTTRSSTLQKSP >gi568815597r:58476188_58677093|GENSCAN_predicted_CDS_10|1386_bp naacaggctgtgtataataggccacaaacagttgacaaagtacgaatcagagacagaaaa gatgcagtagaagcataccaacttgcccagcgtctgcagtctatgcgtacaaggagacgt agggtccgagacccatggggaaactggtgtgatgcaaaggacttagaaggacaaacgttt gaggagccatttcaggtgaaagtggcttcagaagcacttttaataatggatttgcatgct catgtttctatggcagaagtgattggtctgttaggaggaagatactcagaagttgataaa gtagttgaagtctgtgcagcagaaccatgtaacagtctgagtacaggactacagtgtgag atggatcctgtatcacaaacacaggcctcagaaaccttggctgttagaggcttcagtgtt attggatggtatcattctcatcctgcttttgatcctaatccttccttacgagatattgac acacaagctaaataccagagttacttctccagaggaggtgcaaagttcattgggatgatt gttagtccctataatcgaaataatcccttaccatattctcagattacctgcctggttata agtgaggaaattagcccagatggctcttatcgcttaccttacaaatttgaagtacagcag atgttagaagaacctcagtggggattagtatttgaaaagacaagatggataatagaaaaa tacaggctctcccataggagtggggttgaggtctgtgacatgcagactgtaagcacagtc agaacctctgcagctctcaacccaacagaggagactgtggcttcaacatgccaacctgag caggtctcctctgcattaccttcctcccatttctcttcctgggtcctgatgaggctgcta caggtaatagctacatgccctaaagccccagttgctcagatggaaggctccatgcccagt gagtacaggagggccggtaaaggcacccaggcacccagattggctgaaaacttctgtgtc tgccatttagctactggggacatgctgagggccatggtggcttctggctcagagctagga aaaaagctgaaggcaactatggatgctgggaaactggtgagtgatgaaatggtagtggag ctcattgagaagaatttggagacccccttgtgcaaaaatggttttcttctggatggcttc cctcggactgtgaggcaggcagaaatgctcgatgacctcatggagaagaggaaagagaag cttgattctgtgattgaattcagcatcccagactctctgctgatccgaagaatcacagga aggctgatcaccccaagagtggccgttcctaccacgaggagttcaaccctccaaaagagc ccatga