GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:35:23 Sequence gi568815581r:48830478_49044734 : 214257 bp : 46.92% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2041 2125 85 1 1 111 46 71 0.752 5.02 1.02 Intr + 11297 11410 114 1 0 87 89 112 0.071 11.84 1.03 Intr + 12925 13083 159 0 0 40 77 59 0.527 0.18 1.04 Intr + 17587 17689 103 0 1 118 100 45 0.994 8.45 1.05 Intr + 17845 17978 134 2 2 52 72 216 0.983 16.76 1.06 Intr + 18775 18900 126 0 0 46 80 171 0.997 12.98 1.07 Intr + 20612 20700 89 1 2 59 70 86 0.997 2.67 1.08 Intr + 21082 21151 70 1 1 75 96 69 0.926 5.58 1.09 Intr + 22029 22151 123 2 0 80 80 134 0.990 12.58 1.10 Intr + 24356 24430 75 1 0 65 75 85 0.836 4.61 1.11 Intr + 25615 25710 96 0 0 80 95 9 0.586 1.01 1.12 Intr + 29837 29972 136 1 1 49 99 65 0.883 3.84 1.13 Intr + 31799 31827 29 0 2 121 131 -8 0.994 4.63 1.14 Term + 32361 32528 168 2 0 114 48 115 0.982 7.98 1.15 PlyA + 33340 33345 6 1.05 2.00 Prom + 35061 35100 40 -5.16 2.01 Init + 41074 41086 13 0 1 110 95 8 0.824 3.40 2.02 Intr + 50643 50843 201 1 0 114 72 167 0.981 16.96 2.03 Intr + 51222 51288 67 1 1 92 68 44 0.393 0.76 2.04 Intr + 52869 53022 154 0 1 43 117 41 0.303 2.57 2.05 Intr + 62153 62320 168 1 0 -13 31 187 0.116 3.04 2.06 Intr + 62932 62979 48 0 0 103 95 -8 0.417 0.28 2.07 Intr + 64679 64857 179 1 2 100 111 230 0.983 25.22 2.08 Term + 65178 65292 115 0 1 116 54 138 0.812 11.24 2.09 PlyA + 69610 69615 6 1.05 3.00 Prom + 71334 71373 40 -5.46 3.01 Init + 78027 78343 317 2 2 108 89 288 0.999 25.71 3.02 Intr + 80331 80403 73 0 1 90 95 69 0.998 7.21 3.03 Intr + 82357 82544 188 0 2 71 90 180 0.999 14.99 3.04 Intr + 85599 85710 112 0 1 89 110 89 0.999 11.58 3.05 Intr + 90683 90795 113 1 2 38 92 53 0.969 -0.12 3.06 Intr + 92370 92460 91 0 1 97 90 138 0.952 14.90 3.07 Term + 96487 96657 171 0 0 116 41 119 0.958 7.83 3.08 PlyA + 98558 98563 6 1.05 4.15 PlyA - 99641 99636 6 1.05 4.14 Term - 100135 99998 138 1 0 152 42 69 0.999 6.66 4.13 Intr - 101240 101166 75 2 0 106 72 110 0.983 10.91 4.12 Intr - 102869 102728 142 1 1 47 69 222 0.692 16.56 4.11 Intr - 105765 105693 73 1 1 114 39 19 0.853 -1.94 4.10 Intr - 106647 106543 105 1 0 87 78 104 0.974 9.49 4.09 Intr - 110585 110447 139 2 1 105 58 239 0.892 22.64 4.08 Intr - 112080 111991 90 0 0 72 42 119 0.976 5.89 4.07 Intr - 113498 113448 51 2 0 123 111 33 0.996 8.20 4.06 Intr - 114361 114204 158 2 2 46 83 82 0.056 3.23 4.05 Intr - 123934 123912 23 0 2 94 90 27 0.074 0.69 4.04 Intr - 130510 130409 102 0 0 51 91 115 0.864 7.39 4.03 Intr - 131342 131250 93 1 0 92 105 79 0.998 9.08 4.02 Intr - 134003 133833 171 1 0 102 61 167 0.998 14.46 4.01 Init - 136755 136670 86 0 2 73 91 196 0.999 16.69 4.00 Prom - 139745 139706 40 -5.46 5.00 Prom + 149743 149782 40 -3.46 5.01 Init + 167269 167443 175 0 1 110 111 465 0.970 50.51 5.02 Intr + 168632 168692 61 0 1 94 97 65 0.789 5.79 5.03 Intr + 195141 195189 49 0 1 109 100 44 0.967 6.38 5.04 Intr + 195989 196040 52 1 1 94 116 54 0.992 7.28 5.05 Intr + 201433 201496 64 2 1 116 73 96 0.896 8.68 5.06 Intr + 207691 207972 282 1 0 83 89 388 0.896 34.93 5.07 Intr + 209480 209614 135 2 0 97 111 112 0.999 14.08 5.08 Intr + 210901 211023 123 1 0 81 71 102 0.994 7.50 5.09 Intr + 211765 211900 136 1 1 64 75 171 0.999 13.97 5.10 Intr + 212951 213073 123 1 0 134 78 16 0.954 5.98 5.11 Intr + 213490 213609 120 0 0 74 53 200 0.636 15.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 11231 11410 180 1 0 92 89 151 0.879 14.79 S.002 Init - 114257 114204 54 2 0 62 83 79 0.929 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:48830478_49044734|GENSCAN_predicted_peptide_1|502_aa XIIIISGKVLEDFLALAMHFADEETGQKRVIFNSVEKFYIPGGDVTCHYTFTQHFIPRRK DWIGIFRYVMCLFASAIHPEIRHHSQLHCEGAVAYLYPSLNSFSIIVWNIKLIDEQRKLE VGWKTTREYYTFMWVTLPIDLNNKSAKQQEVQFKAYYLPKDDEYYQFCYVDEDGVVRGAS IPFQFRPENEEDILVVTTQGEVEEIEQHNKELCKENQELKDSCISLQKQNSDMQAELQKK QEELETLQSINKKLELKVKEQKDYWETELLQLKEQNQKMSSENEKMGIRVDQLQAQLSTQ EKEMEKLVQGDQDKTEQLEQLKKENDHLFLSLTEQEPRNKKAALTVEEQLAQEVERLKAK DENFDLSKRLSENEIICNALQRQKERLEGENDLLKRENSRLLSYMGLDFNSLPYQVPTSD EGGARQNPGLAYGNPYSGIQESSSPSPLSIKKCPICKADDICDHTLEQQQMQPLCFNCPI CDKIFPATEKQIFEDHVFCHSL >gi568815581r:48830478_49044734|GENSCAN_predicted_CDS_1|1509_bp nngatcatcattatatcaggcaaggtccttgaagattttctagccttagccatgcatttt gctgatgaggaaacaggccagaaaagggtcatctttaacagtgtggagaagttctacatc cctggaggggacgtcacatgtcattataccttcacccagcatttcatccctcgtcgaaag gattggattggcatctttagatatgtcatgtgcctatttgccagtgctattcatcctgaa atcagacaccattctcaactgcactgtgaaggagctgttgcttatttatatccttcattg aattccttttccataattgtttggaacatcaagcttattgatgagcaacggaagctggag gtggggtggaagacaacccgtgagtattacaccttcatgtgggttactttgcccattgac ctaaacaacaaatcagctaaacagcaggaagtccaattcaaagcttactacctgcccaag gatgatgagtattaccagttctgctatgtggatgaggatggtgtggtccggggagcaagt attcctttccaattccgtccagaaaatgaggaagacatcctggttgttaccactcaggga gaggtggaagagattgagcagcacaacaaggagctttgcaaagaaaaccaggagctgaag gacagctgtatcagcctccagaagcagaactcagacatgcaggctgagctccaaaagaag caggaggagctagaaaccctacagagcatcaataagaagttggaactgaaagtgaaagaa cagaaggactattgggagacagagctgcttcaactgaaagaacaaaaccagaagatgtcc tcagaaaatgagaagatgggaatcagagtggatcagcttcaggcccagctgtcaactcaa gagaaagaaatggagaagcttgttcagggagatcaagataagacagagcagttagagcag ctgaaaaaggaaaatgaccacctctttctcagtttaactgaacaggagcctaggaacaag aaagcggcactaactgtggaggagcagttagcacaagaggtggaacgccttaaggcaaag gatgaaaactttgacctgtcaaaaagactgagtgagaacgaaattatatgtaatgctctg cagagacagaaagagagattggaaggagaaaatgatcttttgaagagggagaacagcaga ttgctcagttacatgggtctggattttaattctttgccgtatcaagtacctacttcagat gaaggaggcgcaagacaaaatccaggacttgcctatggaaacccatattctggtatccaa gaaagttcttcccccagcccgctctccatcaagaaatgccctatctgcaaagcagatgat atttgtgatcacaccttggagcaacagcagatgcagcccctttgtttcaattgtccaatt tgtgacaagatcttcccagctacagagaagcagatctttgaagaccacgtgttctgccac tctctctga >gi568815581r:48830478_49044734|GENSCAN_predicted_peptide_2|314_aa MPGLDRDAGTQHRTWAHFIQIMKVEETAASKCNWPAVTQLHDSKVEFLLPHWVLCGQQQA TCHHQEARHLTGILSSPSTGCETLRKSLNDALAGLCSSEPSLRYMTLTNQSVARKRNGQH SLQALPLCELLTHRTSQRRNTGPKPESPASIRMEAGIATKLLWRERKQLRKANKSGESMT STNGDAGILRPMRMEKVQDTWTEKMQTAGALFISPALPSYSNFPLQVARREFQTSVVSRD IDTAAKFIGAGAATVGVAGSGAGIGTVFGSLIIGYARNPSLKQQLFSYAILGFALSEAMG LFCLMVAFLILFAM >gi568815581r:48830478_49044734|GENSCAN_predicted_CDS_2|945_bp atgcctggcctagaccgagacgcgggcacccagcaccgcacctgggcccacttcatccag atcatgaaggtggaggagactgcagccagcaagtgcaactggccagcagtcacacagctc catgactccaaggttgagttcctgctgcctcactgggtcctctgtggccagcaacaagcc acgtgtcaccaccaagaggcccgacatcttacaggaatcctgtcttcaccatcaactggc tgcgagaccttgagaaagtcacttaatgatgcactggctgggctatgctctagtgaaccc agcctaagatacatgacgcttactaaccagtcagtagcaaggaaacggaatggccagcac tcgcttcaggctctgcctctgtgtgagttgctgacacacagaacatcgcagaggaggaac accgggcccaaaccggaaagccccgcctctatccgcatggaggcgggaattgccacgaag ctcctgtggagggagaggaagcagctgcggaaagccaataagagtggggaatcgatgacg tcaaccaatggggacgcggggatattacggccaatgagaatggagaaggtccaggacacg tggactgaaaaaatgcagaccgccggggcattattcatttctccagctctgccttcctac agcaacttcccactccaggtggccagacgggagttccagaccagtgttgtctcccgggac attgacacagcagccaagtttattggtgctggggcagccacagttggtgtggctggttca ggggctggcattggaaccgtgtttggcagcttgatcattggctatgccaggaacccgtct ctcaagcagcagctcttctcctatgccattcttggctttgccctgtctgaggccatgggg cttttctgtttgatggtcgccttcctcatcctcttcgccatgtga >gi568815581r:48830478_49044734|GENSCAN_predicted_peptide_3|354_aa MAESPTEEAATAGAGAAGPGASSVAGVVGVSGSGGGFGPPFLPDVWAAAAAAGGAGGPGS GLAPLPGLPPSAAAHGAALLSHWDPTLSSDWDGERTAPQCLLRIKRDIMSIYKEPPPGMF VVPDTVDMTKIHALITGPFDTPYEGGFFLFVFRCPPDYPIHPPRVKLMTTGNNTVRFNPN FYRNGKVCLSILGTWTGPAWSPAQSISSVLISIQSLMTENPYHNEPGFEQERHPGDSKNY NECIRHETIRVAVCDMMEGKCPCPEPLRGVMEKSFLEYYDFYEVACKDRLHLQGQTMQDP FGEKRGHFDYQSLLMRLGLIRQKVLERLHNENAEMDSDSSSSGTETDLHGSLRV >gi568815581r:48830478_49044734|GENSCAN_predicted_CDS_3|1065_bp atggcggagagtccgactgaggaggcggcaacggcgggcgccggggcggcgggccccggg gcgagcagcgttgctggtgttgttggcgttagcggcagcggcggcgggttcgggccgcct ttcctgccggatgtgtgggcggcggcggcggcagcgggcggggccgggggcccggggagc ggcctggctccgctgcccgggctcccgccctcagccgctgcccacggggccgcgctgctt agccactgggaccccacgctcagctccgactgggacggcgagcgcaccgcgccgcagtgt ctactccggatcaagcgggatatcatgtccatttataaggagcctcctccaggaatgttc gttgtacctgatactgttgacatgactaagattcatgcattgatcacaggcccatttgac actccttatgaagggggtttcttcctgttcgtgtttcggtgtccgcccgactatcccatc cacccacctcgggtcaaactgatgacaacgggcaataacacagtgaggtttaaccccaac ttctaccgcaatgggaaagtctgcttgagtattctaggtacatggactggacctgcctgg agcccagcccagagcatctcctcagtgctcatctctatccagtccctgatgactgagaac ccctatcacaatgagcccggctttgaacaggagagacatccaggagacagcaaaaactat aatgaatgtatccggcacgagaccatcagagttgcagtctgtgacatgatggaaggaaag tgtccctgtcctgaacccctacgaggggtgatggagaagtcctttctggagtattacgac ttctacgaggtggcctgcaaagatcgcctgcaccttcaaggccaaactatgcaggaccct tttggagagaagcggggccactttgactaccagtccctcttgatgcgcctgggactgata cgtcagaaagtgctggagaggctccataatgagaatgcagaaatggactctgatagcagt tcatctgggacagagacagaccttcatgggagcctgagggtttag >gi568815581r:48830478_49044734|GENSCAN_predicted_peptide_4|481_aa MVATKTFALLLLSLFLAVGLGEKKEGHFSALPSLPVGSHAKVSSPQPRGPRYAEGTFISD YSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQREARALELASQANRKEEEAVEPQSSPA KNPSDEDLLRDLLIQELLACLLDQTNLCRLRLRNRRERDDARLGLPPWGAGGGVRDVETR GPGSRAARGPRVGMHRRGVGAGAIAKKKLAEAKYKERGTVLAEDQLAQPFPSLYVLRVYP VIVIGEEVEEAVKQENGAMSKQLDMFKTNLEEFASKHKQEIRKNPEFRVQFQDMCATIGV DPLASGKGFWSEMLGVGDFYYELGVQIIEVCLALKHRNGGLITLEELHQQVLKGRGKFAQ DVSQDDLIRAIKKLKALGTGFGIIPVGGTYLIQSVPAELNMDHTVVLQLAEKNGYVTVSE IKASLKWETERARQVLEHLLKEGLAWLDLQAPGEAHYWLPALFTDLYSQEITAEEAREAL P >gi568815581r:48830478_49044734|GENSCAN_predicted_CDS_4|1446_bp atggtggccacgaagacctttgctctgctgctgctgtccctgttcctggcagtgggacta ggagagaagaaagagggtcacttcagcgctctcccctccctgcctgttggatctcatgct aaggtgagcagccctcaacctcgaggccccaggtacgcggaagggactttcatcagtgac tacagtattgccatggacaagattcaccaacaagactttgtgaactggctgctggcccaa aaggggaagaagaatgactggaaacacaacatcacccagagggaggctcgggcgctggag ctggccagtcaagctaataggaaggaggaggaggcagtggagccacagagctccccagcc aagaaccccagcgatgaagatttgctgcgggacttgctgattcaagagctgttggcctgc ttgctggatcagacaaacctctgcaggctcaggctgaggaaccgtcgtgaaagagatgac gcgcggctcgggcttccgccttggggagccggcggcggagtccgggacgtggagacccgg ggtcccggcagccgggcggcccgcgggcccagggtggggatgcaccgccgcggggtggga gctggcgccatcgccaagaagaaacttgcagaggccaagtataaggagcgagggacggtc ttggctgaggaccagctagcccagcccttcccttctctgtacgtgctccgagtttaccca gtgattgtgattggggaagaagtggaggaagccgttaagcaggaaaatggggctatgtca aagcagttggacatgttcaagaccaacctggaggaatttgccagcaaacacaagcaggag atccggaagaatcctgagttccgtgtgcagttccaggacatgtgtgcaaccattggcgtg gatccgctggcctctggaaaaggattttggtctgagatgctgggcgtgggggacttctat tacgaactaggtgtccaaattatcgaagtgtgcctggcgctgaagcatcggaatggaggt ctgataactttggaggaactacatcaacaggtgttgaagggaaggggcaagttcgcccag gatgtcagtcaagatgacctgatcagagccatcaagaaactaaaggcacttggcactggc ttcggcatcatccctgtgggcggcacttacctcattcagtctgttccagctgagctcaat atggatcacaccgtggtgctgcagctggcagagaagaatggctacgtgactgtcagtgag atcaaagccagtcttaaatgggagaccgagcgagcgcggcaagtgctggaacacctgctg aaggaagggttggcgtggctggacttacaggccccaggggaggcccactactggctgcca gctctcttcactgacctctactcccaggagattacagctgaggaggccagagaagccctc ccctga >gi568815581r:48830478_49044734|GENSCAN_predicted_peptide_5|440_aa MNKLYIGNLNESVTPADLEKVFAEHKISYSGQFLVKSGYAFVDCPDEHWAMKAIETFSGK VELQGKRLEIEHSVPKKQRSRKIQIRNIPPQLRWEVLDSLLAQYGTVENCEQVNTESETA VVNVTYSNREQTRQAIMKLNGHQLENHALKVSYIPDEQIAQGPENGRRGGFGSRGQPRQG SPVAAGAPAKQQQVDIPLRLLVPTQYVGAIIGKEGATIRNITKQTQSKIDVHRKENAGAA EKAISVHSTPEGCSSACKMILEIMHKEAKDTKTADEVPLKILAHNNFVGRLIGKEGRNLK KVEQDTETKITISSLQDLTLYNPERTITVKGAIENCCRAEQEIMKKVREAYENDVAAMSL QSHLIPGLNLAAVGLFPASSSAVPPPPSSVTGAAPYSSFMQAPEQEMVQVFIPAQAVGAI IGKKGQHIKQLSRFASASIK >gi568815581r:48830478_49044734|GENSCAN_predicted_CDS_5|1320_bp atgaacaagctttacatcggcaacctcaacgagagcgtgacccccgcggacttggagaaa gtgtttgcggagcacaagatctcctacagcggccagttcttggtcaaatccggctacgcc ttcgtggactgcccggacgagcactgggcgatgaaggccatcgaaactttctccgggaaa gtagaattacaaggaaaacgcttagagattgaacattcggtgcccaaaaaacaaaggagc cggaaaattcaaatccgaaatattccaccccagctccgatgggaagtactggacagcctg ctggctcagtatggtacagtagagaactgtgagcaagtgaacaccgagagtgagacggca gtggtgaatgtcacctattccaaccgggagcagaccaggcaagccatcatgaagctgaat ggccaccagttggagaaccatgccctgaaggtctcctacatccccgatgagcagatagca cagggacctgagaatgggcgccgagggggctttggctctcggggtcagccccgccagggc tcacctgtggcagcgggggccccagccaagcagcagcaagtggacatcccccttcggctc ctggtgcccacccagtatgtgggtgccattattggcaaggagggggccaccatccgcaac atcacaaaacagacccagtccaagatagacgtgcataggaaggagaacgcaggtgcagct gaaaaagccatcagtgtgcactccacccctgagggctgctcctccgcttgtaagatgatc ttggagattatgcataaagaggctaaggacaccaaaacggctgacgaggttcccctgaag atcctggcccataataactttgtagggcgtctcattggcaaggaaggacggaacctgaag aaggtagagcaagataccgagacaaaaatcaccatctcctcgttgcaagaccttaccctt tacaaccctgagaggaccatcactgtgaagggggccatcgagaattgttgcagggccgag caggaaataatgaagaaagttcgggaggcctatgagaatgatgtggctgccatgagcctg cagtctcacctgatccctggcctgaacctggctgctgtaggtcttttcccagcttcatcc agcgcagtcccgccgcctcccagcagcgttactggggctgctccctatagctcctttatg caggctcccgagcaggagatggtgcaggtgtttatccccgcccaggcagtgggcgccatc atcggcaagaaggggcagcacatcaaacagctctcccggtttgccagcgcctccatcaag