GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:55:44 Sequence gi568815581f:9726064_9989702 : 263639 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1891 2630 740 2 2 123 105 397 0.186 36.06 1.02 Intr + 8589 8679 91 2 1 39 66 67 0.099 -0.83 1.03 Term + 17169 17275 107 1 2 107 49 89 0.799 5.47 1.04 PlyA + 17794 17799 6 1.05 2.05 PlyA - 17813 17808 6 1.05 2.04 Term - 38503 38425 79 0 1 83 29 71 0.084 -2.06 2.03 Intr - 45633 45429 205 1 1 28 45 345 0.309 22.46 2.02 Intr - 46859 46704 156 0 0 102 45 202 0.918 17.28 2.01 Init - 51208 51130 79 1 1 90 89 34 0.920 4.93 2.00 Prom - 51874 51835 40 -1.56 3.05 PlyA - 52832 52827 6 1.05 3.04 Term - 53364 53227 138 0 0 51 44 74 0.866 -2.74 3.03 Intr - 53972 53757 216 2 0 43 80 330 0.558 26.40 3.02 Intr - 55531 55419 113 2 2 125 116 65 0.988 13.10 3.01 Init - 65221 65068 154 1 1 67 98 265 0.524 23.54 3.00 Prom - 69658 69619 40 -3.06 4.06 PlyA - 70861 70856 6 1.05 4.05 Term - 76581 76323 259 2 1 82 42 162 0.637 6.02 4.04 Intr - 81538 81427 112 2 1 72 111 47 0.916 4.84 4.03 Intr - 82919 82767 153 0 0 81 80 91 0.954 7.64 4.02 Intr - 84555 84508 48 1 0 87 94 21 0.508 1.25 4.01 Init - 96008 95699 310 2 1 92 105 381 0.985 35.68 4.00 Prom - 101134 101095 40 -6.96 5.00 Prom + 101540 101579 40 -2.56 5.01 Init + 101921 101953 33 1 0 55 96 29 0.304 0.34 5.02 Intr + 107744 107831 88 1 1 85 107 30 0.769 4.14 5.03 Intr + 116432 116553 122 0 2 82 92 89 0.737 8.91 5.04 Intr + 117564 117746 183 2 0 97 37 59 0.500 1.68 5.05 Intr + 128432 128538 107 1 2 101 109 66 0.273 9.01 5.06 Intr + 131360 131513 154 2 1 60 74 202 0.899 16.07 5.07 Intr + 133879 134038 160 0 1 138 84 166 0.990 20.76 5.08 Intr + 137698 137893 196 2 1 110 75 41 0.612 3.47 5.09 Intr + 144438 144683 246 0 0 90 44 97 0.072 2.07 5.10 Intr + 154315 154453 139 1 1 102 103 129 0.999 16.27 5.11 Intr + 161869 161910 42 0 0 101 64 83 0.979 5.64 5.12 Term + 163307 163642 336 1 0 104 46 341 0.992 26.07 5.13 PlyA + 164679 164684 6 1.05 6.05 PlyA - 165478 165473 6 1.05 6.04 Term - 172141 172032 110 2 2 50 44 134 0.945 3.87 6.03 Intr - 172346 172320 27 0 0 97 90 27 0.759 1.89 6.02 Intr - 175037 174926 112 2 1 86 23 161 0.960 9.35 6.01 Init - 179117 178737 381 2 0 56 61 981 0.822 89.07 6.00 Prom - 184163 184124 40 -5.56 7.12 PlyA - 184474 184469 6 1.05 7.11 Term - 191278 191165 114 1 0 117 37 213 0.984 17.67 7.10 Intr - 192036 191938 99 0 0 87 75 183 0.891 17.21 7.09 Intr - 193642 193563 80 2 2 75 99 131 0.912 12.17 7.08 Intr - 199536 199413 124 0 1 64 119 219 0.953 22.76 7.07 Intr - 200706 200578 129 0 0 84 100 355 0.956 37.19 7.06 Intr - 208181 208103 79 1 1 111 86 63 0.987 7.95 7.05 Intr - 214137 214063 75 2 0 116 96 86 0.997 10.83 7.04 Intr - 217173 217058 116 0 2 117 67 175 0.998 17.55 7.03 Intr - 220920 220831 90 0 0 113 94 144 0.828 17.69 7.02 Intr - 233265 233139 127 2 1 78 98 97 0.563 10.38 7.01 Init - 235805 235651 155 2 2 99 53 60 0.920 3.06 7.00 Prom - 236180 236141 40 -2.16 8.07 PlyA - 236528 236523 6 1.05 8.06 Term - 239442 239374 69 0 0 97 54 21 0.495 -2.46 8.05 Intr - 243699 243614 86 1 2 97 110 51 0.439 7.74 8.04 Intr - 253400 253312 89 1 2 121 77 -9 0.298 0.91 8.03 Intr - 260052 259986 67 1 1 90 72 74 0.735 3.96 8.02 Intr - 261937 261859 79 1 1 33 52 99 0.676 -0.08 8.01 Init - 262482 262435 48 0 0 84 66 70 0.735 5.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 126797 126600 198 2 0 -57 41 300 0.805 8.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_1|312_aa XGLEPRRLVRGVKGRSISMKAPTTSRAKQGPFKTMPLRWSFGSKEKPPGASVELVEYLES RRRPRSTSQSIVSLLTGTAGEDEKSASPRSNVALPANSEDGGRAIERGPAGVPCPSAQPN HCLAPGNSDGPNTARKLKENAGQDIKLPRKFDLPLTVMPSVEHEKPARPEGQKAMNWKES FQMGSKSSPPSPYMGFSGNSKDSRRGTSELDRPLQGTLTLLRSVFRKKENRRNERAEVSP QVPPVSLYPAILSFKGVWIHELNQVRELDEKLRLFNAGTEDAGQDTVNTRLFTSLGENQT SASTPETGQAAS >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_1|939_bp ngagggttggagcccaggcgtttggtacggggcgtgaaaggcagaagcattagcatgaag gcacccaccacttcccgagccaagcagggaccattcaagaccatgcctctgcggtggtcc tttggatccaaggagaaaccaccaggtgcctccgtcgagttggtggagtacttggaatcc agacgaagacctcggtccacgagccagtccattgtgtcgctgttgacgggcactgcgggt gaggatgagaagtcagcatcgccgaggtccaacgtcgcccttcctgctaacagcgaagat ggtgggcgggccattgaaagaggtccagccggggtgccctgtccctcggctcaacccaac cactgtctggcccctggaaactcagatggtccaaacacagcaaggaaactcaaggaaaat gcagggcaggacatcaagcttcccagaaagtttgacctgcctctcactgtgatgccttca gtggagcatgagaaaccagctcgaccggagggccagaaggccatgaactggaaggagagc ttccagatgggaagcaaaagcagcccaccctccccctatatgggattctctggaaacagc aaagacagtcgccgaggcacctctgagctagacagacccctgcaggggacactcaccctt ctgaggtccgtgtttcggaagaaggagaacaggaggaatgagagggcagaggtctctcca caggtgccccccgtctccctgtacccagccatcctgtcatttaagggagtctggattcat gaactgaatcaggtcagggaattagatgaaaagttaagactcttcaacgcagggactgag gacgctgggcaagacaccgtcaacacaaggctcttcaccagcctgggggaaaaccagaca tccgcatccacacctgaaacagggcaagccgcctcctga >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_2|172_aa MISRRTGQIVLVNNIQGKFGIPFRTTYAASKHAALGFFDCLRAEVEEYDVVISTVSPTFI RSYHVYPEQGNWEASIWKFFFRKLTYGVHPVEVAEEVMRTVRRKKQEVFMANPIPKAAVY VRTFFPEFFFAVVACGVKEKLNVPEEGEPPTKIEDFLEVASFEMDEQHVRDP >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_2|519_bp atgatctcccggagaacaggccaaatcgtgttagtgaataatatccaagggaagtttgga atcccgttccgtacgacttacgctgcctccaagcacgcagccctgggcttctttgactgc ctccgagccgaagtggaggaatacgatgttgtcatcagcaccgtgagcccgactttcatc cggtcgtaccacgtgtatccagagcaaggaaactgggaagcttccatttggaaattcttt ttcaggaagctgacctacggcgtgcacccagtagaggtggcggaggaggtgatgcgcacc gtgcggaggaagaagcaagaggtgtttatggccaaccccatccccaaggccgccgtgtac gtccgcaccttcttcccggagttctttttcgccgtggtggcctgtggggtgaaggagaag ctcaatgtcccggaggagggggaacccccaactaagattgaagactttttggaggtggcc tcatttgagatggacgagcagcatgttagggatccatag >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_3|206_aa MGVMAMLMLPLLLLGISGLLFIYQEVSRLWSKSAVQNKVVVITDAISGLGKECARVFHTG GARLVLCGKNWERLENLYDALISVADPSKTFTPKLVLLDLSDISCVPDVAKEVLDCYGCV DILINNASVKVKGPAHKISLELDKKIMDANYFGPITLTKGLLVKGKDSVLRNEEEKGLIT RLQETQYSVNPFTETGKGRRLSPFNE >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_3|621_bp atgggagtcatggccatgctgatgctccccctgctgctgctgggaatcagcggcctcctc ttcatttaccaagaggtgtccaggctgtggtcaaagtcagctgtgcagaacaaagtggtg gtgatcaccgatgccatctcaggactgggcaaggagtgtgctcgggtgttccacacaggt ggggcaaggctggtgctgtgtggaaagaactgggagaggctagagaacctatatgatgcc ttgatcagcgtggctgaccccagcaagacattcaccccaaagctggtcctgttggacctc tcagacatcagctgtgtcccagatgtggcaaaagaagtcctggattgctatggctgtgtg gacatcctcatcaacaatgccagtgtgaaggtgaaggggcctgcccataagatttctctg gagctcgacaaaaagatcatggatgccaattactttggccccatcacattgacgaaaggt ctcttggtcaaaggtaaagattctgtcctcagaaatgaagaagaaaaagggcttatcacc agactacaagagacccagtattctgtcaaccccttcacagaaacagggaaaggcaggagg ctgtctccctttaatgagtaa >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_4|293_aa MDRAKQQQALLLLPVCLALTFSLTAVVSSHWCEGTRRVVKPLCQDQPGGQHCIHFKRDNS SNGRMDNNSQAVLYIWELGDDKFIQRGFHVGLWQSCEESLNGEDEKCRSFRSVVPAEEQG VLWLSIGGEVLDIVLILTSAILLGSRVSCRSPGFHWLRVDALVAIFMVLAGLLGMVAHMM YTTIFQITVNLGPEDWKPQTWDYGWSYCLAWGSFALCLAVSVSAMSRFTAARLEFTEKQQ AQNGSRHSQHSFLEPEASESIWKTGAAPCPAEQAFRNVSGHLPPGAPGKVSIC >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_4|882_bp atggacagggccaagcagcagcaggcgctgctcctcctccctgtctgcctcgccctcacc ttctccctcaccgccgtggtcagcagccactggtgtgaggggacccgacgggtggtgaag ccactgtgccaggaccagccgggagggcagcactgcattcacttcaaacgggacaacagc agcaatggcaggatggacaacaatagccaggctgtcctgtacatttgggagctgggtgat gacaagttcattcagcgggggttccatgtggggctctggcagtcctgcgaggagagcctc aacggtgaagatgaaaagtgtaggagtttccggagtgtagtgccagctgaagaacaaggt gttttgtggctgtccatcgggggcgaggtcctggatatcgttctgatactgacaagcgcc atcctcctgggctccagagtgagttgtcgcagccctgggttccactggctcagggtggat gccttggtagccatcttcatggtgctggcagggcttctaggcatggtggcccacatgatg tacacaaccatttttcaaatcactgtgaaccttggaccagaagattggaagcctcagacc tgggactatggctggtcatattgccttgcctggggttctttcgccctctgcctggctgtg tcggtctcggccatgagcaggttcacggcagcccgcctggaattcaccgagaagcagcag gcacagaacggcagtcggcactctcaacacagcttcctggaacccgaggcttcggagagc atttggaaaacaggagctgctccttgccctgctgaacaagccttcaggaatgtttctgga cacctcccaccaggcgccccaggcaaggtgtccatatgctag >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_5|601_aa MAVFLLRSMGRVTGSLLEETTRKWAQYKQACLRDLLKEPSESSGRAYRHCLAQGTWQTIE NATDIWQDDSECSENHSFKQNNPSAPYFLVGLLLFSLRTSCLSNICSLSFKVQCKRLHKA EEQVFGDQTDIYGHTLVSFIPMVDRYALLSTLQLMYTVGYSFSLISLFLALTLLLFLRKL HCTRNYIHMNLFASFILRTLAVLVKDVVFYNSYSKRPDNENGWMSYLSEMSTSCRSVQVL LHYFVGANYLWLLVEGLYLHTLLEPTVLPERRLWPRYLLLGWDPVQASAPALANTISSIS LKPIFCHCKPHVLIILAASPSIWWVLDKHIQCTLMAYILDPERFLLTRLSVQMTFGTLEI QMAFGTVEIQMAFRTVEITAKHLLPGQLTAELPLVYIELMELMAEPSLTLKFTAMWNSSL TYPILLFFSRLAKSTLVLIPLLGVHEILFSFITDDQVEGFAKLIRLFIQLTLSSFHGFLV ALQYGFANGEVKAELRKYWVRFLLARHSGCRACVLGKDFRFLGKCPKKLSEGDGAEKLRK LQPSLNSGRLLHLAMRGLGELGAQPQQDHARWPRGSSLSECSEGDVTMANTMEEILEESE I >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_5|1806_bp atggctgtcttcctccttcgttccatgggcagggttacaggatccctccttgaggaaacg actcggaagtgggctcagtacaaacaggcatgtctgagagacttactcaaggaaccttct gagagctcaggaagggcctacagacactgcttggctcaggggacttggcagacgatagag aacgccacggatatttggcaggatgactccgaatgctccgagaaccacagcttcaagcaa aacaatccttctgcaccatactttctcgttggactcttgctcttctccctgaggacctcc tgtttgtccaacatctgctcactttccttcaaagtgcaatgcaagaggctgcacaaagcg gaggaacaggtgtttggggatcagacagatatctacggccacaccttggtttctttcatt ccgatggtggatcgttatgccttgctgtcaaccttgcagctgatgtacaccgtgggatac tccttctctcttatctccctcttcctggctctcaccctcctcttgtttcttcgaaaactc cactgcacgcgcaactacatccacatgaacttgtttgcttctttcatcctgagaaccctg gctgtactggtgaaggacgtcgtcttctacaactcttactccaagaggcctgacaatgag aatgggtggatgtcctacctgtcagagatgtccacctcctgccgctcagtccaggttctc ttgcattactttgtgggtgccaattacttatggctgctggttgaaggcctctacctccac acgctgctggagcccacagtgcttcctgagaggcggctgtggcccagatacctgctgttg ggttgggatcctgtgcaggcttctgctccagccctagccaacaccatttcaagtatctct ttaaaacccattttctgccactgtaagccccatgttctgatcatcctggcagccagccca agcatatggtgggttcttgataaacatattcagtgcacactcatggcctacattctggac cctgaaaggttcttgctaaccagactttcagttcagatgacctttgggactttggagatt cagatggcctttgggacagtggagattcagatggcctttaggacagtggagatcacagct aagcatttactacctggccagctaactgctgaattgcccctggtttacattgaactgatg gaactgatggccgagcccagcctgaccttgaagttcacagctatgtggaattcgtcactt acttacccaattctgctctttttttcaagattggcaaaatcaacactggtcctcattcct ttattgggcgttcatgagatcctcttctctttcatcactgatgatcaagttgaaggattt gcaaaacttatacgacttttcattcagttgacactgagctcctttcatgggttcctggtg gccttgcagtatggttttgccaatggagaggtgaaggctgagctgcggaaatactgggtc cgcttcttgctagcccgccactcaggctgcagagcctgtgtcctggggaaggacttccgg ttcctaggaaaatgtcccaagaagctctcggaaggagatggcgctgagaagcttcggaag ctgcagccctcacttaacagtgggcggctcctacatctagccatgcgaggtcttggggag ctgggcgcccagccccaacaggaccatgcacgctggccccggggcagcagcctgtccgag tgcagtgagggggatgtcaccatggccaacaccatggaggagattctggaagagagtgag atctag >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_6|209_aa MGNSKSGALSKEILEELQLNTKFSEEELCSWYQSFLKDCPTGRITQQQFQSIYAKFFPDT DPKAYAQHVFRSFDSNLDGTLDFKEYVIALHMTTAGKTNQKLEWAFSLYDVDGNGTISKN EVLEIVMAIFKMITPEDVKLLPDDENTPEKRAEKIWKYFGKNDDGSSKGHGKGDKLTEKE FIEGTLANKEILRLIQFEPQKVKEKMKNA >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_6|630_bp atggggaacagcaaaagtggggccctgtccaaggagatcctggaggagctgcagctgaac accaagttctcggaggaggagctgtgctcctggtaccagtccttcctgaaggactgtccc accggccgcatcacccagcagcagttccagagcatctacgccaagttcttccccgacacc gaccccaaggcctacgcccagcatgtgttccgcagcttcgattccaacctcgacggcacc ctggacttcaaggagtacgtcatcgccctgcacatgaccaccgcgggcaagaccaaccag aagctggagtgggccttctccctctacgacgtggacggtaacgggaccatcagcaagaat gaagtgctggagatcgtcatggctattttcaaaatgatcactcccgaggacgtgaagctc cttccagacgatgaaaacacgccggaaaagcgagccgagaagatctggaagtactttgga aagaatgatgatgggagctccaagggtcatgggaaaggagataaacttacagagaaagaa ttcattgaggggacactggccaataaggaaattctgcgactgatccagtttgagcctcaa aaagtgaaggaaaagatgaagaacgcctga >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_7|395_aa MAEDRGKSYAGLDSNFHSSVYYSLVTLDSFTSASPNFMICKMELMPFNLLGLEPEWGEKM SNMENSFDDVSCLSPQNLGSSSPSKKQSKENTITINCVTFPHPDTMPEQQLLKPTEWSYC DYFWADKKDPQGNGTVAGFELLLQKQLKGKQMQKEMSEFIRERIKIEEDYAKNLAKLSQN SLASQEEGSLGEAWAQVKKSLADEAEVHLKFSAKLHSEVEKPLMNFRENFKKDMKKCDHH IADLRKQLASRYASVEKARKALTERQRDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGD DLMRCVDLYNQAQSKWFEEMVTTTLELERLEVERVEMIRQHLCQYTQLRHETDMFNQSTV EPVDQLLRKVDPAKDRELWVREHKTGNIRPVDMEI >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_7|1188_bp atggcggaagatagaggaaagagttatgcaggcttggattctaatttccactcctctgtt tactatagccttgtgaccctggacagttttacttctgcaagccccaatttcatgatctgc aaaatggagttgatgccatttaacttgctggggttggaacctgaatggggggaaaaaatg tccaatatggagaacagctttgacgatgtttcttgcctctctccccagaacctgggatcc tcatcgccaagcaaaaagcagagcaaggaaaacaccatcacaataaactgtgtgacgttc cctcacccagacacgatgccggaacagcagctgctgaaaccaaccgagtggagctactgc gactacttctgggctgataagaaggacccccaaggcaacggcaccgtggctgggtttgaa ctactgctccagaaacagctgaagggcaaacaaatgcagaaggaaatgtcagaattcatc cgggaaaggataaagattgaagaagactatgcgaagaacttagctaagctctctcagaac tccttggcttcacaggaggaaggctccttgggagaggcgtgggcccaggtgaagaagagc ctggcggacgaagcagaagttcacctcaagttctctgccaagcttcacagcgaggtggag aagcccctgatgaacttccgtgagaacttcaagaaagacatgaagaagtgcgaccaccac attgccgaccttcgcaagcagctcgccagccgctatgcctcggtggagaaggcccggaaa gccctcacagagcggcagagagacctggagatgaagacccagcagctggagatcaagctg agcaacaagacagaggaggacatcaagaaggcgcggagaaagtccacacaggctggagac gacctcatgcgctgtgtggatctctacaaccaggcccagtccaaatggtttgaagagatg gtgaccaccacattggagctagagcggctggaggtggagagggtagagatgatccggcag cacctgtgccagtacacgcagctgcggcatgaaacagacatgttcaaccaaagcacagtc gagcccgtggatcagctgcttcgaaaagtggacccggccaaagacagggagctgtgggtc agagagcacaagacgggcaacatccgccctgtggacatggagatctag >gi568815581f:9726064_9989702|GENSCAN_predicted_peptide_8|145_aa MAVLEGIEKPEMEETLGAAESRHKPRKVDEPQQEATGGQRLVGQVPYLRGRTSRGFDPQG DKVSRCIKSLSLSSAALCAGPLKGAEKGGEMGGEVNGYHASGTPAHPPETAHMSVRKSTG DSQVLVCDVPPPVSIWSHCSSPTYE >gi568815581f:9726064_9989702|GENSCAN_predicted_CDS_8|438_bp atggctgtactggaaggcattgaaaaaccagagatggaggaaactctgggggcagcagag agccggcacaagccaagaaaagtggacgagccccagcaggaggccactgggggccagagg cttgtaggccaggtgccgtatttgagagggcgaaccagccgtggcttcgaccctcaagga gataaggtgtcaaggtgcatcaagagcctctctctgtccagcgctgcactgtgtgctggg cccctaaagggggctgagaaaggtggggagatgggtggagaagtgaatggataccacgca tcagggaccccagcgcaccctccagagactgcccacatgagtgtccgaaaatccaccggt gattcccaggtcctggtgtgtgatgttcccccaccggtgtccatatggtcccattgttca tctcccacttatgagtga