GENSCAN 1.0	Date run: 5-Nov-116	Time: 21:44:56

Sequence gi568815576r:43726004_43962382 : 236379 bp : 48.78% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -   1741   1736    6                               1.05
 1.06 Term -   2355   2121  235  2  1   47   54   145 0.167   2.79
 1.05 Intr -   9990   9854  137  0  2   74   80   121 0.968   9.27
 1.04 Intr -  15028  14855  174  1  0   53   84    62 0.600   2.44
 1.03 Intr -  19014  18870  145  2  1  129   45    71 0.165   7.18
 1.02 Intr -  47100  46884  217  1  1   65   34   197 0.008   9.56
 1.01 Init -  47675  47627   49  2  1   86   58    63 0.981   2.21
 1.00 Prom -  52035  51996   40                              -5.76

 2.00 Prom +  55241  55280   40                              -4.26
 2.01 Init +  76553  76770  218  1  2   89   11   220 0.030  12.56
 2.02 Intr +  84191  84248   58  2  1   70   98     1 0.061  -1.81
 2.03 Intr +  84590  84707  118  1  1   99  115   -15 0.027   2.34
 2.04 Intr +  97693  97753   61  2  1   96  105    65 0.095   6.79
 2.05 Intr +  98821  98903   83  1  2   91   72     0 0.847  -2.02
 2.06 Term +  98955  99118  164  1  2   68   47   159 0.811   7.90
 2.07 PlyA +  99440  99445    6                               1.05

 3.12 PlyA -  99632  99627    6                               1.05
 3.11 Term - 100110  99998  113  1  2   89   38   130 0.921   6.82
 3.10 Intr - 103195 103057  139  1  1  116   71   223 0.998  23.44
 3.09 Intr - 107731 107637   95  2  2  112  100   146 0.798  17.88
 3.08 Intr - 112834 112673  162  2  0   -3   70   131 0.598   2.15
 3.07 Intr - 112990 112864  127  1  1  115   75   147 0.998  16.35
 3.06 Intr - 114022 113942   81  1  0  124  121    71 0.999  13.83
 3.05 Intr - 115929 115799  131  1  2   81   75   261 0.836  24.51
 3.04 Intr - 118864 118778   87  2  0   85   58    38 0.037   0.44
 3.03 Intr - 119204 119115   90  0  0   55   55    69 0.012   0.27
 3.02 Intr - 123966 123952   15  1  0  114   99    24 0.008   1.62
 3.01 Init - 136379 136211  169  2  1  107  105   313 0.778  34.60
 3.00 Prom - 147559 147520   40                              -4.26

 4.10 PlyA - 149616 149611    6                               1.05
 4.09 Term - 154882 154792   91  0  1  112   55    42 0.701   0.49
 4.08 Intr - 155671 155555  117  0  0   89   94   113 0.879  11.58
 4.07 Intr - 158342 158210  133  0  1   35  116   189 0.302  16.10
 4.06 Intr - 160485 160300  186  1  0  104   58   185 0.999  16.76
 4.05 Intr - 161648 161588   61  2  1  117   66   120 0.996  11.01
 4.04 Intr - 163535 163326  210  2  0   54   92   240 0.999  20.11
 4.03 Intr - 163861 163796   66  1  0  112  105    27 0.982   5.90
 4.02 Intr - 165291 165059  233  1  2   88   64   394 0.997  34.49
 4.01 Init - 165877 165685  193  1  1  104  109   363 0.993  39.13
 4.00 Prom - 177497 177458   40                              -5.46

 5.03 PlyA - 178097 178092    6                               1.05
 5.02 Term - 187349 187068  282  2  0   17   40   706 0.698  54.13
 5.01 Init - 188848 188774   75  1  0   60   76   -16 0.271  -4.51
 5.00 Prom - 189403 189364   40                             -12.21

 6.00 Prom + 189444 189483   40                              -5.76
 6.01 Init + 189777 189814   38  2  2   24   59    18 0.598  -8.02
 6.02 Intr + 189979 190284  306  1  0  121   37   353 0.049  29.06
 6.03 Intr + 192544 192565   22  1  1   91   93     6 0.013  -1.05
 6.04 Intr + 196078 196147   70  0  1   71   49    40 0.034  -2.85
 6.05 Intr + 196290 196448  159  1  0  103   48    35 0.044   0.96
 6.06 Intr + 197874 198095  222  1  0   42   65   251 0.107  16.30
 6.07 Intr + 200932 201164  233  2  2   97   69   202 0.963  16.69
 6.08 Intr + 202821 202886   66  2  0  121  103    -2 0.876   3.70
 6.09 Intr + 206875 207084  210  0  0   65  113   271 0.965  26.41
 6.10 Intr + 208603 208663   61  0  1  110   75    25 0.598   1.71
 6.11 Intr + 211048 211269  222  2  0   97   80   121 0.598  10.30
 6.12 Intr + 217609 217622   14  2  2  125   71     2 0.122  -3.30
 6.13 Intr + 218688 218792  105  2  0   85   99    67 0.377   7.91
 6.14 Intr + 222130 222315  186  0  0   60   42    78 0.326   0.29
 6.15 Term + 225653 225724   72  1  0   69   48   122 0.607   4.21
 6.16 PlyA + 226779 226784    6                               1.05

 7.02 PlyA - 226949 226944    6                               1.05
 7.01 Sngl - 229576 229298  279  1  0   70   44   241 0.446  11.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  54712  54821  110  1  2  114   82    32 0.839   5.13
S.002 Sngl +  76553  76774  222  1  0   89   45   237 0.963  14.65
S.003 Init -  95937  95880   58  0  1   92  119    35 0.892   8.48
S.004 Init +  97713  97753   41  2  2   75  105    56 0.806   5.66
S.005 Init + 119446 119522   77  0  2   59   65   148 0.892   8.16
S.006 Intr + 125406 125559  154  0  1  100   77   108 0.908  10.97
S.007 Term + 129071 129234  164  0  2   81   47   140 0.950   7.30
S.008 Term + 189979 190288  310  1  1  121   54   349 0.944  29.33
S.009 Init + 197909 198095  187  1  1   93   65   241 0.805  21.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_1|318_aa
MGFRHVGQAGLELLTSATAVANPTLSSLDVKRILFQKITDRGDELQKAFQLLDTGQNLTV
SKSELRRIITDFLMPLTREQFQDVLAQVLNIFSRSGGSLVTFRSKQVQCRRVQFKPQVWG
WFSTPFSAPPSPRALEWPLPNFSLTNDKSLGDVALVGVSSLPGCCFPVLTLEIHMGCSGV
LKKRIVLMSSILPKEVFKNIKTVMKAFELIDVNKTGLVRPQELRRVLETFCMKLRDEEYE
NTICKSKDLELTQMPINDRLDKENVADIHQEYYAAIKKNEFMSFAGTWMKLETVILSKLT
QEQKTKHRMFSLIVRVEQ

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_1|957_bp
atggggtttcgccatgttggccaggctggtctcgaactcctgacctcagccacagctgtt
gcaaatccaacactgtcctccttagatgttaaacggattttatttcaaaaaattaccgac
agaggggatgagttgcaaaaagcctttcagctgctggatactggtcagaacttgactgtg
tcaaaaagtgaactgagaagaatcatcacagacttcctgatgccgctcacacgagaacag
tttcaggacgtgttggctcaggtgctgaatatcttcagcagatctggtggctctctggtc
acatttaggtctaaacaggtccagtgcaggcgagttcagtttaaacctcaagtgtggggc
tggttcagcacgcccttctcagctcctccaagcccgagggccctggagtggcccttgcca
aacttcagtctgacaaatgataagagccttggagacgtcgctctggtgggtgtgagctcg
ctgccaggttgctgcttcccggtgctcacgctagagatccatatgggttgctctggggtt
ttaaaaaagcgcatcgtgctcatgtcctccattttaccaaaggaggttttcaaaaacatt
aagactgttatgaaagcctttgagctcattgatgttaacaagactggactggttcgaccg
caggagctaagaagggttctggagaccttctgtatgaagttaagagacgaggaatacgaa
aacactatttgtaagagcaaagacttggaactaacccaaatgcccataaatgatagactg
gataaagaaaatgtggcagatatacaccaggagtactatgcagccataaaaaagaatgaa
ttcatgtcatttgctggtacatggatgaagttggaaaccgttattctcagcaaactaaca
caggaacagaaaaccaaacaccgcatgttctcactcattgtgagagttgaacaatga

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_2|233_aa
MKDKSQRSASLSAKPAPPKPEPKPKRAPTKKREKVHKGKKGKADIGKEENNPAQHGDAKT
DRAQKAECVGDANQLVRTKFKKYEGPNCAWHTVPVSCRDQSMGAIEISVQGEEVDSWNRA
VFPWRAKFTVSVPGTVPGMEQVQSNDRLHEVSHINDRQPRSPRGWLATGPRLSIASANVD
TRRDEVDRHAGGRANLIRVKQGYTRKMVLPRLSASGAQNKLNDKDPGILEIYY

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_2|702_bp
atgaaggacaaatcacagagatctgcaagcttgtctgctaaacctgctcctccaaagcca
gagcccaagcctaaaagggcccctacaaagaagagagagaaggtacacaaagggaaaaag
ggaaaagctgatattggcaaggaggagaataaccctgcacaacatggagatgccaaaaca
gaccgggcacagaaagctgaatgtgttggagatgccaaccaacttgtgaggactaaattt
aaaaagtatgagggaccaaactgtgcttggcacacagtgccagtcagctgcagagatcag
agcatgggtgctatagagatctctgtacaaggggaagaggtagatagctggaacagggca
gtttttccctggagagcgaaatttacagtctcagtgccgggaacagtgcctggcatggag
caggtgcagtcgaatgaccgactccatgaagtgagccacataaatgacagacaaccccgg
tccccacggggttggctggctactggcccccgactctccatcgcttcagcgaatgtggat
acacggagggatgaggtggacagacacgcaggcggccgtgccaacctgatccgagtgaaa
cagggctacactcgcaaaatggtcctcccacggctgagcgcctcaggtgcacagaacaaa
ctgaatgataaagacccggggattttagaaatttactattaa

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_3|402_aa
MAESEAETPSTPGEFESKYFEFHGVRLPPFCRGKMEEIANFPVRPSDVWIVTYPKSGREH
AGRGDEEPVAETGAADMKGPQWVKLCPFLLGGVEPVTVPAPLGRSREERACPVHPRVCTA
GTSLLQEVVYLVSQGADPDEIGLMNIDEQLPVLEYPQPGLDIIKELTSPRLIKSHLPYRF
LPSDLHNGDSKVIYMARNPKDLVVSYYQFHRSLRTMSYRGTFQEFCRRFMNDKQPEPSVV
VDGLAGAGCFPDTVSSSDSDHVGQARGHFTDEDTEAEGRERGRDPRAVGYGSWFEHVQEF
WEHRMDSNVLFLKYEDMHRDLVTMVEQLARFLGVSCDKAQLEALTEHCHQLVDQCCNAEA
LPVGRGRVGLWKDIFTVSMNEKFDLVYKQKMGKCDLTFDFYL

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_3|1209_bp
atggcggagagcgaggccgagacccccagcaccccgggggagttcgagagcaagtacttc
gagttccatggcgtgcggctgccgcccttctgccgcgggaagatggaggagatcgccaac
ttcccggtgcggcccagcgacgtgtggatcgtcacctaccccaagtccgggagggagcac
gcagggcggggagatgaggagccagtggcagaaactggagccgccgatatgaagggacca
cagtgggttaagctgtgcccgtttcttctgggaggtgtggagccggtcacagtgcctgct
cccctgggccgtagcagggaggagagggcctgtcccgtgcatcccagggtgtgcactgca
ggcaccagcttgctgcaggaggtggtctacttggtgagccagggcgctgaccccgatgag
atcggcttgatgaacatcgacgagcagctcccggtcctggagtacccacagccgggcctg
gacatcatcaaggaactgacctctccccgcctcatcaagagccacctgccctaccgcttt
ctgccctctgacctccacaatggagactccaaggtcatctatatggctcgcaaccccaag
gatctggtggtgtcttattatcagttccaccgctctctgcggaccatgagctaccgaggc
acctttcaagaattctgccggaggtttatgaatgataagcaaccggagccgtctgttgtc
gtggacggcctggcaggtgctgggtgcttccccgacacggtctccagttctgacagtgac
catgtaggtcaggcccgcggtcactttacagatgaagacactgaggccgagggtcgtgaa
cgtggccgagatcctagagctgtgggctacggctcctggtttgagcacgtgcaggagttc
tgggagcaccgcatggactcgaacgtgctttttctcaagtatgaagacatgcatcgggac
ctggtgacgatggtggagcagctggccagattcctgggggtgtcctgtgacaaggcccag
ctggaagccctgacggagcactgccaccagctggtggaccagtgctgcaacgctgaggcc
ctgcccgtgggccggggaagagttgggctgtggaaggacatcttcaccgtctccatgaat
gagaagtttgacttggtgtataaacagaagatgggaaagtgtgacctcacgtttgacttt
tatttataa

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_4|429_aa
MGFLEEEGRWNLSFSGAGYLGAHHVGATECLRQRAPRLLQGARRIYGSSSGALNAVSIVC
GKSVDFCCSHLLGMVGQLERLSLSILHPAYAPIEHVKQQLQDALPPDAHVLASQRLGISL
TRWPDGRNFLVTDFATCDELIQALVCTLYFPFYCGLIPPEFRGERYIDGALSNNLPFADC
PSTITVSPFHGTVDICPQSTSPNLHELNVFNFSFQISTENFFLGLICLIPPSLEVVADNC
RQGYLDALRFLERRGLTKEPVLWTLVSKEPPAPADGNWDAGCDQRWKGGLSLNWKVPHVQ
VKDVPNFEQLSPELEAALKKACTRDPSRWARFWHSGPGQVLTYLLLPCTLPFEYIYFRSR
RLVVWLPDVPADLWWMQGLLRNMALEVFSRTKAQLLGPISPPATRVLETSPLQPQIAPHR
EELGPTHQA

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_4|1290_bp
atgggcttcttagaggaggagggcagatggaacctgtccttctccggcgccggctacctg
ggcgcccaccacgtgggcgccaccgaatgcctgcgccagcgagccccgcgcctcctccag
ggcgcccgccgcatctacggttcctcgtctggggcgctcaacgcagtcagcatcgtctgc
ggcaagtcggtcgacttctgctgctcccacctcctgggcatggttgggcagttggagcgg
ctgagcctaagcatcctgcacccggcctacgcgcccatcgagcacgtcaagcagcagctg
caggatgctctgccccccgacgcccacgtcctggcctcccagcggctgggcatttcgctg
acccgctggcctgacggacgcaacttcttggtcactgacttcgccacctgcgatgagctc
atccaggccttggtctgcaccttatactttcctttctactgcgggctgatcccccccgag
ttcagaggggagcgctacatcgatggggctctgagcaacaacttgccctttgcagactgc
ccctccaccatcacggtgtcgcccttccatgggacagtggacatctgcccccagagcacc
tcccccaacctgcatgagctgaacgtcttcaacttcagcttccaaatctccactgagaac
ttcttcctggggctcatatgtctcataccccccagcctcgaggtagtggccgacaactgc
agacaaggctacctggatgccctgaggttcctggagagacgtggactcaccaaggaacca
gtgctatggacgctggtgtctaaggaacccccagccccggctgacggaaactgggatgct
ggctgtgaccaacgctggaaggggggcctgtctctcaactggaaagtgccccatgtgcaa
gtcaaggatgtacccaactttgagcagctctcaccagagctggaggctgcactgaagaaa
gcatgtacgagggatcccagccggtgggcccgcttctggcactcggggcctggacaggtg
ctgacgtacctgctgctaccctgcacactgcccttcgagtacatctacttccgcagcaga
aggttggtggtgtggctgcccgatgtgccggcggacttgtggtggatgcagggcctgctg
aggaacatggccctcgaggttttctccaggaccaaggcccagctccttgggcccatcagc
cctccggccactcgcgtcctggaaacaagccccctccaaccccagatagctcctcataga
gaggagctcgggcccacccaccaggcctga

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_5|118_aa
MVLGKNSGNDSLNISCACAPLGKSYPGQQSKTPSEEEERRKKEKEEERRRRKKKKEEEEG
RRRKKKEEEEEEEEEEEEEEEEEEEEEGRRRKKKKRKRKKKKKKKKKKKKKKKKKKKK

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_5|357_bp
atggttttaggaaagaactcaggaaacgactcccttaacattagctgtgcctgtgcaccc
cttggaaaatcctaccctgggcaacagagcaagactccatcagaagaagaagaaagaaga
aagaaggagaaagaagaagaaagaagaagaagaaagaagaagaaagaagaagaagaagga
agaagaaggaagaagaaggaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaagaagaagaaggaagaagaaggaagaagaagaagaggaagaggaagaag
aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagtag

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_6|661_aa
MWKLNPQCNGIGSVRTQQVGPHQTPDAGALIRVRTQQEGPHQTPDAGALIRVRAQQEGPH
QTPDSGALIHVRAQQEGPHQTPDAGALIRVRAQQEGPHQTPDASALILDFPASRLTRTVG
GKARQKQGLQQNGEAQLNMNHPTNPGLTEKGEGFLVHSSWGRYGPADQGPPVRSSGPRSN
LAPCTAVGLLHPVPGVGQDPNPRPRPAAAAMYDAERGWSLSFAGCGFLGFYHVGATRCLS
EHAPHLLRDARMLFGASAGALHCVGVLSGIPLEQTLQVLSDLVRKARSRNIGIFHPSFNL
SKFLRQGLCKCLPANVHQLISGKIGISLTRVSDGENVLVSDFRSKDEVVDALVCSCFIPF
YSGLIPPSFRGVRYVDGGVSDNVPFIDAKTTITVSPFYGEYDICPKVKSTNFLHVDITKL
SLRLCTGNLYLLSRAFVPPDLKVLGEICLRGYLDAFRFLEEKGICNRPQPGLKSSSEGMD
PEVAMPSWANMSLDSSPESAALAVRLEGDELLDHLRLSILPWDESILDTLSPRLATDLSL
LTGDMASRYARRCPVVAVGDLTGVHSSADVSAPRLQTTVRLAMGVFPDVGVTARPRSPLM
ALSRHNRRLALLSSGHFSLRPLSSYLCMAWFFLGYDCRTLRHPISILDTELVTKAQLQLL
C

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_6|1986_bp
atgtggaaactcaatccccaatgcaatggtattgggagtgtgaggacacagcaagtgggc
cctcaccagacaccagatgctggtgccttgatccgtgtgaggacacagcaagagggccct
caccagacaccagatgctggcgccttgatccgtgtgagggcacagcaagagggccctcac
cagacaccagattctggtgccttgatccatgtgagggcgcagcaagagggccctcaccag
acaccagatgctggcgccttgatccgtgtgagggcgcagcaagaaggccctcaccagaca
ccagatgctagtgccttgatcctggactttccagcatccagactaaccaggacagttgga
gggaaggcaaggcaaaagcaaggactgcagcagaatggggaagctcagttgaacatgaac
cacccaaccaatccaggtctcacggagaagggggagggcttcctggtccactccagctgg
gggcgctatggcccggcagaccagggtcctccagtgaggagctctggaccaaggtctaac
ttggctccatgtactgctgtgggcttgctccacccagtgccaggggtaggtcaagatcct
aacccgcgcccccgccccgccgccgccgccatgtacgacgcagagcgcggctggagcttg
tccttcgcgggctgcggcttcctgggcttctaccacgtcggggcgacccgctgcctgagc
gagcacgccccgcacctcctccgcgacgcgcgcatgttgttcggcgcttcggccggggcg
ttgcactgcgtcggcgtcctctccggtatcccgctggagcagactctgcaggtcctctca
gatcttgtgcggaaggccaggagtcggaacattggcatcttccatccatccttcaactta
agcaagttcctccgacagggtctctgcaaatgcctcccggccaatgtccaccagctcatc
tccggcaaaataggcatctctcttaccagagtgtctgatggggaaaacgttctggtgtct
gactttcggtccaaagacgaagtcgtggatgccttggtatgttcctgcttcatccccttc
tacagtggccttatccctccttccttcagaggcgtgcgatatgtggatggaggagtgagt
gacaacgtacccttcattgatgccaaaacaaccatcaccgtgtcccccttctatggggag
tacgacatctgccctaaagtcaagtccacgaactttcttcatgtggacatcaccaagctc
agtctacgcctctgcacagggaacctctaccttctctcgagagcttttgtccccccggat
ctcaaggtgctgggagagatatgccttcgaggatatttggatgcattcaggttcttggaa
gagaagggcatctgcaacaggccccagccaggcctgaagtcatcctcagaagggatggat
cctgaggtcgccatgcccagctgggcaaacatgagtctggattcttccccggagtcggct
gccttggctgtgaggctggagggagatgagctgctagaccacctgcgtctcagcatcctg
ccctgggatgagagcatcctggacaccctctcgcccaggctcgctacagatttatcactt
ctcactggtgacatggcttccagatatgcccgacgatgtcctgtggttgcagtgggtgac
ctcacaggtgttcactcgagtgctgatgtgtctgctccccgcctccagaccacggtccgc
ctggcaatgggcgtcttcccagatgttggtgtcaccgctagaccaaggagccctctgatg
gccctgtccaggcataacagaaggctcgcactcctgtcttctggtcacttctcgctacgt
cccctcagctcctatctctgtatggcctggtttttcctaggttatgattgtagaactctc
aggcaccccatctctatcctggacaccgagcttgtcaccaaggcccagctgcagctgctg
tgctga

>gi568815576r:43726004_43962382|GENSCAN_predicted_peptide_7|92_aa
MVPSPAAASQSCPGGTERLLLRAEGGSAVAELQVKATPTNPRTRNTGGGHDSPPSPRPLA
RDAFAHWSLCACDPDVHTAAEDVLTAQAKSVH

>gi568815576r:43726004_43962382|GENSCAN_predicted_CDS_7|279_bp
atggttccctctccagccgctgcctcgcagagctgcccgggcgggacagagcgtctgctg
ctgagggcagaaggcgggtccgcggtggcggagctgcaggtcaaggcaactcccacaaac
ccccggacgcggaacactgggggcggccatgattccccgccttctccgcgccccctggct
cgcgatgccttcgctcattggtccttatgcgcctgtgacccggatgttcacacggccgcg
gaggacgttctgactgcgcaagcgaagagcgttcactga