GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:00:32

Sequence gi568815578f:32725928_32948725 : 222798 bp : 47.39% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   2068   1966  103  1  1  106  101    34 0.147   6.25
 1.02 Intr -  17619  17381  239  1  2   22   80   243 0.320  14.23
 1.01 Init -  24550  24403  148  1  1   97   73   111 0.784  10.75
 1.00 Prom -  31154  31115   40                              -3.66

 2.00 Prom +  31825  31864   40                              -9.85
 2.01 Init +  35360  35436   77  1  2   81   78     3 0.459  -0.83
 2.02 Intr +  36502  36772  271  1  1   57   99   220 0.196  17.54
 2.03 Intr +  54391  54538  148  0  1  123   24   238 0.792  20.71
 2.04 Intr +  55426  55487   62  2  2   96   80    22 0.761   0.65
 2.05 Intr +  58831  58932  102  0  0   52   87   108 0.976   7.47
 2.06 Intr +  60575  60700  126  1  0  122    7    87 0.938   4.88
 2.07 Intr +  61303  61524  222  0  0  104   80   138 0.949  12.92
 2.08 Intr +  62927  63085  159  1  0   84  105   134 0.999  14.88
 2.09 Intr +  65674  65781  108  0  0  100   77   121 0.999  12.68
 2.10 Intr +  66684  66843  160  2  1   89   75   137 0.661  12.06
 2.11 Intr +  69482  69607  126  0  0   45   45   183 0.998  10.35
 2.12 Intr +  69723  69767   45  1  0  122   92    46 0.990   6.78
 2.13 Intr +  70863  70942   80  1  2  102  114   -24 0.973   0.87
 2.14 Intr +  71260  71372  113  0  2   64  105   132 0.880  11.68
 2.15 Intr +  72533  72716  184  2  1   52   66   208 0.998  14.79
 2.16 Intr +  73317  73401   85  2  1  119  109   110 0.957  15.59
 2.17 Intr +  74226  74371  146  1  2  110   54   195 0.965  18.30
 2.18 Intr +  74908  74998   91  0  1  127   89    18 0.671   5.37
 2.19 Intr +  75220  75293   74  2  2  112   98    36 0.926   6.13
 2.20 Intr +  75410  75499   90  1  0   31   78   139 0.456   7.39
 2.21 Intr +  76458  76543   86  2  2  120   92    82 0.965  10.42
 2.22 Intr +  79411  79480   70  1  1   41  116    80 0.974   5.28
 2.23 Intr +  80282  80400  119  1  2   90  105    79 0.975   9.06
 2.24 Term +  81835  81976  142  1  1   89   32   184 0.993  10.30
 2.25 PlyA +  85782  85787    6                               1.05

 3.00 Prom +  86301  86340   40                              -3.26
 3.01 Init +  99547  99558   12  0  0   75  100    20 0.476   1.44
 3.02 Intr +  99998 100121  124  1  1   66  110    93 0.787   9.46
 3.03 Intr + 107790 107935  146  1  2  116  106   106 0.999  15.20
 3.04 Intr + 110707 110914  208  0  1   99   98   131 0.653  13.85
 3.05 Intr + 113808 113952  145  1  1   97   94   134 0.948  14.24
 3.06 Intr + 114042 114096   55  0  1   82   58    67 0.890   2.08
 3.07 Intr + 120691 120843  153  0  0   80   42   142 0.944   9.07
 3.08 Term + 122745 122801   57  2  0  105   42    94 0.942   4.19
 3.09 PlyA + 124454 124459    6                               1.05

 4.00 Prom + 126795 126834   40                              -4.96
 4.01 Init + 132513 132569   57  2  0   98   78     9 0.480   2.21
 4.02 Intr + 137611 137907  297  0  0   53  116    47 0.380   1.07
 4.03 Intr + 141655 141820  166  0  1   61   94   100 0.541   7.43
 4.04 Intr + 149999 150117  119  0  2   51   68   265 0.243  20.98
 4.05 Intr + 152777 152880  104  1  2   29   67   136 0.703   4.57
 4.06 Intr + 159578 159713  136  2  1  113   96   140 0.999  17.87
 4.07 Intr + 163374 163479  106  2  1   76   95    85 0.979   7.79
 4.08 Intr + 166286 166370   85  0  1  101  105   136 0.987  15.48
 4.09 Intr + 167247 167371  125  0  2  123   22   128 0.893   9.93
 4.10 Intr + 170527 170600   74  2  2   85  113    59 0.994   7.23
 4.11 Intr + 172566 172696  131  2  2   77  110   135 0.984  14.09
 4.12 Intr + 180635 180702   68  2  2   81  106     6 0.596   0.25
 4.13 Intr + 180916 181067  152  2  2   76   68   241 0.680  20.78
 4.14 Intr + 182348 182485  138  1  0   97   70   206 0.986  20.36
 4.15 Intr + 183894 184004  111  2  0  113  109    16 0.946   6.78
 4.16 Intr + 185553 185780  228  2  0  104   78   251 0.993  23.77
 4.17 Intr + 186867 186937   71  2  2   76   76   -20 0.235  -6.42
 4.18 Intr + 191374 191578  205  1  1   72   75   332 0.828  29.50
 4.19 Intr + 192435 192647  213  2  0   77   75   178 0.967  14.31
 4.20 Intr + 194151 194288  138  2  0  104   58    90 0.971   8.26
 4.21 Intr + 204471 204689  219  2  0  123  100   357 0.967  38.90
 4.22 Intr + 205251 205409  159  2  0   38   75   122 0.978   6.08
 4.23 Intr + 217709 217877  169  1  1   88   52   191 0.350  15.02
 4.24 Term + 219013 219023   11  2  2  135   38    -2 0.299  -2.14
 4.25 PlyA + 220740 220745    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 101068 101123   56  2  2   77   43    64 0.808  -1.48
S.002 Init + 106209 106248   40  2  1   83  108    29 0.917   4.81

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:32725928_32948725|GENSCAN_predicted_peptide_1|164_aa
MGDRCCHGNVLRLDCINVNILVMTLWDLCIEGNRVKATRNVSVSFLKTVSQGGRGRSALR
RPWRKCGCGCAERPAAVAAAAEEEAEQGGGGGGTCGAGARAMGRLHCTEDPVPEAVGGDM
QQLNQLGAQVERFLAQLSEFATTNQISLGSLRSIVKSLLLVPNX

>gi568815578f:32725928_32948725|GENSCAN_predicted_CDS_1|492_bp
atgggagatcgttgctgccatggaaatgttctgcgtcttgactgtatcaatgtcaatatc
ctggtgatgacactgtgggatctgtgcatcgagggaaaccgggtaaaggccacaaggaat
gtctctgtatcatttcttaaaactgtatcccagggcggccgggggaggagtgcgctgcga
cggccgtggcgaaagtgcggttgtggatgcgcggagaggccggcagcggtggcagcggca
gcggaggaggaagctgagcagggcggcggcggcggtggaacctgcggggctggggcgcgc
gccatgggccgcctgcactgcactgaggacccggtgccggaggccgtgggcggcgacatg
cagcagctgaaccagctgggcgcgcaggtggaaagatttctggctcagctctctgaattt
gccaccaccaatcagatcagtcttggctccctcagaagcatcgtgaaaagcctccttctg
gttccaaatgnn

>gi568815578f:32725928_32948725|GENSCAN_predicted_peptide_2|961_aa
MDRGDFRWSHREKPLHKVTFWEKTEGSCSRLRGRSPRGRSERPPTDGTGSLAVGRAGGNA
ARPAALGLSGPSKPSSAIGAGDSRAQRPARPPAGLPPASPDPRLRRPAAPQPALRQESMK
GDTRHLNGEEDAGGREDSILVNGACSDQSSDSPPILEAIRTPEIRGRRSSSRLSKREVSS
LLSYTQDLTGDGDGEDGDGSDTPVMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPR
STRGRQGRNHVDESPVEFPATRSLRRRATASAGTPWPSPPSSYLTIDLTDDTEDTHGTPQ
SSSTPYARLAQDSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKIKGFSWWPAM
VVSWKATSKRQAMSGMRWVQWFGDGKFSEVSADKLVALGLFSQHFNLATFNKLVSYRKAM
YHALETLPLQKARVRAGKTFPSSPGDSLEDQLKPMLEWAHGGFKPTGIEGLKPNNTQPEN
KTRRRTADDSATSDYCPAPKRLKTNCYNNGKDRGDEDQSREQMASDVANNKSSLEDGCLS
CGRKNPVSFHPLFEGGLCQTCRDRFLELFYMYDDDGYQSYCTVCCEGRELLLCSNTSCCR
CFCVECLEVLVGTGTAAEAKLQEPWSCYMCLPQRCHGVLRRRKDWNVRLQAFFTSDTGLE
YEAPKLYPAIPAARRRPIRVLSLFDGIATGYLVLKELGIKVGKYVASEVCEESIAVGTVK
HEGNIKYVNDVRNITKKNIEEWGPFDLVIGGSPCNDLSNVNPARKGLYGGHRLVGITLNC
QEAIGNLLVSGNKEGDDRPFFWMFENVVAMKVGDKRDISRFLECNPVMIDAIKVSAAHRA
RYFWGNLPGMNRPVIASKNDKLELQDCLEYNRIAKLKKVQTITTKSNSIKQGKNQLFPVV
MNGKEDVLWCTELERIFGFPVHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFAC
E

>gi568815578f:32725928_32948725|GENSCAN_predicted_CDS_2|2886_bp
atggacaggggagacttcaggtggagtcatcgggaaaagcctctccataaagtgaccttc
tgggagaaaaccgagggcagctgctcccggctccgcggccgcagcccgcgtggacgctcc
gagcgccccccgacggacgggaccggctccctggcggtcgggcgagcgggcggcaacgct
gcccggccggcagcgctggggttaagtggcccaagtaaacctagctcggcgatcggcgcc
ggagattcgcgagcccagcgccctgcacggccgccagccggcctcccgccagccagcccc
gacccgcggctccgccgcccagccgcgccccagccagccctgcggcaggaaagcatgaag
ggagacaccaggcatctcaatggagaggaggacgccggcgggagggaagactcgatcctc
gtcaacggggcctgcagcgaccagtcctccgactcgcccccaatcctggaggctatccgc
accccggagatcagaggccgaagatcaagctcgcgactctccaagagggaggtgtccagt
ctgctaagctacacacaggacttgacaggcgatggcgacggggaagatggggatggctct
gacaccccagtcatgccaaagctcttccgggaaaccaggactcgttcagaaagcccagct
gtccgaactcgaaataacaacagtgtctccagccgggagaggcacaggccttccccacgt
tccacccgaggccggcagggccgcaaccatgtggacgagtcccccgtggagttcccggct
accaggtccctgagacggcgggcaacagcatcggcaggaacgccatggccgtcccctccc
agctcttaccttaccatcgacctcacagacgacacagaggacacacatgggacgccccag
agcagcagtaccccctacgcccgcctagcccaggacagccagcaggggggcatggagtcc
ccgcaggtggaggcagacagtggagatggagacagttcagagtatcaggatgggaaggag
tttggaataggggacctcgtgtggggaaagatcaagggcttctcctggtggcccgccatg
gtggtgtcttggaaggccacctccaagcgacaggctatgtctggcatgcggtgggtccag
tggtttggcgatggcaagttctccgaggtctctgcagacaaactggtggcactggggctg
ttcagccagcactttaatttggccaccttcaataagctcgtctcctatcgaaaagccatg
taccatgctctggagactctgcctttgcagaaagctagggtgcgagctggcaagaccttc
cccagcagccctggagactcattggaggaccagctgaagcccatgttggagtgggcccac
gggggcttcaagcccactgggatcgagggcctcaaacccaacaacacgcaaccagagaac
aagactcgaagacgcacagctgacgactcagccacctctgactactgccccgcacccaag
cgcctcaagacaaattgctataacaacggcaaagaccgaggggatgaagatcagagccga
gaacaaatggcttcagatgttgccaacaacaagagcagcctggaagatggctgtttgtct
tgtggcaggaaaaaccccgtgtccttccaccctctctttgagggggggctctgtcagaca
tgccgggatcgcttccttgagctgttttacatgtatgatgacgatggctatcagtcttac
tgcactgtgtgctgcgagggccgagagctgctgctttgcagcaacacgagctgctgccgg
tgtttctgtgtggagtgcctggaggtgctggtgggcacaggcacagcggccgaggccaag
cttcaggagccctggagctgttacatgtgtctcccgcagcgctgtcatggcgtcctgcgg
cgccggaaggactggaacgtgcgcctgcaggccttcttcaccagtgacacggggcttgaa
tatgaagcccccaagctgtaccctgccattcccgcagcccgaaggcggcccattcgagtc
ctgtcattgtttgatggcatcgcgacaggctacctagtcctcaaagagttgggcataaag
gtaggaaagtacgtcgcttctgaagtgtgtgaggagtccattgctgttggaaccgtgaag
cacgaggggaatatcaaatacgtgaacgacgtgaggaacatcacaaagaaaaatattgaa
gaatggggcccatttgacttggtgattggcggaagcccatgcaacgatctctcaaatgtg
aatccagccaggaaaggcctgtatggtggacatagactggtaggcatcaccctgaactgt
caggaggccattgggaacctgctggtctcagggaataaggagggtgatgaccggccgttc
ttctggatgtttgagaatgttgtagccatgaaggttggcgacaagagggacatctcacgg
ttcctggagtgtaatccagtgatgattgatgccatcaaagtttctgctgctcacagggcc
cgatacttctggggcaacctacccgggatgaacaggcccgtgatagcatcaaagaatgat
aaactcgagctgcaggactgcttggaatacaataggatagccaagttaaagaaagtacag
acaataaccaccaagtcgaactcgatcaaacaggggaaaaaccaacttttccctgttgtc
atgaatggcaaagaagatgttttgtggtgcactgagctcgaaaggatctttggctttcct
gtgcactacacagacgtgtccaacatgggccgtggtgcccgccagaagctgctgggaagg
tcctggagcgtgcctgtcatccgacacctcttcgcccctctgaaggactactttgcatgt
gaatag

>gi568815578f:32725928_32948725|GENSCAN_predicted_peptide_3|299_aa
MGKKKMAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGS
IALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFF
DANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKA
GPGVVRKNPGVGNGDDEAAELMQQVGTPVFSTACDTCEPQVLCGEHERGDVNVLKLTVED
LEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEY

>gi568815578f:32725928_32948725|GENSCAN_predicted_CDS_3|900_bp
atggggaagaagaagatggcagtgaacgtatactcaacgtcagtgaccagtgataaccta
agtcgacatgacatgctggcctggatcaatgagtctctgcagttgaatctgacaaagatc
gaacagttgtgctcaggggctgcgtattgtcagtttatggacatgctgttccctggctcc
attgccttgaagaaagtgaaattccaagctaagctagaacacgagtacatccagaacttc
aaaatactacaagcaggttttaagagaatgggtgttgacaaaataattcctgtggacaaa
ttagtaaaaggaaagtttcaggacaattttgaattcgttcagtggttcaagaagtttttc
gatgcaaactatgatggaaaagactatgaccctgtggctgccagacaaggtcaagaaact
gcagtggctccttcccttgttgctccagctctgaataaaccgaagaaacctctcacttct
agcagtgcagctccccagaggcccatctcaacacagagaaccgctgcggctcctaaggct
ggccctggtgtggtgcgaaagaaccctggtgtgggcaacggagacgacgaggcagctgag
ttgatgcagcaggtgggcacccctgtgtttagcacagcatgtgacacgtgtgagcctcag
gtgctgtgcggggagcatgaacgtggagatgtcaacgtattgaaacttactgttgaagac
ttggagaaagagagggatttctacttcggaaagctacggaacattgaattgatttgccag
gagaacgagggggaaaacgaccctgtattgcagaggattgtagacattctgtatgccaca
gatgaaggctttgtgatacctgatgaagggggcccacaggaggagcaagaagagtattaa

>gi568815578f:32725928_32948725|GENSCAN_predicted_peptide_4|1093_aa
MDQDCSGCIKSRPALKSGFDNSSLSGEERLKCKLGKSFLLEKSLGKGMLIHCSLGVSMGK
GKPPSPLTLTSFPPFCDLAKSAFHVVLTTTGVKLTMIPYSRSRLMSSEDLAEIPQLQKLS
IPHGFQNKEAASSPTPSITLSQVPDLQPGSQLFTEIHLAKIEKMFEEDINSTGALGMDAF
IKAMKKVLSSVSDEMLKELFLKVDSDCEGFVTWQKYVDYMMREFQGKEDMRKSQYRLHFY
LPMTVVPLNHGCEVVKVVFLIHRFKKIGCFLTVTKDGILQFWSESFSLMSSFRLNQTQQL
YNQPMWVIDMVCLHNMNLVAVASTRQKIDFFDISDHKCVRAFTFVDLDSCALVMDYWSDY
HRGVFCYGDAKGNVIVFTSENMTSGLFNPRILPRASKWDHWIKVSLQKLLNEKSALHRSY
RLKALHPNWCEQVKFIPQMNVVVSCSAIEKSSLVLTILPAKASKKPRLSVLRLRKGILCF
DYCPDRNFLVTGGYDAFIRLWNPFVSKRPVWLMKGHQTSVTHILVDSRNNSILISVSKDK
NIRVWDMLDYICLQSFCGKFFALGNCPITSAYFFEKDNTLICSTYSIGILKGYLEAQGLI
KARKRTTHCSPLCAVLYSKIFKQVVSGCLRGTVSVWEVVTGRKTMEFAVSGGQHVEMTAM
ALDESERCLLTGLRDGTMKMWNYNIGKCLLTFPSPEQLEISGIIHMNKVFYVTGWSKRIT
HFLFHKTKPVLLCYHWQTYHTEDILSMAKYRNQFLGTSSYSGDILFWNTGTLKPIFNFNA
SRSPSPLQPKRVQDVNNCLAESHRPSRPYVEREKWTYKTSRKLSSLSPESVANTNLRRSL
VSAPPVMRCPRDKEPDRPVPQQKPSSASGTSRQSSKIHSKQSIYKEDETRKGEWQKNMLV
QSSASVEKIIFLQTRPRLPHTAALLSSCMDGYIYAWSLHENGGLLGKFPVDLDNGDVVVG
AMATDKNDWILITGDCKGYIKIWDIKDYCALIDKQPFQSSGAKVVSEAHNKFRLLIPQQL
GTNFPHYIPLEDKEVVAGHTISLVPPTLLMTWKGHLNSVADILYVDNFQLVISAGQDRDV
KAWKLSGDAIVLP

>gi568815578f:32725928_32948725|GENSCAN_predicted_CDS_4|3282_bp
atggaccaagattgcagtggctgcatcaagtctaggcctgccctgaagtctggatttgac
aactcctctctatcaggggaagaaaggctcaagtgcaagcttggcaaaagctttttgctg
gagaaatctctgggaaaaggaatgctcatccactgctcgctgggagtgtccatgggcaag
gggaagccaccctctcccttgaccttgacttcttttccacctttctgtgacctggctaag
tcagccttccacgtggtccttacaacgactggagttaagttgactatgattccctattct
aggtcaaggctaatgtcttctgaagacttagcagagatccctcaactccaaaagctgtcc
atcccacatggcttccagaacaaggaggctgctagctccccaacaccatccatcaccctt
agccaggtgcctgacctccagcctgggtcccagctgtttactgagatacacctggccaag
atagagaaaatgtttgaggaggacatcaactcgactggagccctgggcatggacgccttc
atcaaggccatgaagaaggttctgagcagtgtgtcggacgagatgctaaaggagctgttt
ttgaaggtggactcggactgtgaaggctttgtcacctggcaaaagtatgtggattacatg
atgcgtgagttccagggaaaagaggacatgcgaaagagccagtaccgcctgcacttctac
cttcccatgacggtcgtccccctgaaccatggctgtgaggtggtgaaggtggtgttttta
atccaccggttcaagaagatcgggtgtttcctgactgtcaccaaagacgggatcctgcag
ttctggtctgagtccttctcgctgatgagctcctttaggcttaaccagacccagcagctc
tacaaccagccgatgtgggtcattgacatggtatgtctgcacaatatgaacctcgttgca
gttgcgtctaccaggcaaaagatagatttctttgatattagtgaccacaaatgtgtccgg
gccttcacctttgttgatctggacagctgtgctctggtcatggactactggtctgactat
cacagaggtgtgttctgctatggagacgccaaaggcaacgtcattgtcttcacctccgaa
aacatgaccagtgggctgttcaacccccgtatcctccccagggcctccaagtgggatcac
tggatcaaagtttccttgcagaaactcttaaatgagaagtctgctttgcatagaagctac
cggctgaaggctctccatcccaactggtgtgagcaggtcaagttcatcccccagatgaat
gtggtagtctcctgttcagccatcgagaagtcctctctggtgctgacaatattgccagcc
aaagcctctaagaaacccaggttgtcagtgctgcgtttaaggaaagggattctttgcttt
gattactgcccagacaggaacttcctggtgactggtggctacgatgccttcatccgcctg
tggaacccctttgtctcaaagaggcccgtgtggctgatgaagggacaccagacctcagtg
acgcacatccttgtggatagcaggaacaacagcatcctcatcagtgtctccaaggacaag
aatattcgcgtgtgggacatgctggactacatatgcctccagtccttctgtgggaagttt
tttgctctgggaaactgccccatcaccagtgcctacttcttcgagaaggacaataccctc
atctgcagcacctactcaatcgggatcctaaaagggtacttagaggcccaggggcttatc
aaagcaaggaagaggaccactcattgctcacccctgtgtgctgtcctctacagcaagatc
tttaagcaggtggtgagtggctgcctgcgcggcacagtgagtgtgtgggaggtcgtgacg
ggcaggaagacgatggagtttgctgtgtctgggggccagcacgtggagatgaccgccatg
gccctggatgagtcagagcggtgcctgctcacaggtttgcgggatggcacaatgaagatg
tggaactacaacattggcaaatgcctgttgacctttcccagtccggaacagctggagatt
agtgggattatccatatgaacaaagtgttctatgtgacaggatggagtaagagaatcact
catttcctgttccacaagaccaagccagtgctcttgtgctaccactggcagacctaccac
acggaggacatcctgagcatggccaagtaccggaaccagttccttgggacctcctcctac
agtggggacatcctcttctggaacaccggcacactcaagcccatcttcaacttcaatgcc
tctaggagcccctcgcccttgcagcccaagagggtgcaagatgtgaacaactgcctggct
gagagccacaggcccagcagaccctatgtggagcgggagaagtggacatacaagacctcc
aggaagctctccagtctcagccccgagtctgtggccaataccaacctgaggcggagcctg
gtgtcggctcccccagtgatgcggtgcccgagagacaaggagccagacaggcctgtgccc
cagcagaaaccttccagtgcttctggcacatccaggcagtcaagcaagatccacagcaaa
cagtccatttacaaagaggatgaaacgagaaaaggagaatggcagaagaatatgttggtt
caatccagtgcctcggtggagaagataatcttcctgcagaccaggcctcgcctgccgcac
acggctgccctgctgagcagctgcatggacggctacatctacgcctggtccctccatgag
aatggaggcctgctggggaagttccctgtggacctagacaatggggatgttgtcgtgggt
gccatggccactgataaaaatgactggatcctcatcacgggggattgtaaaggatacatc
aagatctgggacatcaaggattactgcgcattgattgataaacagccattccaatccagt
ggggccaaggttgtctctgaagcacacaacaagttccggttgttaattcctcagcaactt
gggaccaacttcccacactacattcccttggaggataaagaggttgtggctggccatacc
atttccctggttccccccacgctcctgatgacctggaaaggccatttgaatagtgtggca
gacatcctgtatgtggacaacttccagctggttatcagcgctggccaggaccgggacgtc
aaggcttggaaactctccggtgatgccattgtactaccataa