GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:15:34 Sequence gi568815596r:68845371_69050755 : 205385 bp : 40.22% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2515 2656 142 0 1 54 79 95 0.088 4.21 1.02 Intr + 10359 10467 109 1 1 39 57 93 0.064 -0.28 1.03 Intr + 12318 12489 172 0 1 107 66 46 0.323 3.32 1.04 Term + 15779 15880 102 1 0 72 34 153 0.343 5.60 1.05 PlyA + 17048 17053 6 1.05 2.03 PlyA - 18906 18901 6 1.05 2.02 Term - 21201 20261 941 1 2 87 43 1005 0.875 87.17 2.01 Init - 25988 25655 334 2 1 89 115 411 0.998 39.40 2.00 Prom - 32881 32842 40 -5.75 3.03 PlyA - 35098 35093 6 1.05 3.02 Term - 46955 46795 161 0 2 64 31 131 0.742 2.12 3.01 Init - 50594 50198 397 2 1 78 92 207 0.491 17.01 3.00 Prom - 75529 75490 40 -4.65 4.00 Prom + 76925 76964 40 -4.65 4.01 Init + 79238 79294 57 1 0 95 111 75 0.450 11.86 4.02 Intr + 80315 80456 142 1 1 88 93 98 0.989 9.31 4.03 Intr + 81370 81444 75 2 0 93 81 32 0.729 1.57 4.04 Intr + 83319 83485 167 1 2 97 94 40 0.581 4.26 4.05 Term + 86902 87132 231 0 0 103 40 129 0.413 5.09 4.06 PlyA + 88504 88509 6 -0.45 5.05 PlyA - 88896 88891 6 1.05 5.04 Term - 90648 90463 186 0 0 83 50 111 0.904 3.31 5.03 Intr - 91680 91659 22 2 1 105 98 37 0.894 3.03 5.02 Intr - 94907 94736 172 0 1 78 46 47 0.015 -2.42 5.01 Init - 98837 98426 412 2 1 79 81 333 0.910 28.42 5.00 Prom - 99647 99608 40 -8.65 6.06 PlyA - 99788 99783 6 -0.45 6.05 Term - 100080 99998 83 1 2 99 45 47 0.771 -1.62 6.04 Intr - 101090 100934 157 2 1 44 94 100 0.817 4.86 6.03 Intr - 101887 101777 111 1 0 62 91 108 0.976 8.16 6.02 Intr - 104893 104756 138 1 0 85 77 105 0.980 8.84 6.01 Init - 113731 113705 27 1 0 98 73 19 0.254 1.13 6.00 Prom - 113895 113856 40 -3.85 7.00 Prom + 118202 118241 40 -6.75 7.01 Init + 118407 118469 63 2 0 56 75 72 0.180 3.90 7.02 Intr + 128486 128653 168 1 0 53 77 165 0.515 11.12 7.03 Intr + 132267 132404 138 2 0 60 51 208 0.823 14.04 7.04 Intr + 133501 133611 111 0 0 87 93 42 0.896 4.26 7.05 Intr + 134543 134690 148 1 1 76 113 129 0.895 13.09 7.06 Term + 136373 136380 8 0 2 98 50 0 0.210 -5.75 7.07 PlyA + 136439 136444 6 1.05 8.06 PlyA - 136501 136496 6 1.05 8.05 Term - 140009 139850 160 1 1 21 44 170 0.445 2.33 8.04 Intr - 143681 143530 152 2 2 9 62 158 0.235 3.34 8.03 Intr - 150170 150047 124 1 1 67 105 77 0.609 6.97 8.02 Intr - 165922 165747 176 1 2 45 64 116 0.102 2.72 8.01 Init - 168384 167617 768 0 0 60 26 293 0.137 13.04 8.00 Prom - 170205 170166 40 -6.95 9.00 Prom + 174570 174609 40 -4.55 9.01 Sngl + 177869 178225 357 1 0 41 42 585 0.997 45.01 9.02 PlyA + 178717 178722 6 1.05 10.03 PlyA - 184639 184634 6 1.05 10.02 Term - 187321 187115 207 1 0 45 41 198 0.660 7.16 10.01 Intr - 201092 201064 29 0 2 114 81 18 0.150 0.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 91969 91875 95 1 2 75 14 108 0.872 2.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_1|174_aa PQAFILTQTSINFPDGLSTSNSSEQGNRTTLVNNDPGGRRWTKNLSDMIDNDIMAPSDKT AWRHKGTLAQRSLAGLQVLWALQEILSELLTYGTHTLRKGRKRKHLYCREKTLKDVVVKN SSKEGRKARDRKKPQGRDGSMSSDNYTNVDMDNEQVLPQNMTVHQLIRCVCGVR >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_1|525_bp cctcaagcatttatattgactcagaccagtatcaacttccctgatggactttctacctcc aatagctctgaacaaggaaacaggacaactctggtaaacaatgatcctggtggaaggagg tggacaaaaaacctgagtgacatgatagataacgacatcatggcaccatccgataagact gcctggaggcataaggggacattagcacagaggagcctggctggtctgcaggttctctgg gctttgcaggaaattctcagtgagctcttaacctatgggacccatacattgagaaagggc aggaagaggaagcacctgtattgcagggagaaaactcttaaagatgtagtagtgaagaac agctccaaagagggaaggaaagcaagagacaggaaaaagccccaggggagggatggaagt atgtctagtgataactatacaaatgtggacatggataatgaacaggttttaccccagaac atgaccgtgcatcagctgattcggtgtgtctgtggtgttcgctga >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_2|424_aa MGSLVLTLCALFCLAAYLVSGSPIMNLEQSPLEEDMSLFGDVFSEQDGVDFNTLLQSMKD EFLKTLNLSDIPTQDSAKVDPPEYMLELYNKFATDRTSMPSANIIRSFKNEDLFSQPVSF NGLRKYPLLFNVSIPHHEEVIMAELRLYTLVQRDRMIYDGVDRKITIFEVLESKGDNEGE RNMLVLVSGEIYGTNSEWETFDVTDAIRRWQKSGSSTHQLEVHIESKHDEAEDASSGRLE IDTSAQNKHNPLLIVFSDDQSSDKERKEELNEMISHEQLPELDNLGLDSFSSGPGEEALL QMRSNIIYDSTARIRRNAKGNYCKRTPLYIDFKEIGWDSWIIAPPGYEAYECRGVCNYPL AEHLTPTKHAIIQALVHLKNSQKASKACCVPTKLEPISILYLDKGVVTYKFKYEGMAVSE CGCR >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_2|1275_bp atgggctctctggtcctgacactgtgcgctcttttctgcctggcagcttacttggtttct ggcagccccatcatgaacctagagcagtctcctctggaagaagatatgtccctctttggt gatgttttctcagagcaagacggtgtcgactttaacacactgctccagagcatgaaggat gagtttcttaagacactaaacctctctgacatccccacgcaggattcagccaaggtggac ccaccagagtacatgttggaactctacaacaaatttgcaacagatcggacctccatgccc tctgccaacatcattaggagtttcaagaatgaagatctgttttcccagccggtcagtttt aatgggctccgaaaataccccctcctcttcaatgtgtccattcctcaccatgaagaggtc atcatggctgaacttaggctatacacactggtgcaaagggatcgtatgatatacgatgga gtagaccggaaaattaccatttttgaagtgctggagagcaaaggggataatgagggagaa agaaacatgctggtcttggtgtctggggagatatatggaaccaacagtgagtgggagact tttgatgtcacagatgccatcagacgttggcaaaagtcaggctcatccacccaccagctg gaggtccacattgagagcaaacacgatgaagctgaggatgccagcagtggacggctagaa atagataccagtgcccagaataagcataaccctttgctcatcgtgttttctgatgaccaa agcagtgacaaggagaggaaggaggaactgaatgaaatgatttcccatgagcaacttcca gagctggacaacttgggcctggatagcttttccagtggacctggggaagaggctttgttg cagatgagatcaaacatcatctatgactccactgcccgaatcagaaggaacgccaaagga aactactgtaagaggaccccgctctacatcgacttcaaggagattgggtgggactcctgg atcatcgctccgcctggatacgaagcctatgaatgccgtggtgtttgtaactaccccctg gcagagcatctcacacccacaaagcatgcaattatccaggccttggtccacctcaagaat tcccagaaagcttccaaagcctgctgtgtgcccacaaagctagagcccatctccatcctc tatttagacaaaggcgtcgtcacctacaagtttaaatacgaaggcatggccgtctccgaa tgtggctgtagatag >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_3|185_aa MDPNQEEIPDLPEKEFRRLVIKLIREGPEKGKAQCKEIQKMIQEVKEEIFKEIDSLKKKQ KVQETLDTLLEMQNALESLSNRTEQVEERNSELKDKVFRLIQSNKDKEKTIRKYEQLGMV AHACNPSTLGGRVLEVLARAVRQEKETKGIQIRKEEVKLSLFADDVIIYLGNPKDSSRKL LELIK >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_3|558_bp atggatccaaaccaagaagaaatccctgatttacctgaaaaagaattcaggaggttagtt attaagctaatcagggagggaccagagaaaggtaaagcccaatgcaaggaaatccaaaaa atgatacaagaagtgaaggaagaaatattcaaggaaatagatagcttaaagaaaaaacaa aaagttcaggaaacgttggacacacttttagaaatgcaaaatgctctggaaagtctcagc aatagaactgaacaagtagaagaaagaaattcagagctcaaagacaaggtcttcagatta atccaatccaacaaagacaaagaaaaaacaataagaaaatatgaacagctgggcatggtg gctcatgcctgtaatccaagcactttgggaggtcgagtactggaagtcttagccagagca gtcagacaagagaaagaaacaaagggcatccaaatccgtaaagaggaagttaaactgtca ctgtttgctgacgacgtgatcatttaccttggaaaccctaaggactcctccagaaagctc ctagaattgataaaataa >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_4|223_aa MDKAVFPSLDDISKALDKQAFKYYPSTRGLTYTVLPSWVKNLAQYGKPIKNMCRDDPTYF AQQQKEGTALAIDSNSCFEIQLLSFMGLFICDTRFPAGTGFLPEMPGSRPAALPMCIITL GCHWLGCDSSLLRKSQHTPLDLRGLVKSREEPAVSKLRTVGMGELARQPQEAAAQCSFSG YLLTLCSQFPFQEIRSSAPYPAWNVLLLLVLLSAWGGETHPIT >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_4|672_bp atggacaaggcagtcttcccaagtttggatgacattagcaaggccctggacaagcaggct tttaagtattacccgtctactcgcggcctgacttacactgttttacccagctgggtcaag aacttagcacagtatggaaagcccattaagaacatgtgcagagatgaccccacctacttt gcccagcagcaaaaagaaggtactgccctggcaattgactccaattcttgttttgaaatc caacttctgtcctttatgggactcttcatctgcgatactcgctttccagctgggactggt ttccttccagaaatgccaggttcaaggcctgctgccctcccgatgtgcatcatcacactt ggctgtcattggctgggctgtgatagctccctcctgagaaagagccagcacacccctcta gaccttagaggacttgtcaagtctagggaagagccagctgtgtccaaactcaggaccgtg ggtatgggtgagctggctcgccagccacaggaggcagctgcccagtgttcattttctggc tacctgctcactctgtgctctcagtttcctttccaagagatcaggagcagtgccccttat cctgcctggaatgtgctgcttctccttgtcctgctgtctgcttggggcggagagacccac cccataacataa >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_5|263_aa MGALLHLSAFRQRGSCLNPPVHTDPNPKGPGEGAVSRSGRPRAVAPECGTGPSASAEEEP ILSENSQRGVSRTGSWAGVEPRRASLHRAGPVSVGPVSLLDLCLKEPNDLEHLTTTESHG HSASDQGSLPSRPRRRPELEKNYSKIHMEQKRAQTAKAILSKQNKTNEQTKNKAEDITLP DFKLHYKATVTKTAGSFAGSGQGINPSAKGDSKCNAACLTTGYTGSDLTSDSPDFNLLCR VLDPQSLSAQGFKVWSNHSPAKQ >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_5|792_bp atgggtgctttgctccacttgagtgcattccggcagcgtgggagctgtttgaatccccca gtgcacacagatcccaaccccaagggtccaggggagggagctgtgagcagatccggacgt cccagggctgtggctccggagtgcggaactgggcccagtgcttcagcagaagaggagccc atactctcagaaaactctcagagaggggtgagtcgcacaggttcctgggctggtgtggaa cctaggcgtgcctccctccacagagctggtccagtaagtgtggggcctgtctccctgctg gacctctgcctgaaggagcccaacgacctggaacacctaacaacaacagaaagtcacggc cacagtgccagtgatcaggggtccctcccctcaagaccgaggaggagacctgaattagaa aaaaactattctaaaattcatatggaacaaaaaagagcccaaacagccaaagcaatccta agcaaacaaaacaaaacaaatgaacaaacaaaaaacaaagctgaagacatcacactacca gacttcaaactacactacaaagctacagtaaccaaaacggcagggtcctttgctggcagt ggacagggcatcaatccttcagcaaagggagacagcaaatgcaatgctgcctgtctgacc acgggatatactggttcagatctcactagtgatagcccagacttcaacttgctttgcagg gtattagatccccaatcccttagtgcccagggattcaaagtttggagtaatcacagccct gccaaacagtag >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_6|171_aa MSIICFELYVFNIISPSNNGGNVQETVTIDNEKNTAIINIHAGSCSSTTIFDYKHGYIAS RVLSRRACFILKMDHQNIPPLNNLQWYIYEKQALDNMFSSKYTWVKYNPLESLIKDVDWF LLGSPIEKLCKHIPLYKGEVVENTHNVGAGGCAKAGLLGILGISICADIHV >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_6|516_bp atgagcatcatctgctttgaactatatgtttttaacatcatcagcccaagcaacaatggt ggcaatgttcaggagacagtgacaattgataatgaaaaaaataccgccatcattaacatc catgcaggatcatgctcttctaccacaatttttgactataaacatggctacattgcatcc agggtgctctcccgaagagcctgctttatcctgaagatggaccatcagaacatccctcct ctgaacaatctccaatggtacatctatgagaaacaggctctggacaacatgttctccagc aaatacacctgggtcaagtacaaccctctggagtctctgatcaaagacgtggattggttc ctgcttgggtcacccattgagaaactctgcaaacatatccctttgtataagggggaagtg gttgaaaacacacataatgtcggtgctggaggctgtgcaaaggctgggctcctgggcatc ttgggaatttcaatctgtgcagacattcatgtttag >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_7|211_aa MTTMLRALTEKVDNMQEDMGNAASEISKRDRVMSIYALQKDNLRVNEENALEYKKLVTRV INCMTVVKVQVRGAVGKNINVNDDNNNAGSGQQSVSVNNEHNVANVDNNNGWDSWNSIWD YGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVNP NKVDDLSKFGKNIANMCRGIPTYMAEEMQVP >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_7|636_bp atgactacaatgctaagagcactaacggaaaaagtagacaacatgcaagaggacatgggt aatgctgccagcgaaatttccaagcgtgatagagtcatgtctatctatgcacttcagaaa gacaacctcagggttaatgaagaaaatgcattggaatataagaaactggtgaccagagtg atcaattgcatgactgttgtgaaagtccaggtgaggggagctgtgggcaagaatatcaac gtcaatgatgacaacaacaatgctggaagtgggcagcagtcagtgagtgtcaacaatgaa cacaatgtggccaatgttgacaataacaacggatgggactcctggaattccatctgggat tatggaaatggctttgctgcaaccagactctttcaaaagaagacatgcattgtgcacaaa atgaacaaggaagtcatgccctccattcaatcccttgatgcactggtcaaggaaaagaag cttcagggtaagggaccaggaggaccacctcccaagggcctgatgtactcagtcaaccca aacaaagtcgatgacctgagcaagttcggaaaaaacattgcaaacatgtgtcgtgggatt ccaacatacatggctgaggagatgcaagtgccatag >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_8|459_aa MRSCPQATLPTSPARGESVFARLAWGGGDNSRHLLVQNEVQVKSAVAGWTPILPASPLPG ADEHQSGQREPLEADAEGSPLRRGHGPQPGESRVRSFPRSANSPRPSGTRHPRALPRGSF IPPRSLATPRKQLSGGVQGFPSGFRSDAVSGFRVSRIPILSSASFKGEGCLGPGPPGSSR GRAELLQWKEFRPDGSLSQILNMMGGGGGFKLFLVGAVLCLLSFLSAPLSRSYGEESSKA ATLWTKAAKARVGPAGLDDLKDSEEIYRNQMGKGKQKHWWELVREEKGYDNVMERVFMGL ARNRKELECGKRVEGHLPHFEVKGVGEGAKLMLKAFLKGSDSPGKGRQDQHEEGTLGTAA AAIQLILAAIEESSRLSETCMVLASCRTESDQEGPSDKNILPNIQTCYRFMDDDRRHETS GSETKDFITRVTASSVSMSMFVSVLVAYKSHRSDDVDQD >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_8|1380_bp atgcgctcctgtccccaggcgaccctgccaacgagcccggcccggggcgaaagcgttttc gcccgcttagcctggggtgggggggacaactcgcggcacttacttgtccaaaatgaagta caggtcaaatccgccgtagcaggctggacccccatcctccctgcgtcccccttgcccggc gcagatgagcaccagagtggccaaagagagccactggaagccgatgccgagggctctccg ctccgccgtggccatggcccgcagcccggggagagcagggtccgctccttcccacgctcc gcgaactcgccacgaccctcagggacgcgccatccgcgggcccttcctcgcgggtccttt attccccctcgctccctcgcaactccccggaagcaattgtctggaggagttcaaggtttc ccctctggtttccgctccgatgctgtttctgggtttcgggtttcaaggatcccaatcctg tcctccgcttcttttaaaggggagggctgcctcggtccggggccgccgggcagctcccga ggccgcgccgagcttttgcaatggaaggagttccgtcccgacggttctttgtcccagatt ttaaatatgatgggggggggggggggtttcaaattgttcttggtgggggcagtgctttgc cttctctccttcctctctgcccccctttcgcgttcttacggggaagaaagttcaaaggct gccaccttgtggacaaaggcggcgaaagcacgcgtgggacccgcgggactcgatgacttg aaagacagtgaggaaatctatagaaatcagatgggtaagggaaagcagaaacactggtgg gagctggtaagagaagaaaagggatatgacaatgtgatggagagagtgtttatgggattg gcaaggaataggaaagagcttgaatgtggaaagcgggtggagggacaccttccccacttt gaagtgaaaggggttggagagggagctaagctaatgctaaaagcctttttaaagggctct gacagccctggaaaggggagacaggaccaacacgaagagggaactctggggacggcagca gctgcaattcagcttattctggcagccatcgaggagtcttccaggctcagtgagacatgt atggttctggcttcatgtagaactgagagtgaccaggaaggcccttcagacaaaaatata ctcccaaatatacagacttgctatcgtttcatggatgatgacagaagacatgagacttct ggatcagagacaaaggactttattactcgtgtcacagcaagcagtgtgagcatgagcatg tttgtgtcagtgctcgttgcctataagtctcatagaagtgatgatgtagaccaagattga >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_9|118_aa MSAETIDGDGDGYGGGGGRDDKDDGDGSGDGDSDSGDYGNVDSCDEDDDGDKDNGDGGGG DSDSDDDNGSSDDSDGGGSDDGDGDMVVVVEVEMMMMMMMMAVVMVVVVLVQKQNMVL >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_9|357_bp atgagtgctgagacgatagatggtgatggtgatggttatggtggtggaggtggacgtgat gataaagatgatggtgatggtagtggtgatggtgacagtgatagtggtgattatggcaat gttgatagttgtgatgaagatgatgatggtgataaggataatggtgatggtggtggtggt gatagtgatagtgatgatgataatggtagtagtgatgatagtgatggtggtggtagtgat gatggtgatggtgatatggtggtggtggtggaggtggagatgatgatgatgatgatgatg atggcagtggtcatggtggtggtggtgttagtgcagaaacagaatatggtcctctaa >gi568815596r:68845371_69050755|GENSCAN_predicted_peptide_10|78_aa XQETSSSDVKIQHKSTVVKNTDNGLVTQKVFPIIEMHSCGSIPHTRQIHRDEATVATLHP PERSKGDGSTSSGESGCG >gi568815596r:68845371_69050755|GENSCAN_predicted_CDS_10|237_bp naacaggaaactagcagtagtgacgttaagattcaacacaaaagcactgttgttaagaat acagacaatgggttagtaacccaaaaggtatttcccataattgaaatgcacagctgcggc tccattcctcacactcgccagattcacagagatgaagccacagtagccactttgcatccc ccagaaagatcaaagggggatgggtcaacatcgtcgggtgagagtggctgtggatga