GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:03:20 Sequence gi568815591f:5780890_5983999 : 203110 bp : 47.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 73 816 744 0 0 41 49 274 0.541 12.26 1.02 PlyA + 1303 1308 6 1.05 2.04 PlyA - 3561 3556 6 1.05 2.03 Term - 12977 12841 137 0 2 65 41 79 0.135 -0.92 2.02 Intr - 30268 30209 60 2 0 52 72 76 0.095 1.11 2.01 Init - 39829 39772 58 1 1 93 74 13 0.348 1.91 2.00 Prom - 40763 40724 40 -2.46 3.00 Prom + 40857 40896 40 -5.86 3.01 Init + 42227 42637 411 1 0 102 99 191 0.631 16.14 3.02 Intr + 52638 52679 42 2 0 111 109 21 0.042 4.94 3.03 Intr + 59060 59214 155 1 2 66 -2 267 0.420 14.37 3.04 Intr + 59801 59967 167 2 2 107 99 35 0.516 6.10 3.05 Term + 66191 66747 557 0 2 47 46 357 0.293 21.89 3.06 PlyA + 70074 70079 6 1.05 4.00 Prom + 98971 99010 40 -3.86 4.01 Init + 100001 100061 61 1 1 49 65 113 0.935 6.52 4.02 Intr + 101604 101736 133 1 1 65 71 215 0.849 17.20 4.03 Intr + 103001 103110 110 2 2 76 75 137 0.536 11.13 4.04 Term + 105175 105200 26 2 2 130 42 2 0.367 -1.81 4.05 PlyA + 105453 105458 6 1.05 5.00 Prom + 105704 105743 40 -7.16 5.01 Init + 117911 118030 120 1 0 89 85 173 0.999 15.55 5.02 Intr + 119966 120043 78 1 0 57 23 106 0.625 0.75 5.03 Intr + 120768 120815 48 2 0 128 121 82 0.997 14.38 5.04 Intr + 121772 121855 84 1 0 56 114 70 0.986 6.42 5.05 Intr + 124205 124380 176 1 2 99 82 41 0.637 3.34 5.06 Intr + 129146 129227 82 2 1 109 64 -12 0.032 -1.86 5.07 Intr + 137978 138015 38 1 2 123 67 49 0.340 3.36 5.08 Intr + 138874 139077 204 1 0 80 13 106 0.341 0.62 5.09 Intr + 142498 142656 159 1 0 70 60 188 0.884 13.30 5.10 Intr + 142946 143073 128 2 2 58 115 180 0.987 18.02 5.11 Term + 144743 144798 56 0 2 101 49 67 0.966 1.82 5.12 PlyA + 145061 145066 6 1.05 6.23 PlyA - 145279 145274 6 -1.75 6.22 Term - 145659 145479 181 2 1 86 44 268 0.995 19.28 6.21 Intr - 147503 147307 197 2 2 -5 88 268 0.326 15.71 6.20 Intr - 154259 154131 129 2 0 95 68 54 0.542 4.99 6.19 Intr - 157012 156869 144 1 0 93 92 -4 0.366 0.98 6.18 Intr - 157940 157833 108 2 0 87 68 52 0.338 3.58 6.17 Intr - 162583 162435 149 2 2 89 25 37 0.361 -2.55 6.16 Intr - 163101 163022 80 2 2 35 94 51 0.549 -0.41 6.15 Intr - 164289 164175 115 1 1 60 100 45 0.604 2.41 6.14 Intr - 167520 167331 190 0 1 139 89 -15 0.899 2.86 6.13 Intr - 170527 170406 122 2 2 47 98 118 0.998 8.91 6.12 Intr - 172330 172186 145 1 1 92 74 99 0.999 8.76 6.11 Intr - 175292 175116 177 2 0 85 101 61 0.856 7.22 6.10 Intr - 177138 177018 121 2 1 83 101 50 0.858 6.30 6.09 Intr - 178189 178104 86 1 2 69 78 19 0.749 -2.38 6.08 Intr - 179975 179802 174 2 0 80 86 306 0.706 29.74 6.07 Intr - 184823 184748 76 1 1 103 86 76 0.863 8.42 6.06 Intr - 186148 185974 175 2 1 62 55 173 0.612 10.40 6.05 Intr - 189900 189729 172 0 1 78 83 93 0.830 7.32 6.04 Intr - 196868 196699 170 0 2 57 123 197 0.949 19.77 6.03 Intr - 197807 197707 101 1 2 105 115 61 0.999 10.25 6.02 Intr - 199122 199018 105 2 0 66 46 99 0.850 2.83 6.01 Intr - 202102 201935 168 0 0 77 75 205 0.858 17.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_1|247_aa RPDTRAGPAPSAPQKGLTHLRRPRECRVTVANFRGTRAPDVREEEEKRRPPGLGFTGTAR LPPLLQAALGCTDTASAAASGPGEGSAVSRAVALAQQTRGAGPRNFAKWREGATRPLSAF VAPASPPHRARDAQPSPLARSRPGLGPRPSARRPSRPAPPPPPPELHRGAPQRGRVASQT RPGARARPAADTHSSLKSPASQAGSAAFATAPLTPQKPQLRAPWQPLHSFRPCRGVTSRL RTAPPPD >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_1|744_bp cgcccggacacccgagccgggccagcccccagcgccccgcagaagggtttgacgcacttg cggcgccctcgcgagtgccgggtgacagtcgccaactttcgcggaacacgcgctccagat gtgagggaagaggaggaaaaacgccgcccgcctgggctggggttcacaggcacggcccgc ctcccgccgctcctccaggccgcgctcgggtgcacggacaccgcctccgcggcggcctcg ggcccgggggagggaagcgcggtctcccgggctgttgccctcgcgcagcagacccgaggg gccggcccgcgaaacttcgcgaagtggcgagagggggccacccggccgctcagcgccttt gtcgctccagcctcgccaccgcaccgggctcgagacgcccagcccagccccttggcccga tcccgcccgggcctcggcccgcgcccctcagctcggcggccctcacggcccgcgcccccg cccccacccccggagctccacagaggcgcccctcagagaggccgcgtcgcctcacagacc cgcccaggagctcgagcgcgccccgcagccgacactcactcgtcactcaagtcgccggct agccaggcaggttcggcggccttcgctaccgcgccgcttactcctcagaagccgcagctg cgagctccgtggcagccgctgcactccttccggccctgccgcggcgtcacctcgcgcctg cgcacagcgccccctcccgactaa >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_2|84_aa MTVSFLRPPQPRFLYRLQNYLTLSLSMAKIYGKASPDQEALGRSMGSGAEEQGTALVGEA QAAQEPTVPVGRHCWGSRHTLLSC >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_2|255_bp atgactgtcagtttcctgaggcctccccagccacgcttcctgtacaggctgcagaactac ctcacgctcagtttatccatggctaaaatctatggcaaagccagccctgatcaagaggcc cttgggcggtcaatgggatccggcgccgaggagcaggggacggcactcgtcggggaggct caggctgcgcaggagcccacagtgccggtgggccggcactgctggggttcccggcacacc ctccttagctgctag >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_3|443_aa MASWGVLGVVVPGGAAFTQCFEVQDCNARTSGPQTLPTATDTSDTRCGGRRSGSGAVPSA SVAGTHLLQSRHCTGVLGGGSSRRLCGLSGAAELGWGTLQVVGPEQRLAPHLTALPRTPP SSGSLCRAPATVTTWTQIPAFPQQQGRTMAQGSLSFGDVAVGFTRKEWQQLDLEQRTLYQ DVMLENYSHLLSVGEEDFPVVLGLNCLLAWKAQLAQHKSFNRFSPRGCQVSKPAVISSLE QGKEPWMEEEEIRTWSFPGANPRGGHQCGSALSCDEGVPAAQGASSEKPHECTKCGKALC CRSDLRVHHGVHAGEKSSACSERGSGFREKLCPDKQGTHTKEKPARDSRSGKTIFRKTRL CVPGTVHAGAKPYKCWECEKTSHKSRLIEHLRSHTGEKPCGCRECGKAFFQKSHLILRQR THTGEKPCDCAECGKAAPRTPAS >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_3|1332_bp atggcgtcgtggggggttctgggagttgtagtccctgggggagctgcttttacgcagtgt tttgaggtgcaggactgcaatgcccgcacttccggaccccagactcttcctacggcgact gacacctcagacacgcgttgtggggggcggcgctctggctccggggctgtgcccagcgcg agcgtggccggcacccacttgctgcagagccggcactgcactggggttcttgggggcggc tcctccagacgtctctgcgggttgtcgggtgctgccgagttgggttgggggacgctacag gtagtgggtccggagcagcggctggcccctcatctcactgcccttcctcggaccccaccg tcctctggaagcttgtgcagggcacctgccactgtgaccacctggacgcagatacctgct ttcccacaacagcaaggaagaaccatggcccaggggtcactgtcattcggggacgtggct gtgggcttcacccggaaagagtggcagcagctggacctggagcagaggaccctgtaccag gatgtgatgctggagaactacagccacctgctctctgtgggtgaggaagacttccccgtg gtcctcgggctgaactgtctgcttgcttggaaggcccagctggctcagcacaaatccttt aaccgtttctccccaagagggtgtcaagtcagcaaaccagctgtgatctccagtttggag caggggaaggagccatggatggaggaggaagagataaggacgtggagcttcccaggagcg aatccacgcggaggtcaccagtgtggaagtgctttaagctgtgacgagggagttcctgca gctcagggagccagtagtgagaaaccccacgaatgcacgaagtgtgggaaagccttgtgc tgcagatcggacctcagggtacatcacggggtccacgcgggggagaagtcctctgcgtgc agtgaacgggggagtggtttcagggagaagctttgccctgacaaacagggaactcacaca aaggagaaacccgctagagacagcagaagtggtaaaacgatcttccggaagacacgcctg tgtgtcccgggcacagttcacgccggagcgaagccttacaagtgttgggagtgtgagaaa acctcccacaagtcgcgcctcatcgagcaccttcgctcccacacgggggagaagccctgc ggctgcagggaatgcggaaaggcctttttccagaagtcacacctcatcctgcgtcagagg actcacacgggggagaagccctgcgactgcgcggagtgcgggaaagctgctcccaggact cctgcctcctga >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_4|109_aa MSITDVLSADDIAAALQECRDPDTFEPQKFFQTSGLSKMSANQVKDVFRFIDNDQSGYLD EEELKFFLQKFESGARELTESETKSLMAAADNDGDGKIGAEEFQEMVHS >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_4|330_bp atgagcatcacggacgtgctcagtgctgacgacattgcagcagcgctccaggaatgccga gacccagacacttttgaaccccaaaaattcttccagacatcaggcctctccaagatgtca gccaatcaggtgaaggatgttttccggttcatagacaacgaccagagcgggtacctggat gaagaagagcttaagtttttcctccagaagtttgagagtggtgccagagaactgaccgag tcagaaaccaagtccttgatggctgcggcggataatgatggagatgggaaaattggagca gaggaattccaggaaatggtgcattcttaa >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_5|390_aa MAAAAAGAGSGPWAAQEKQFPPALLSFFIYNPRFGPREGQVVRNPIIEKQSKDGKPVIEY QEEELLDKVYSSVLRQCYSMYKLFNGTFLKAMEDGGVKLLKERLEKFFHRYLQTLHLQSC DLLDIFGGISFFPLDKMTYLKIQSFINRMEESLNIVKYTAFLYNDQLIWSGLEQDDMRIL YKYLTTSLFPRHIEPEAMSAAVCFMIDGGPVKFSFEYCCSASHSKMNLTGIFCHDLPASV HPTLDFCRRLDSIVGPQLTVLASDICEQFNINKRMSGSEKEPQFKFIYFNHMNLAEKSTV HMRKTPSVSLTSVHPDLMKILGDINSDFTRVDEDEEIIVKAMSDYWVVGKKSDRRELYVI LNQKNANLIEVNEEVKKLCATQFNNIFFLD >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_5|1173_bp atggctgcagcggcggccggggccgggagcgggccctgggcggcccaggagaagcagttc ccgccggcgctgctgagtttcttcatctacaacccgcgcttcgggccgcgcgaaggacag gttgttcggaatcctataattgaaaaacagagtaaagatggaaaaccagttattgaatat caagaggaggagttgttggacaaggtttatagctcggtgctgcggcagtgctacagcatg tacaagctttttaatggtacatttctgaaagccatggaagacggaggcgtcaagcttctg aaagaaagattagagaaattcttccatcggtatttgcaaacgctacatttgcagtcatgt gacctacttgacatttttggtggaatcagcttcttcccgttggataaaatgacttatttg aaaatccagtcctttattaatagaatggaggaaagcctgaatatagtcaaatacactgct tttctctataacgatcagctcatctggagtggattagaacaagatgacatgagaatttta tacaaataccttaccacctccctttttccaaggcacatcgaacctgaggccatgagtgcg gctgtgtgctttatgatcgacggtggaccagtgaagttttcgtttgaatactgttgttca gcgtcgcacagtaaaatgaatcttactggtattttttgtcatgacttgccagcctctgtc cacccaacgttggatttttgccgaagactggacagcatcgttgggccccagctcacagtg ctggcctctgacatctgtgaacagtttaacatcaacaagaggatgtccgggtctgagaaa gaaccccagtttaagtttatctacttcaaccacatgaatctcgccgagaagagcacagtt cacatgaggaaaacgcccagcgtgtcgctcacttccgtgcacccggatttaatgaagatt ctcggtgacatcaacagtgactttaccagagtggatgaagatgaggagatcattgtgaag gccatgagtgattactgggttgttggaaagaagtctgatcggcgggagctctatgttatt ttgaatcaaaaaaatgcaaacctgattgaagtaaatgaagaggtcaagaaactttgtgca acgcagttcaacaacatcttcttcttggattga >gi568815591f:5780890_5983999|GENSCAN_predicted_peptide_6|1028_aa XKTMFAEMEIIGQFNLGFIITKLNEDIFIVDQHATDEKYNFEMLQQHTVLQGQRLIAVKS LYDTGCFFFLTFLSFLIIIDLPEYLLNVFLPRPQTLNLTAVNEAVLIENLEIFRKNGFDF VIDENAPVTERAKLISLPTSKNWTFGPQDVDELIFMLSDSPGVMCRPSRVKQMFASRACR KSDQQGRDTCLALARGPFKSTGWAWRRPRLPGDRSRVWDAESPATPQPEPGEVSGLLERD FSKQDGNTTRQEMSPAGVPLLGMQLNEVKPKKDRQNVQQNEDATQYEESILTKLIVESYE GEKVRGLYEGEGFAAFQGGCTYRGDFVKNVPMNHGVYTWPDGSMYEGEVVNGMRNGFGMF KCSTQPVSYIGHWCNGKRHGKGSIYYNQEGTCWYEGDWVQNIKKGWGIRCYKSGNIYEGQ WEDNMRHGEGRMRWLTTNEEYTGRWERGIQNGFGTHTWFLKRIRSSQYPLRNEYIGEFVN GYRHGRGKFYYASGAMYDGEWVSNKKHGMGRLTFKNGRVYEGAFSNDHIAGFPDLEVEFI SCLDLSSGVAPRLSRSAELIRKLDGSESHSVLGSSIELDLNLLLDMYPETVQPEEKKQVE YAVLRNITELRRIYSFYSSLGCGHSLDNTFLMTKLHFWRFLKDCKFHHHKLTLADMDRIL SANNDIPVEEIHSPFTTILLRTFLNYLLHLAYHIYHEEFQKRSPSLFLCFTKLMTENIRP NAFQIKGNLFREQQRTLYSMSYMNKCWEIYLAYCRPSAAPPHEPTMKMRHFLWMLKDFKM INKELTAATFMEVIAEDNRFIYDGIDSNFEPELVFLEFFEALLSFAFICVTDQMTKSYTN VPADDVSGNKHETIYTILNQDAQNKSPSAVMSHESDAAHSDSARSSSSKLELSPDVNKIR KSEKYERPKDDREEEFNTWVNNMYVFFVNTLFHAYKREEAIKEKIRADRLRSTAQAQQRK MEDDELEARLNIFILREEEAKRHDYEVDITVLKEPADVSSSHLILDPPKEDVTVSPSSKT ITSKKKKK >gi568815591f:5780890_5983999|GENSCAN_predicted_CDS_6|3087_bp nntaaaacgatgtttgcagaaatggaaatcattggtcagtttaacctgggatttataata accaaactgaatgaggatatcttcatagtggaccagcatgccacggacgagaagtataac ttcgagatgctgcagcagcacaccgtgctccaggggcagaggctcatagcggtgaagtct ctctatgacactggctgctttttcttcctgaccttcctgtcgtttctcatcataatcgac ttgcctgagtatttgctcaacgtctttcttccccgacctcagactctcaacttaactgct gttaatgaagctgttctgatagaaaatctggaaatatttagaaagaatggctttgatttt gttatcgatgaaaatgctccagtcactgaaagggctaaactgatttccttgccaactagt aaaaactggaccttcggaccccaggacgtcgatgaactgatcttcatgctgagcgacagc cctggggtcatgtgccggccttcccgagtcaagcagatgtttgcctccagagcctgccgg aagtcggaccagcaagggcgggacacctgcttggcgctggcgcgcggcccctttaagagc acagggtgggcctggcggcgtccgcggttgcctggagaccggagccgggtctgggacgcc gagagcccggcaacacctcagcccgagcccggcgaggtctctgggctcctggagcgagac ttttccaaacaagatggcaacaccactaggcaagagatgtccccagctggtgtcccattg ctgggaatgcagctcaacgaagtgaaacccaaaaaagaccgccaaaacgttcagcagaac gaagatgccacccaatacgaagagtccattctgaccaaactcatagtggaaagctatgaa ggggaaaaggttcgtgggctgtatgagggagaaggcttcgcagcctttcaaggcggttgt acctatcgtggcgactttgtgaagaatgtcccgatgaaccacggcgtgtacacgtggccg gacggcagcatgtatgaaggcgaagtggtcaacggcatgaggaacggattcgggatgttc aagtgcagcacccagcctgtgtcctacatcggccactggtgcaatggcaagcggcacggg aagggctccatttattacaatcaagagggtacgtgttggtacgagggagactgggtacaa aacatcaaaaagggctggggaataagatgttataaatctggaaatatatacgaaggccag tgggaagacaacatgcgccacggggaggggaggatgaggtggctgaccaccaacgaagag tacaccgggcggtgggagaggggcatccagaatggctttggaacacacacatggtttcta aagagaatccgcagttcccagtatcctttgagaaatgaatacataggggagtttgtaaat ggatatcgtcacggacgtggcaagttttattatgccagtggagccatgtatgatggagaa tgggtttccaataagaaacatggcatgggccgattaactttcaagaacgggcgtgtgtac gaaggcgcattctccaatgaccacatagctgggtttccggatcttgaagttgaattcatc agctgcctggacctgtcttcaggagttgccccaagactgtccaggagcgccgaactgatc agaaagcttgatggcagtgaaagtcattctgtgttgggatcgagcattgagctggatcta aatttgctcctggacatgtaccctgagacagtccaacctgaagaaaagaagcaggtggaa tatgccgtcttaagaaatattacagaattaagaagaatttacagcttttacagcagcctg ggatgcggccactctctggataatacctttctgatgacaaagcttcacttctggagattt ctaaaagattgcaaatttcatcaccacaaactaactcttgctgatatggacaggatatta agtgccaataatgacataccagttgaagaaatccattctccatttacaacaatacttttg agaacatttttgaattacctcctgcatttggcgtaccacatttatcatgaagaattccaa aagagaagcccatccctcttcttgtgttttacaaaactgatgaccgagaacattcgtcca aatgccttccagataaaaggcaatttattccgtgagcaacagcggacgctctactctatg agttacatgaataagtgctgggagatttatctcgcttactgcagacccagtgcagcgcct ccccacgagcctacgatgaagatgagacacttcctctggatgctgaaagactttaaaatg ataaataaagaattaacagcagctacatttatggaggtcatagcagaggataatcgtttc atatatgatggaattgacagcaactttgaacctgagctggttttcctggaattctttgaa gctctcttaagctttgcattcatctgtgttactgaccaaatgactaaatcctatacaaat gttccagctgatgatgtgtctggaaataaacatgaaactatttatacaatactaaatcag gacgcccagaacaagagtcccagcgcggtcatgagccacgaatcggatgctgctcactct gacagtgccaggtcatcttccagcaagttagaactctcgcctgatgttaacaaaataagg aaatcagagaaatatgagagacccaaggatgatcgagaggaagagttcaacacgtgggtc aataatatgtacgtcttctttgtgaacacgctctttcatgcgtataaacgtgaagaagct atcaaggagaaaataagggcagacaggttacgtagcacagcacaggcccagcagcggaag atggaagatgacgaactggaagcaaggctgaacatcttcatcttgagagaggaagaggcc aagagacatgactatgaggtggacatcacagtgctcaaggagccggcagacgtgtcatcc tctcacctcatactggaccctcccaaggaggatgtgaccgtgtccccatccagcaagacc atcaccagcaagaagaagaaaaagtag