GENSCAN 1.0    Date run: 3-Nov-116    Time: 23:26:03

Sequence gi568815583f:78773018_78997064 : 224047 bp : 48.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -    186     66  121  0  1  105   47   241 0.676  21.77
 1.13 Intr -   1283   1150  134  0  2   84   75   340 0.550  32.66
 1.12 Intr -   1776   1607  170  2  2  126  113   224 0.999  28.29
 1.11 Intr -   3316   3171  146  1  2   37   83   127 0.997   6.18
 1.10 Intr -   3824   3732   93  2  0   87   97    71 0.986   8.06
 1.09 Intr -   4571   4427  145  1  1   93   76   234 0.918  22.98
 1.08 Intr -  15357  15214  144  2  0   63   80   206 0.947  16.70
 1.07 Intr -  16821  16672  150  2  0   76   84   230 0.991  20.68
 1.06 Intr -  17777  17653  125  2  2   78   84   113 0.996   9.28
 1.05 Intr -  18206  18123   84  2  0  115  105   118 0.999  16.22
 1.04 Intr -  23862  23573  290  1  2   96  101   275 0.536  26.56
 1.03 Intr -  25096  24931  166  1  1  116   91   163 0.891  18.93
 1.02 Intr -  27530  27175  356  0  2  104  105   667 0.914  65.01
 1.01 Init -  38203  38104  100  1  1   80   94   118 0.889  10.03
 1.00 Prom -  47353  47314   40                              -5.56

 2.00 Prom +  51248  51287   40                              -3.76
 2.01 Init +  57947  58176  230  1  2   81   12   173 0.524   5.04
 2.02 Intr +  58188  58727  540  0  0   62   50   717 0.293  57.32
 2.03 Intr +  58999  59128  130  1  1   76   55   113 0.262   7.50
 2.04 Intr +  67535  67658  124  1  1   45   98    62 0.036   3.06
 2.05 Term +  89657  89910  254  0  2   53   38   134 0.082   0.70
 2.06 PlyA +  89929  89934    6                               1.05

 3.00 Prom +  94751  94790   40                              -4.36
 3.01 Init + 100390 100712  323  0  2   51   52   249 0.803  12.41
 3.02 Intr + 113124 113210   87  0  0  101   89   107 0.995  11.19
 3.03 Intr + 117972 117997   26  0  2  130   69    25 0.800   2.47
 3.04 Intr + 118467 118555   89  1  2   54   82    61 0.991   1.79
 3.05 Intr + 119195 119296  102  1  0   88   93    60 0.982   6.87
 3.06 Intr + 120522 120610   89  2  2   65   60    76 0.903   1.27
 3.07 Term + 123966 124050   85  0  1   84   54   102 0.822   3.53
 3.08 PlyA + 124193 124198    6                               1.05

 4.15 PlyA - 124307 124302    6                               1.05
 4.14 Term - 145092 144958  135  0  0   59   42   128 0.710   3.32
 4.13 Intr - 149188 149119   70  0  1   76   50    83 0.709   2.48
 4.12 Intr - 150101 149976  126  1  0   92  102    28 0.951   4.39
 4.11 Intr - 152423 152317  107  2  2   87  109   163 0.989  17.31
 4.10 Intr - 154764 154696   69  0  0   84   78   139 0.935  11.88
 4.09 Intr - 156476 156395   82  1  1  101   88   122 0.997  13.14
 4.08 Intr - 158489 158368  122  2  2   92   28   163 0.851   9.99
 4.07 Intr - 158961 158852  110  1  2   35   75    77 0.705   1.10
 4.06 Intr - 159441 159355   87  1  0  114   91   116 0.723  14.44
 4.05 Intr - 162065 161882  184  2  1   60   31   115 0.321   2.36
 4.04 Intr - 162733 162663   71  2  2   78   96    40 0.949   2.70
 4.03 Intr - 164406 164301  106  0  1   93   68   149 0.908  13.19
 4.02 Intr - 168085 168054   32  2  2  120   48    38 0.329   0.85
 4.01 Init - 171964 171874   91  1  1   64   73   200 0.495  14.75
 4.00 Prom - 183021 182982   40                              -0.96

 5.10 PlyA - 184088 184083    6                               1.05
 5.09 Term - 189219 189127   93  0  0  125   48    50 0.361   2.53
 5.08 Intr - 198917 198849   69  2  0  105  121    74 0.999  11.78
 5.07 Intr - 200403 200286  118  2  1   39   74   213 0.553  15.47
 5.06 Intr - 207682 207603   80  1  2   94  106    41 0.966   4.85
 5.05 Intr - 212187 211990  198  0  0   32   80   328 0.973  25.95
 5.04 Intr - 217256 217172   85  1  1  111   86    76 0.989   9.52
 5.03 Intr - 218867 218674  194  2  2   18   80   176 0.313   8.09
 5.02 Intr - 222783 222723   61  2  1   91   89    92 0.904   8.34
 5.01 Init - 223547 223525   23  2  2   92   94    46 0.890   4.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:78773018_78997064|GENSCAN_predicted_peptide_1|742_aa MPGGPSPRSPAPLLRPLLLLLCALAPGAPGPAPGRATEGRAALDIVHPVRVDAGGSFLSY ELWPRALRKRDVSVRRDAPAFYELQYRGRELRFNLTANQHLLAPGFVSETRRRGGLGRAH IRAHTPACHLLGEVQDPELEGGLAAISACDGLKGVFQLSNEDYFIEPLDSAPARPGHAQP HVVYKRQAPERLAQRGDSSAPSTCGVQVPQRELTEERPWRRVPWACPLAANSTPSGPPVY PELESRRERWEQRQQWRRPRLRRLHQRSVSKEKWVETLVVADAKMVEYHGQPQVESYVLT IMNMVAGLFHDPSIGNPIHITIVRLVLLEDEEEDLKITHHADNTLKSFCKWQKSINMKGD AHPLHHDTAILLTRKDLCAAMNRPCETLGLSHVAGMCQPHRSCSINEDTGLPLAFTVAHE LGHSFGIQHDGSGNDCEPVGKRPFIMSPQLLYDAAPLTWSRCSRQYITRFLDRGWGLCLD DPPAKDIIDFPSVPPGVLYDVSHQCRLQYGAYSAFCEDMDNVCHTLWCSVGTTCHSKLDA AVDGTRCGENKWCLSGECVPVGFRPEAVDGGWSGWSAWSICSRSCGMGVQSAERQCTQPT PKYKGRYCVGERKRFRLCNLQACPAGRPSFRHVQCSHFDAMLYKGQLHTWVPVVNDVNPC ELHCRPANEYFAEKLRDAVVDGTPCYQVRASRDLCINGICKNVGCDFEIDSGAMEDRCGV CHGNGSTCHTVSGTFEEAEGLX >gi568815583f:78773018_78997064|GENSCAN_predicted_CDS_1|2226_bp atgcccggcggccccagtccccgcagccccgcgcctttgctgcgccccctcctcctgctc ctctgcgctctggctcccggcgcccccggacccgcaccaggacgtgcaaccgagggccgg gcggcactggacatcgtgcacccggttcgagtcgacgcggggggctccttcctgtcctac gagctgtggccccgcgcactgcgcaagcgggatgtatctgtgcgccgagacgcgcccgcc ttctacgagctacaataccgcgggcgcgagctgcgcttcaacctgaccgccaatcagcac ctgctggcgcccggctttgtgagcgagacgcggcggcgcggcggcctgggccgcgcgcac atccgggcccacaccccggcctgccacctgcttggcgaggtgcaggaccctgagctcgag ggtggcctggcggccatcagcgcctgcgacggcctgaaaggtgtgttccagctctccaac gaggactacttcattgagcccctggacagtgccccggcccggcctggccacgcccagccc catgtggtgtacaagcgtcaggccccggagaggctggcacagcggggtgattccagtgct ccaagcacctgtggagtgcaagtgccccagagagagctcactgaggagaggccctggaga cgtgtgccctgggcctgtccactggcagctaactccaccccatctgggcctccagtgtac ccagagctggagtctcgacgggagcgttgggagcagcggcagcagtggcggcggccacgg ctgaggcgtctacaccagcggtcggtcagcaaagagaagtgggtggagaccctggtagta gctgatgccaaaatggtggagtaccacggacagccgcaggttgagagctatgtgctgacc atcatgaacatggtggctggcctgtttcatgaccccagcattgggaaccccatccacatc 
accattgtgcgcctggtcctgctggaagatgaggaggaggacctaaagatcacgcaccat gcagacaacaccctgaagagcttctgcaagtggcagaaaagcatcaacatgaagggggat gcccatcccctgcaccatgacactgccatcctgctcaccagaaaggacctgtgtgcagcc atgaaccggccctgtgagaccctgggactgtcccatgtggcgggcatgtgccagccgcac cgcagctgcagcatcaacgaggacacgggcctgccgctggccttcactgtagcccacgag ctcgggcacagttttggcattcagcatgacggaagcggcaatgactgtgagcccgttggg aaacgacctttcatcatgtctccacagctcctgtacgacgccgctcccctcacctggtcc cgctgcagccgccagtatatcaccaggttccttgaccgtgggtggggcctgtgcctggac gaccctcctgccaaggacattatcgacttcccctcggtgccacctggcgtcctctatgat gtaagccaccagtgccgcctccagtacggggcctactctgccttctgcgaggacatggat aatgtctgccacacactctggtgctctgtggggaccacctgtcactccaagctggatgca gctgtggacggcacccggtgtggggagaataagtggtgtctcagtggggagtgcgtaccc gtgggcttccggcccgaggccgtggatggtggctggtctggctggagcgcctggtccatc tgctcacggagctgtggcatgggcgtacagagcgccgagcggcagtgcacgcagcctacg cccaaatacaaaggcagatactgtgtgggtgagcgcaagcgcttccgcctctgcaacctg caggcctgccctgctggccgcccctccttccgccacgtccagtgcagccactttgacgct atgctctacaagggccagctgcacacatgggtgcccgtggtcaatgacgtgaacccctgc gagctgcactgccggcccgcgaatgagtactttgccgagaagctgcgggacgccgtggtc gatggcaccccctgctaccaggtccgagccagccgggacctctgcatcaacggcatctgt aagaacgtgggctgtgacttcgagattgactccggtgctatggaggaccgctgtggtgtg tgccacggcaacggctccacctgccacaccgtgagcgggaccttcgaggaggccgagggc ctggnn >gi568815583f:78773018_78997064|GENSCAN_predicted_peptide_2|425_aa MASARGRVESPELSGPCFPSAGRKRRRGLLPNSSLGNGSENTSPARSPRSLHGLDGVAGA VRAPALRALIAGAYWALPGGQRLVLVRVRSQQWRRHWLLNCFLLNLAATDLQFVLTLPFW AVDTARDFSWPFGGAICKVMLTLTVLNMYASIFLLSAMSVARYCIVTGALPPSHRGASRA SCVCCLLWAMAVLATAPTALFATAARVGGKHSCLLRFPAGGPKWQVLYHLQKIAVAFVLP LATLGTCSLLLRFLRLWAFESCVAEPSGRCPSEQAPTAAPQRLHSSNEGRLSLRQYQGHG QKRMIPFPHPQVTDALGRYSNYILSSTVESPSLRSMNFEKQGGNLEKDTQTQTPTSGNLG ESHVQTKAETGVMLLQAKEHQMLLQAKERQRLPANHQKLGERNGTDSSSQSTEGTKPTNT FIWDV >gi568815583f:78773018_78997064|GENSCAN_predicted_CDS_2|1278_bp atggcatccgccagaggcagggtcgaatcgcccgagctgtctgggccgtgcttcccttca gctggccggaagaggcgtcgggggctgctccctaacagctccctgggcaacggatcagag 
aacactagcccggcccggagtccccgcagcctgcatggcctggacggggtggcgggggcc gtccgggcgccggcgctgcgggctctgattgcgggcgcctactgggccctgcctggtggg caacggctggtgctagtccgggtgaggtcccagcagtggcgccgccactggctgctcaat tgcttcctcctcaatctggcagccactgacctgcagtttgtgctaacgctgcccttttgg gccgtggacacggcgcgcgactttagctggcccttcgggggtgccatctgcaaggtgatg ctgacgctcaccgtgctcaacatgtatgccagcatcttcctcctcagtgccatgagcgtg gcacgctattgcattgtgactggcgcgctgcctccgagccatcggggcgcatcacgggcc agctgtgtgtgctgcctgctctgggctatggccgtcctggctacggcgcccaccgccctg ttcgccacggcagctagggtggggggaaagcactcgtgcctgctgcgcttccccgccggc ggccccaaatggcaggtgctctaccacctgcagaagatcgcagtagccttcgtgctgccg ctggccacgctgggcacctgttcgctgctgctgcgcttcctgcgactgtgggccttcgag agctgtgtagctgagcccagcggccggtgcccctccgagcaagctcccacggcagcgccc cagcggctgcactcttccaatgagggccgcttgtcgctgcgtcagtatcaagggcatggg caaaaaagaatgatccccttcccccatcctcaagtgactgatgctcttggaagatactcc aactatatcctgagtagcactgtggagagccccagtctgagaagcatgaactttgagaag caagggggaaatttggaaaaagacacacaaacccagacaccaacatctggaaacctggga gaaagccatgtgcagacgaaggcagaaactggggtaatgcttctacaagctaaggaacac caaatgctcctgcaagctaaggaacgccaaagattgccagcaaaccaccagaagctaggg gagaggaatggcacagattcttcctcacagtccacagaaggaaccaaacctaccaacacc tttatatgggatgtctag >gi568815583f:78773018_78997064|GENSCAN_predicted_peptide_3|266_aa MWRRGGRGGGCFGRSCGAPGRLGSYDGFFGASWSPGVVAFGGNSAADGGGFGATGREKCV VKWPLQDGGAIMAAGVSVPYGGTAYGQMQRPLPRRPEGCRGPPHTTECWDEWVPESRVLK YVDTNLQKQRELQKANQKTKKNKQKTPGNGDGGSTSETPQPPRKKRARVDPTVENEETFM NRVEVKVKIPEELKPWLVDDWDLITRQKQLFYLPAKKNVDSILEDYANYKKSRGNTDNKY LAKNSATLFSASDYEVAPPEYHRKAV >gi568815583f:78773018_78997064|GENSCAN_predicted_CDS_3|801_bp atgtggcggcggggggggaggggcggtggctgtttcgggcggtcctgcggcgcgcccggc cgcttgggcagctacgacgggtttttcggtgcttcctggagccccggggtggttgcgttc ggtggcaactcagctgcggatgggggtgggtttggcgccacggggcgggagaagtgcgtt gtaaaatggccgttgcaagatggcggcgccatcatggccgccggcgtctcggtcccgtat ggaggcacagcttacgggcagatgcagcgcccccttccacgacgaccagaaggttgccgg ggccctccgcataccacagagtgttgggatgaatgggttccggagagcagagtactcaaa 
tacgtggacaccaatttgcagaaacagcgagaacttcaaaaagccaatcagaaaacgaaa aagaacaaacagaaaacacctggaaatggagatggtggcagtaccagtgagacccctcag cctcctcggaagaaaagggcccgggtagatcctactgttgaaaatgaggaaacattcatg aacagagttgaagttaaagtaaagattcctgaagagctaaaaccgtggcttgttgatgac tgggacttaattaccaggcaaaaacagctcttttatcttcctgccaagaagaatgtggat tccattcttgaggattatgcaaattacaagaaatctcgtggaaacacagataataagtac ctggcaaagaattctgcaactttgttcagtgccagcgattatgaagtggctcctcctgag taccatcggaaagctgtgtga >gi568815583f:78773018_78997064|GENSCAN_predicted_peptide_4|463_aa MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKNVKVIKHKNHRKTYSTEEYHHRLQTFAS NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV DWRKKGNFVSPVKNQVCLAISILSLQNGHTPRQYTRKNNSKGCLRQLLDFLHHWGPGVCD RHRNRKDAVLETDDAPEGTILAPASVEKGGTVHSRPGDAMRTLGCGQAEQQLVDCAQDFN NHGCQGYVSDQKGPKTSSFPTDRFDRIQGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCK FQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKT PDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLA PGSTQDVYNIQCWQDCCETSSMACALVLFGFKEIGRHGYIQLR >gi568815583f:78773018_78997064|GENSCAN_predicted_CDS_4|1392_bp atgtgggccacgctgccgctgctctgcgccggggcctggctcctgggagtccccgtctgc ggtgccgccgaactgtgcgtgaactccttagaaaagaatgtgaaggtgatcaagcataaa aatcaccgtaagacctacagtacggaggagtaccaccacaggctgcagacgtttgccagc aactggaggaagataaacgcccacaacaatgggaaccacacatttaaaatggcactgaac caattttcagacatgagctttgctgaaataaaacacaagtatctctggtcagagcctcag aattgctcagccaccaaaagtaactaccttcgaggtactggtccctacccaccttccgtg gactggcggaaaaaaggaaattttgtctcacctgtgaaaaatcaggtatgcctggcaatc tccatcctctccctgcaaaacggccacactcccagacaatacaccaggaagaacaactca aaagggtgcctgcggcagttgctggactttctccaccactggggccctggagtctgcgat cgccatcgcaaccggaaagatgctgtccttgagacggatgatgcccctgaaggaactatc ctagcccctgcctctgtggagaaaggagggacagtgcacagccggcctggtgatgccatg aggaccctgggttgtggacaggcggaacagcagctggtggactgcgcccaggacttcaat aatcacggctgccaagggtacgtctctgaccagaaggggcccaaaacttcctcatttccc actgaccgcttcgacaggatccagggtctccccagccaggctttcgagtatatcctgtac aacaaggggatcatgggtgaagacacctacccctaccagggcaaggatggttattgcaag 
ttccaacctggaaaggccatcggctttgtcaaggatgtagccaacatcacaatctatgac gaggaagcgatggtggaggctgtggccctctacaaccctgtgagctttgcctttgaggtg actcaggacttcatgatgtatagaaccggcatctactccagtacttcctgccataaaact ccagataaagtaaaccatgcagtactggctgttgggtatggagaaaaaaatgggatccct tactggatcgtgaaaaactcttggggtccccagtggggaatgaacgggtacttcctcatc gagcgcggaaagaacatgtgtggcctggctgcctgcgcctcctaccccatccctctggca ccagggagcacacaggatgtttataacatccagtgttggcaggactgttgtgaaacaagc agcatggcctgtgccttggtgctgtttggatttaaagagatcggcagacatggctacatc caactgagataa >gi568815583f:78773018_78997064|GENSCAN_predicted_peptide_5|306_aa MLQLRDPRTLTQEDPGDNQITLEEITQMGEIPPQRQLGIQEVSVMGLTQNVAPPTQNQAE GVKAEPFENHSALEIAEQLTLLDHLVFKKIPYEEFFGQGWMKLEKNERTPYIMKTTKHFN DISNLIASEIIRNEDINARVSAIEKWVAVADICRCLHNYNAVLEITSSMNRSAIFRLKKT WLKVSKQTKALIDKLQKLVSSEGRFKNLREALKNCDPPCVPYLGMYLTDLAFIEEGTPNY TEDGLVNFSKMRMISHIIREIRQFQQTAYKIEHQAKVTQYLLDQSFVMDEESLYESSLRI EPKLPT >gi568815583f:78773018_78997064|GENSCAN_predicted_CDS_5|921_bp atgctccagctgcgggaccccaggactctgacccaggaggacccaggtgacaaccagatc acgctggaggagatcacgcagatgggcgagatacccccacagaggcagctgggaatccag gaggtgtccgtcatgggcctaactcaaaatgttgctcccccaactcaaaaccaggctgaa ggcgtgaaggctgagccctttgaaaaccactcagccctggagatcgcggagcagctgacc ctgctagatcacctcgtcttcaagaagattccttatgaggagttcttcggacaaggatgg atgaaactggaaaagaatgaaaggaccccttatatcatgaaaaccactaagcacttcaat gacatcagtaacttgattgcttcagaaatcatccgcaatgaggacatcaacgccagggtg agcgccatcgagaagtgggtggccgtagctgacatatgccgctgcctccacaactacaat gccgtactggagatcacctcgtccatgaaccgcagtgcaatcttccggctcaaaaagacg tggctcaaagtctctaagcagactaaagctttgattgataagctccaaaagcttgtgtca tctgagggcagatttaagaatctcagagaagctctgaaaaattgtgacccaccctgtgtc ccttacctggggatgtacctcaccgacctggccttcatcgaggaggggacgcccaattac acggaagacggcctggtcaacttctccaagatgaggatgatatcccatattatccgagag attcgccagtttcaacaaactgcctacaaaatagagcaccaagcaaaggtaacgcaatat ttactggaccaatcttttgtaatggatgaagaaagcctctacgagtcttctctccgaata gaaccaaaactccccacctga