GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:47:49 Sequence gi568815588f:30338939_30560833 : 221895 bp : 44.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 1512 1288 225 0 0 67 84 139 0.707 9.58 1.03 Intr - 2702 2530 173 0 2 59 78 60 0.730 1.76 1.02 Intr - 3458 3422 37 2 1 94 71 36 0.706 0.34 1.01 Init - 5229 5146 84 0 0 80 115 11 0.642 3.82 1.00 Prom - 7578 7539 40 -6.56 2.00 Prom + 9051 9090 40 -4.46 2.01 Init + 9829 9846 18 0 0 72 89 34 0.299 1.07 2.02 Intr + 17307 17444 138 2 0 100 0 248 0.958 17.86 2.03 Term + 18216 18416 201 2 0 74 41 354 0.831 26.69 2.04 PlyA + 21349 21354 6 1.05 3.11 PlyA - 21964 21959 6 1.05 3.10 Term - 25641 25472 170 1 2 -22 48 118 0.499 -5.16 3.09 Intr - 26129 25768 362 1 2 98 89 237 0.604 19.66 3.08 Intr - 26313 26207 107 0 2 24 24 143 0.915 0.51 3.07 Intr - 26627 26583 45 2 0 85 80 55 0.857 3.01 3.06 Intr - 27810 27612 199 2 1 44 -17 320 0.640 16.45 3.05 Intr - 28940 28828 113 2 2 83 83 98 0.817 7.98 3.04 Intr - 30487 30389 99 1 0 105 78 34 0.836 4.41 3.03 Intr - 30742 30595 148 0 1 15 72 179 0.967 9.24 3.02 Intr - 34280 34201 80 2 2 113 100 50 0.978 7.05 3.01 Init - 35412 35365 48 0 0 74 93 56 0.998 5.65 3.00 Prom - 38956 38917 40 -6.26 4.00 Prom + 39204 39243 40 -7.06 4.01 Init + 41236 41319 84 0 0 65 25 120 0.937 2.44 4.02 Intr + 41822 41953 132 1 0 93 30 75 0.793 3.04 4.03 Term + 44510 44644 135 1 0 91 43 98 0.831 3.62 4.04 PlyA + 45173 45178 6 1.05 5.03 PlyA - 45766 45761 6 1.05 5.02 Term - 64752 64249 504 0 0 -22 48 474 0.494 26.84 5.01 Init - 69431 69429 3 2 0 108 81 0 0.100 1.30 5.00 Prom - 89760 89721 40 -0.76 6.00 Prom + 90409 90448 40 -4.56 6.01 Init + 95130 95440 311 2 2 71 89 210 0.966 14.19 6.02 Intr + 95699 95810 112 2 1 131 117 40 0.963 11.58 6.03 Intr + 98238 98274 37 2 1 104 36 15 0.726 -4.26 6.04 Intr + 99978 100336 359 1 2 102 88 173 0.503 13.67 6.05 Intr + 108844 109011 168 0 0 102 84 80 0.615 9.14 6.06 Intr + 111320 111581 262 1 1 80 111 151 0.995 13.66 6.07 Intr + 112700 112806 107 0 2 79 76 90 0.986 6.83 6.08 Intr + 119146 119298 153 0 0 92 92 177 0.996 18.77 6.09 Intr + 120317 120563 247 1 1 20 58 177 0.998 4.93 6.10 Term + 121768 121898 131 2 2 106 49 190 0.949 15.24 6.11 PlyA + 122827 122832 6 1.05 7.00 Prom + 123774 123813 40 -5.56 7.01 Init + 133947 134043 97 2 1 89 28 170 0.778 9.60 7.02 Term + 139703 139851 149 0 2 52 37 74 0.250 -3.24 7.03 PlyA + 141034 141039 6 1.05 8.08 PlyA - 141584 141579 6 1.05 8.07 Term - 155023 154881 143 2 2 77 43 97 0.644 2.19 8.06 Intr - 155451 155310 142 0 1 74 81 61 0.950 3.93 8.05 Intr - 157790 157632 159 2 0 26 49 154 0.431 5.48 8.04 Intr - 158004 157940 65 1 2 61 68 47 0.239 -1.56 8.03 Intr - 176632 176500 133 1 1 123 63 42 0.063 5.42 8.02 Intr - 190812 190711 102 0 0 84 77 28 0.000 1.67 8.01 Intr - 214407 214338 70 2 1 115 83 55 0.040 6.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 50207 50351 145 2 1 66 54 133 0.829 5.08 S.002 Term - 214407 214189 219 0 0 115 55 107 0.927 7.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_1|173_aa MVQALYTHAMELRCDPKGSGETIKGFKQFRFLTVNCGPQIGFEDKIPKRRFSEMQNERRE QAQRTVLIHCPEKISENKFLKYLSQFGPINNHFFYESFGLYAVVEFCQKESIGSLQNGTH TPSTAMETAIPFRSRFFNLKLKNQTSERSRVRSSNQLPRSNKQLFELLCYAES >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_1|519_bp atggtgcaagcactttatacccatgctatggagttgagatgtgatcctaaaggcagtggg gaaaccattaaaggatttaagcagtttcggttcctcacagtcaactgtggtccacaaata ggctttgaagacaagattcccaaaaggagattctctgagatgcaaaatgaaagacgagaa caggcacagcggactgttttaatacattgcccagagaaaatcagtgaaaacaagtttctt aaatatttatcccaatttggacctattaataatcatttcttctatgaaagctttggtctc tatgctgtcgtagaattttgccaaaaggaaagcataggttcactgcagaatgggactcat actccaagcacggccatggagactgcaattccattcagatcacgtttcttcaatctgaag ttgaaaaaccagacttctgaacggtcacgcgtacggtcaagtaatcagttgccacgttca aacaagcagctttttgaattactttgttatgcagaaagt >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_2|118_aa MQRLAQASKAEENASDSFMHFMDSQLERQMETTQNLVESYMAIVNKTLLDLVTKEFIFLE LLDNLYSRGDQNTLMEESAEQAWRCDEMLCMQHVLKEALSIISDISTNTVSMVMGTRG >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_2|357_bp atgcagcgactggctcaggccagcaaggctgaggagaatgcctccgacagcttcatgcac ttcatggactcacagctggagcggcaaatggagaccacccagaacctggtggaatcctac atggccattgtcaacaagaccttgctggacctcgtgaccaaggagttcatcttcttggag ctgctggacaacctgtactcgcgtggggaccagaacacgctgatggaggagtcggcagag caggcatggcggtgtgatgagatgctgtgcatgcaacacgtgctgaaggaggcgctcagc atcatcagtgacatcagcacaaacaccgtcagcatggtcatggggacccgtggatga >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_3|456_aa MLEETQQSKLAVAKKKIQDILKVLVSNLNHSNGVALSHWTNGRGTVWNGPGSPDRNFSLF TIQSSSRCEAVLKRQLWQSIKARAQLEAHVTQMLEQVQLETDEYTQHLKGERARWQQRVW KMSEENNENKSALQLEQQVKELQEKLGKLKETVTSAHPRRAGRVFLQVKLKSQEAQSLQQ QGHQSLGHLQQYVAAYQQLTSEKEAQHRQLLLQTQLMDQLQQQEAWGKVVAFFNSAGASA QEEQSGFMDLLKEKVDLKEWVEKLELRSIHLSGQADTISEKVNHNIRGPEGSAKDAAPGG GGHHQAGPGQRGDEDGALQHLCGGGGGVGVSVGRGTGTSVAAEHPSLQVKLLELQELVLR LAGDHNEGHGKFLAAAQNPADDPAPGAPAPQELGAADKQGGEAREGCPHDNPTAQQLMQL LPVMRDPQEYPGLGSSPCMPFFYQAAKNRELNITII >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_3|1371_bp atgttggaagaaactcaacagagtaaattggctgtggccaagaaaaagattcaggacatt ctgaaggtgctggtgtccaaccttaaccattccaatggggtagcgctctcccattggaca aatggaaggggcactgtgtggaacggcccaggctccccagatcgaaacttctcactcttc accatacagtcctcgagccgctgtgaagcagtcctcaagcggcagttatggcagtccata aaggctcgggcacagctggaagcacacgtgacacagatgttggaacaagtccagctagag acagatgaatatactcaacatctaaaaggagagagggcccggtggcagcagagggtatgg aaaatgtcagaggagaacaacgagaataagagtgcactacagttggagcagcaagtaaag gagctgcaggagaagctgggcaagctgaaggagactgtaacctctgcccatccaagaagg gctgggagggtcttcctgcaggtgaagctgaagagccaagaggctcagagtctgcagcag cagggacaccagtccctgggtcacctgcagcagtacgtggctgcctatcagcagctgacc tctgagaaggaggcgcagcacaggcagttactgctgcagacccagctcatggaccagctg cagcagcaggaagcttggggcaaagtggtggcgtttttcaactccgctggagccagtgcc caggaggagcagagtggctttatggacctcctgaaggagaaggtggacctgaaggagtgg gtggagaaactagagcttcgatccatccacctctcaggacaggcagacaccatcagtgag aaagtaaatcacaacatacgagggccagagggcagcgccaaagacgcggcaccaggagga ggaggacatcatcaggctggcccaggacaaagaggagatgaagatggggcattgcagcac ctctgtgggggtgggggtggggtgggtgtgagcgtgggcaggggcactggcaccagcgtg gcagctgagcacccctcccttcaggtgaaactgctggagctgcaggagctggtgttgcgg cttgcaggcgatcacaacgaggggcatggcaaattcctggccgctgcccagaaccctgct gatgatcctgctccaggggccccagcccctcaggagcttggggctgctgacaagcagggt ggagaggccagggagggttgtccccatgacaaccccactgcacagcagctcatgcagctt cttcctgtgatgcgggacccccaggagtacccaggcttgggcagcagtccctgcatgcca ttcttttaccaggctgccaagaacagggagctaaacatcaccatcatctaa >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_4|116_aa MAAFSRSAGPILSLNPQEDAELQKEVAQKMESISKTNHRMSTKGQVLRKKKKKVLGTPDT LEKTVDSQGPTPLQHIVSDEIHVRVTDLYPAENNNGATGAQPNTQNSRSLLESAYQ >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_4|351_bp atggcggccttttcccgctcggctgggccaatcctgtcgctgaacccgcaggaagatgcc gagttgcagaaggaagtggcacagaaaatggaaagcatttcaaaaacgaatcatcggatg tctacaaaaggccaggttttacgtaagaagaagaaaaaggttttaggcactcctgacact cttgaaaagactgtggatagccagggccccacaccgctgcagcatatcgtgagtgatgag atccatgtgcgggtgactgatctttacccggcagaaaataataatggggccactggagcc cagccgaacacacagaactcaaggagcctcctggagtcagcgtatcagtag >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_5|168_aa MRCVHCAYFQCVQRESKPHMRKMLVYWMLEVCEEQCCEEEQCCKEEVFPLAMNHLHATCP TSPPTRKAQLQLLVAVSMRLASKLRKTGPMTIEKMCIYTDHAVSPCQLRDWEVMVLGKLK WDLAAVIAHDFLALILHRPTGLGQKACPDLFGCLCYRLHLCHVPTIQL >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_5|507_bp atgcgctgcgtgcactgcgcctacttccagtgcgtgcaaagggagagcaagccgcacatg cggaagatgctggtttactggatgctggaggtgtgtgaggagcagtgctgtgaggaggag cagtgctgtaaggaggaagtctttcccctggccatgaaccacctgcatgctacctgtcct acgtccccacccacccgaaaggcacagttgcagctcttggttgcggtctccatgcggctg gcctccaagctgcgtaagactgggcccatgaccattgagaaaatgtgcatctacaccgac cacgctgtctctccctgccagttgcgggactgggaggtgatggtcctggggaagctcaaa tgggacctggccgctgtgattgctcatgacttcttggccctcattctgcaccgaccgaca ggccttggtcaaaaagcatgcccagatctttttggctgtctgtgctacagattacacctt tgccatgtacccaccatccagttgtga >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_6|628_aa MEKSGRRWLASAAPPLGRLRRRESGAEQGGLSVRATRVSLVRSALDCAPRSGVRRPGSCF CRCRRRIPVARRARLPQACSQHRTEPSGGRGWSARPAWERQGRSARAVLVWAVRGEAGTR CHCASRCRRRLDGRTLPARDPMQSSYREEARGIDSPERATVMEYMSTGSDNKEEIDLLIK HLNVSDVIDIMENLYASEEPAVYEPSLMTMCQDSNQNDERSKSLLLSGQEVPWLSSVRYG TVEDLLAFANHISNTAKHFYGQRPQESGILLNMVITPQNGRYQIDSDVLLIPWKLTYRNI GSDFIPRGAFGKVYLAQDIKTKKRMACKLIPVDQFKPSDVEIQACFRHENIAELYGAVLW GETVHLFMEAGEGGSVLEKLESCGPMREFEIIWVTKHVLKGLDFLHSKKVIHHDIKPSNI VFMSTKAVLVDFGLSVQMTEDVYFPKDLRGTEIYMSPEVILCRGHSTKADIYSLGATLIH MQTGTPPWVKRYPRSAYPSYLYIIHKQAPPLEDIADDCSPGMRELIEASLERNPNHRPRA ADLLKHEALNPPREDQPRCQSLDSALLERKRLLSRKELELPENIADSSCTGSTEESEMLK RQRSLYIDLGALAGYFNLVRGPPTLEYG >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_6|1887_bp atggaaaagtcggggaggcggtggctggcgtccgctgcgccgcccctgggcaggctcaga cgccgtgagtcaggggcagagcagggcggtctgagcgtgcgggcgacgcgggtctcactc gtccgctccgctctggactgcgcgccacgctctggggtccggcgccctggttcctgcttc tgccgctgccgccgccggatcccagtggcccggcgtgctcggctcccacaggcctgcagc cagcatcgcaccgaaccttcggggggccgcggctggagcgctcggccggcgtgggagcgc caaggccgcagcgccagggcagtgctggtctgggcagtgcgcggggaagcggggacccgc tgtcactgcgcctcccgctgccgacgccgcctggacggccgcactctccctgcccgagac ccgatgcaatcttcttaccgcgaagaagccaggggaatagactctccagaaagagcaaca gtaatggagtacatgagcactggaagtgacaataaagaagagattgatttattaattaaa catttaaatgtgtctgatgtaatagacattatggaaaatctttatgcaagtgaagagcca gcagtttatgaacccagtctaatgaccatgtgtcaagacagtaatcaaaacgatgagcgt tctaagtctctgctgcttagtggccaagaggtaccatggttgtcatcagtcagatatgga actgtggaggatttgcttgcttttgcaaaccatatatccaacactgcaaagcatttttat ggacaacgaccacaggaatctggaattttattaaacatggtcatcactccccaaaatgga cgttaccaaatagattccgatgttctcctgatcccctggaagctgacttacaggaatatt ggttctgattttattcctcggggcgcctttggaaaggtatacttggcacaagatataaag acgaagaaaagaatggcgtgtaaactgatcccagtagatcaatttaagccatctgatgtg gaaatccaggcttgcttccggcacgagaacatcgcagagctgtatggcgcagtcctgtgg ggtgaaactgtccatctctttatggaagcaggcgagggagggtctgttctggagaaactg gagagctgtggaccaatgagagaatttgaaattatttgggtgacaaagcatgttctcaag ggacttgattttctacactcaaagaaagtgatccatcatgatattaaacctagcaacatt gttttcatgtccacaaaagctgttttggtggattttggcctaagtgttcaaatgaccgaa gatgtctattttcctaaggacctccgaggaacagagatttacatgagcccagaggtcatc ctgtgcaggggccattcaaccaaagcagacatctacagcctgggggccacgctcatccac atgcagacgggcaccccaccctgggtgaagcgctaccctcgctcagcctatccctcctac ctgtacataatccacaagcaagcacctccactggaagacattgcagatgactgcagtcca gggatgagagagctgatagaagcttccctggagagaaaccccaatcaccgcccaagagcc gcagacctactaaaacatgaggccctgaacccgcccagagaggatcagccacgctgtcag agtctggactctgccctcttggagcgcaagaggctgctgagtaggaaggagctggaactt cctgagaacattgctgattcttcgtgcacaggaagcaccgaggaatctgagatgctcaag aggcaacgctctctctacatcgacctcggcgctctggctggctacttcaatcttgttcgg ggaccaccaacgcttgaatatggctga >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_7|81_aa MGAVLELGIPLGGGLGAVSPADDVALRVVMAGGWKEEKLEPAWEGPYLVLLTTKTAVRTA KKDGLITPESRKHHPLQSRGP >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_7|246_bp atgggggctgtgttggagctgggcatccccctgggtggaggcctgggggctgtcagtcca gctgatgatgtggcactgagagtagtgatggccggtgggtggaaagaagaaaaactcgag ccagcctgggaaggaccctaccttgtgctgctaaccaccaagactgctgttcgtacagca aaaaaggatggactcatcacacccgagtcaagaaagcatcaccccctccagagtcgtggg ccatag >gi568815588f:30338939_30560833|GENSCAN_predicted_peptide_8|271_aa XTGVPNPQAIDQYWPIRNWATQQEVPYFLRVEDSGQAHPATGSLREPRRWVGRQEVAVVK CFYPALLLLLEILVLEQHVSPTVLACRVQQYWCSPNPHSLSPCPIFKYHYIGDSVSTCEP LIGWSTEPAKSRTTVGASHPLGIWKLLLTDLYEGWQKSEMQKADSNIPFALDQTYRQRVT QALDSAQSGSVSMTHSVSPSVTLASSKVILAVLGNTCPCGSELTRDHRIYHSNQILLSVK GGAINNYTNIVGVNHDCPKWDLMAIPLFADV >gi568815588f:30338939_30560833|GENSCAN_predicted_CDS_8|816_bp nngacaggggtccccaacccccaggccatagaccagtactggcctatcaggaactgggcc acacagcaggaggttccttattttcttcgggtggaggactcaggccaggcgcacccagcc actggcagcctccgagagccgcggcgctgggtaggaagacaggaagtggcggtggttaaa tgcttctatccggcattgcttttgctactagaaatcctggttctggagcagcatgtctca ccaaccgtcctggcctgcagagtccagcagtactggtgctcccctaacccacattcctta tccccttgccccatcttcaaataccactacattggggattcagtttcaacatgtgaaccg ctcattgggtggagcacagagccagccaagtcccggaccactgtgggagccagtcaccct ctaggcatctggaagctgcttctaactgatctatatgaaggctggcagaagtctgagatg caaaaagctgacagcaacatcccctttgccctggaccaaacttacagacagcgagtgact caagccctggattctgcccagtctggttctgtctccatgacccactctgtgtcgccctct gtgacactagcctcctccaaggtcatccttgctgtgctggggaacacatgtccttgtggt tctgagctcaccagggaccacagaatttaccattcaaaccagatacttctgagcgtgaaa gggggtgctattaataattacaccaacatagtaggtgtaaaccacgactgtcccaaatgg gatctgatggccatcccactgtttgcagacgtatga