GENSCAN 1.0    Date run: 16-Jul-119    Time: 15:40:35

Sequence gi568815589f:121657023_121882401 : 225379 bp : 50.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  17207  17352  146  2  2  108   44    68 0.251   4.33
 1.02 Intr +  21656  21759  104  0  2  111   45    65 0.466   4.39
 1.03 Intr +  24486  24581   96  2  0   58   94    23 0.220   0.11
 1.04 Intr +  42303  42436  134  2  2  109   83   278 0.879  28.84
 1.05 Intr +  44825  44933  109  2  1   31  110    21 0.129  -1.11
 1.06 Intr +  52227  52365  139  2  1   57   86    88 0.499   5.54
 1.07 Intr +  53135  53158   24  0  0  109   83    16 0.628   1.20
 1.08 Term +  53639  53883  245  0  2   60   45   190 0.902   7.96
 1.09 PlyA +  54986  54991    6                               1.05

 2.08 PlyA -  58741  58736    6                               1.05
 2.07 Term -  61614  61487  128  1  2   97   44    51 0.030   0.04
 2.06 Intr -  68771  68634  138  0  0   94   40    39 0.136   0.14
 2.05 Intr -  72509  72436   74  1  2    7   94    92 0.070   0.75
 2.04 Intr -  79897  79811   87  0  0  111  119    -1 0.829   4.39
 2.03 Intr -  80311  80173  139  2  1   77   23    89 0.814   0.82
 2.02 Intr -  80707  80590  118  1  1  101   55    85 0.875   6.54
 2.01 Init -  84316  84188  129  1  0   80   42    14 0.191  -3.75
 2.00 Prom -  84656  84617   40                               0.74

 3.00 Prom +  86711  86750   40                              -8.06
 3.01 Init +  94873  94960   88  0  1   94  103    30 0.640   3.94
 3.02 Intr +  97447  97540   94  2  1  118   96    -3 0.722   2.52
 3.03 Intr +  99991 100144  154  1  1   92   90   189 0.988  19.57
 3.04 Intr + 101876 101974   99  1  0  153  121    56 0.999  15.71
 3.05 Intr + 102863 103417  555  1  0   83  100  1261 0.961 119.84
 3.06 Intr + 106483 106627  145  0  1  125   63   364 0.987  37.46
 3.07 Intr + 106713 106857  145  1  1  114   65   236 0.924  23.24
 3.08 Intr + 109472 109708  237  2  0   87   89   394 0.985  36.13
 3.09 Intr + 111410 111611  202  2  1   89   51   407 0.721  36.39
 3.10 Intr + 113524 113702  179  0  2   98   94   149 0.996  15.22
 3.11 Intr + 115585 116473  889  1  1  108   82   731 0.971  65.82
 3.12 Intr + 117238 117390  153  0  0   97   93   139 0.645  15.57
 3.13 Intr + 119176 119369  194  0  2   68   75   496 0.977  44.59
 3.14 Intr + 124442 124529   88  2  1  110   75   197 0.531  20.57
 3.15 Intr + 125249 125379  131  1  2   97   77   136 0.042  12.89
 3.16 Intr + 129596 129669   74  2  2   73   18   109 0.078   1.45
 3.17 Intr + 133529 133576   48  0  0   22  119    52 0.403   0.35
 3.18 Intr + 134240 134436  197  0  2  112   36    70 0.808   3.33
 3.19 Intr + 134955 135191  237  2  0  109   98    63 0.507   7.21
 3.20 Intr + 135788 135851   64  1  1  -10  110   123 0.384   2.99
 3.21 Intr + 137660 137808  149  0  2   54   70    40 0.138  -1.25
 3.22 Term + 146139 146249  111  2  0   38   42    86 0.082  -2.44
 3.23 PlyA + 146321 146326    6                               1.05

 4.05 PlyA - 147694 147689    6                               1.05
 4.04 Term - 154719 154513  207  0  0   37   55   142 0.292   3.14
 4.03 Intr - 165857 165718  140  0  2  107   70   163 0.573  16.58
 4.02 Intr - 181394 181302   93  0  0   67   75    59 0.835   2.44
 4.01 Init - 181963 181873   91  1  1   70   86   142 0.990  10.85
 4.00 Prom - 193144 193105   40                              -4.76

 5.05 PlyA - 193228 193223    6                               1.05
 5.04 Term - 194577 194492   86  1  2   73   42    90 0.411   0.72
 5.03 Intr - 203421 203315  107  2  2   76   68   215 0.842  18.16
 5.02 Intr - 213726 213475  252  2  0   71   86   328 0.991  27.45
 5.01 Init - 214672 214632   41  1  2   78  108    -6 0.933   0.20
 5.00 Prom - 217008 216969   40                              -3.46

 6.03 PlyA - 217355 217350    6                               1.05
 6.02 Term - 221797 221688  110  2  2   61   49    77 0.863  -0.23
 6.01 Init - 224597 224531   67  2  1   76  111    69 0.992   9.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_1|332_aa XVPLWPNHMEVEGLEAQIIKSIQASALEQKAVHRDVENGTRGHTGDTWEESPQERPGSRR SLPGSLSEKSPSMEPSAATPFRVTGLQDGERVQGGRFAHFSWQPSRGGTISVPISQGFLS RRLKGSIKRTKSQPKLDRNHSFRHILPGFRSAAAAAADNERLGGQDGLSLAWFSSKEGLC PGSMGPPLAAMPLAGERVMKPLELLENGAIVAGRGTEMPFILLCSWQAASDAVIAPPTAP PTAREEFEVPRGQKNIEVTSITTQEGEGKQGDDDGGQGLPHQPLLRICGKMGEAGAEIPS HPRQYYSRLTKGVAYCAAGHTALVSPPSVDWM >gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_1|999_bp nnggttcctctttggccaaaccacatggaagttgagggtctagaagctcagatcataaag tctatacaggccagtgccctggagcagaaggcagtgcacagagacgtggagaatggaact aggggccacacaggagatacctgggaagagtcgcctcaagaaaggccgggctctcggcgc agcctgcctggcagcctttccgagaagagccccagcatggagccctcggccgccacgccg ttccgggtcacgggtctgcaggatggagagcgtgttcaaggtggtcgttttgcccacttc tcatggcaaccctctaggggaggcaccattagtgtgcccatttcacagggcttcctcagc cgccgcctcaagggctccatcaagcgcaccaagagccagcccaagctggaccgcaaccac agcttccgccacatcctgccggggttccggagcgccgccgccgccgccgcggacaatgag aggctgggagggcaggatgggctgtctttggcttggtttagcagcaaggaagggttgtgt cctggctccatgggacccccactggctgccatgcctctggctggggagagggtcatgaag cccctggagctgctggagaatggagccatcgtagcggggcgggggacggagatgcctttc atcctgctctgttcatggcaagctgcttctgatgctgtgatagccccacccactgcccca cccactgcccgtgaggaatttgaggttccaagaggtcagaagaacatagaggtgacttcc atcactacccaggagggtgaaggcaaacagggtgatgatgatggtggccagggactcccc caccagcctctgctcaggatctgtgggaagatgggtgaagctggcgctgaaatccccagc caccccaggcagtactacagcagactcaccaagggagttgcctattgtgctgcaggccac actgccctggtgtcaccaccttctgtggactggatgtga >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_2|270_aa MACDPAELTLPMGSSASAIGPPYTPVAETASPKATGFAASIPKDFKQTFVVGAQASLLTW FWALATPEALEPDLASEDGQVSRNKWKITVFHAGNHMAQPPGPRRRETLLKANYRFQPWQ VALWGLWAGTTLKSSCLFLSFTYHENVRSTRTGTCTPRLDSFKTSEENMEELLPSIPYCR KAGPTHSRPVPSDTQLPGMPKSPFPPSREPSPSPCGPTTNSVEPSLSAELAKFSKVTADK RQSWNFNPAFACSVLWVKRGDQGLVAQGSG 
>gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_2|813_bp atggcatgtgatccggctgagctcaccttacctatggggagttctgcatcagccataggg ccaccctacacgcctgttgcagaaacagcctccccaaaagctacaggctttgccgcaagc atccccaaggacttcaaacagacctttgtggtgggagcgcaggcctctctgctgacctgg ttctgggcacttgcaacccccgaggctctggagccagatctggcttccgaagatggccag gtctccagaaacaagtggaaaattaccgtcttccatgcaggtaaccacatggcacaacct ccaggtccaagaagacgagaaacactcctgaaagcaaactatcgcttccagccatggcaa gtcgccctctggggcctctgggcaggaactaccctgaagtcatcttgtttgtttctgtct tttacctatcatgaaaatgtccggtccacgagaacagggacctgcacgccccggctggac tccttcaagacttcagaggagaacatggaggagctgctgccctccatcccttactgcagg aaggcaggcccaacacatagccgccccgtgcccagtgatacccaactacctggcatgccc aaaagccctttccctccctcaagggaaccttcaccttcaccctgtggcccaaccaccaac tcagtggagccttccttgagcgctgagttagctaagttctctaaagtcacagctgacaag cggcagagctggaatttcaacccagcatttgcctgcagtgtcctgtgggtcaagagagga gaccagggcctggtggctcaggggagtgggtga >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_3|1410_aa MALAVPCVTWMSWCHLLPGSSKDSMPWRTESLGCRDGCSLCFTSHPVEKLPGKRGGSWLP RSHLMPRLKESRSHESLLSPSSAVEALDLSMEEEVVIKPVHSSILGQDYCFEVTTSSGSK CFSCRSAAERDKWMENLRRAVHPNKDNSRRVEHILKLWVIEAKDLPAKKKYLCELCLDDV LYARTTGKLKTDNVFWGEHFEFHNLPPLRTVTVHLYRETDKKKKKERNSYLGLVSLPAAS VAGRQFVEKWYPVVTPNPKGGKGPGPMIRIKARYQTITILPMEMYKEFAEHITNHYLGLC AALEPILSAKTKEEMASALVHILQSTGKVKDFLTDLMMSEVDRCGDNEHLIFRENTLATK AIEEYLKLVGQKYLQDALGEFIKALYESDENCEVDPSKCSAADLPEHQGNLKMCCELAFC KIINSYCVFPRELKEVFASWRQECSSRGRPDISERLISASLFLRFLCPAIMSPSLFNLLQ EYPDDRTARTLTLIAKVTQNLANFAKFGSKEEYMSFMNQFLEHEWTNMQRFLLEISNPET LSNTAGFEGYIDLGRELSSLHSLLWEAVSQLEQSIVSKLGPLPRILRDVHTALSTPGSGQ LPGTNDLASTPGSGSSSISAGLQKMVIENDLSGLIDFTRLPSPTPENKDLFFVTRSSGVQ PSPARSSSYSEANEPDLQMANGGKSLSMVDLQDARTLDGEAGSPAGPDVLPTDGQAAAAQ LVAGWPARATPVNLAGLATVRRAGQTPTTPGTSEGAPGRPQLLAPLSFQNPVYQMAAGLP LSPRGLGDSGSEGHSSLSSHSNSEELAAAAKLGSFSTAAEELARRPGELARRQMSLTEKG GQPTVPRQNSAGPQRRIDQPPPPPPPPPPAPRGRTPPNLLSTLQYPRPSSGTLASASPDW VGPSTRLRQQSSSSKGDSPELKPRAVHKQGPSPVSPNALDRTAAWLLTMNAQLLEDEGLG PDPPHRDRLRSKDELSQAEKDLAVLQDKLRISTKKLEEYETLFKCQEETTQKLVLEYQAR 
LEEGEERLRRQQEDKDIQMKGIISRLMSVEEELKKDHAEMQAAVDSKQKIIDAQVYTALR SLSHDPRSHPHCPQEKRIASLDAANARLMSALTQLKERKVSSVLLTAGTAAKPSLLILHI STGPQTDQPKKHLTNFKSACVLKNLKPLQLTPDLKPKRLIFFCNTAWPQYKLDNGSKWPE NGTFDFSILQDLNNFCRKMGKWSEMHRKEFFSLAQSHADNRRLHEPDLQEGIRAVPREDP QWNYQANSPGIAKQDYMVSCVVEGLKKAAYKAINYDKLKELPKIASEAPWTITDAELRVT LTVEDIIPQFGLPTSTQQSDNGPAFTSQITQAVPQALGIQWNLHIPYHPQSSGKECAARR KERAENKNLEGKNVIADGGQQNVCAEKSNQ >gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_3|4233_bp atggccttggctgtcccgtgtgtgacctggatgtcatggtgccacctccttcctgggagc agtaaggattccatgccatggaggacagagagcttgggctgcagggatggatgcagcctc tgctttacatcccaccccgtggagaagctcccagggaagcggggagggtcatggctgccc aggtcccatctgatgccgaggctgaaggagtctcgctcccacgagtccctgctcagcccc agcagtgcggtggaggcgctggacctcagcatggaggaagaggtggtcatcaagcccgtg cacagcagcatccttggccaggactactgcttcgaggtgacgacgtcatcaggaagcaag tgcttttcctgccggtctgcagctgagcgggataagtggatggagaacctccggcgagcg gtgcatcccaacaaggacaacagccggcgtgtggagcacatcctgaagctgtgggtgatc gaggccaaggacctgccagccaagaagaagtacctgtgcgagctgtgcctggacgatgtg ctctatgcccgcaccacgggcaagctcaagacggacaatgttttctggggcgagcacttc gagttccacaacttgccgcctctgcgcacggtcactgtccacctgtaccgggagaccgac aagaagaagaagaaggagcgcaacagttacctgggcctggtgagcctacctgctgcctcg gtggccgggcggcagttcgtggagaagtggtacccggtggtgacgcccaaccccaagggc ggcaagggccctggacccatgatccgcatcaaggcgcgctaccaaaccatcaccatcctg cccatggagatgtacaaagagttcgctgagcacatcaccaaccactacctggggctgtgt gcagccctcgagcccatcctcagtgccaagaccaaggaggagatggcatctgccctggtg cacatcctgcagagcacgggcaaggtgaaggacttcctgacagacctgatgatgtcagag gtggaccgctgcggggacaacgagcacctcatcttccgggagaacacactggccaccaag gccattgaggagtacctcaagctagtgggccagaagtacctgcaggacgccctaggtgag ttcatcaaagcgctgtatgagtcagatgagaactgcgaagtggatcccagcaagtgctcg gccgctgacctcccagagcaccagggcaacctcaagatgtgctgcgagctggccttctgc aagatcatcaactcctactgtgtcttcccacgggagttgaaagaggtgtttgcctcgtgg aggcaggagtgcagcagtcgcggccgcccggacatcagtgagcggctcatcagcgcctcc ctcttcctgcgcttcctctgcccagccatcatgtcgccctcactcttcaacctgctgcag gagtaccctgatgaccgcactgcccgcaccctcaccctcatcgccaaggtcacccagaac 
ctggccaactttgccaaatttggcagcaaggaggaatacatgtccttcatgaaccagttc ctagagcatgagtggaccaacatgcagcgcttcctgctggagatctccaaccccgagacc ctctccaatacagccggcttcgagggctacatcgacctgggccgcgagctctccagcctg cactcactgctctgggaggccgtcagccagctggagcagagcatagtatccaaactggga cccctgcctcggatcctgagggacgtccacacagcactgagcaccccaggtagcgggcag ctcccagggaccaatgacctggcctccacaccgggctctggcagcagcagcatctcagct gggctgcagaagatggtgattgagaacgatctttccggtctgatagatttcacccggtta ccgtctccaacccccgaaaacaaggacttgttttttgtcacaaggtcctccggggtccag ccctcacctgcccgcagctcgagttactcggaagccaacgagcctgatcttcagatggcc aacggtggcaagagcctctccatggtggacctccaggacgcccgcacgctggatggggag gcaggctccccggcgggccccgacgtcctccccacagatgggcaggccgctgcagctcag ctggtggccgggtggccggcccgggcaaccccagtgaacctggcagggctggccacggtg cggcgggcaggccagacaccaaccacaccaggcacctccgagggcgcgccaggccggccc cagctgttggcaccgctctccttccagaaccctgtgtaccagatggcggctggcctgccg ctgtcaccccgtggccttggcgactcaggctctgagggccacagctccctgagctcacac agcaacagcgaggagttggcggctgctgccaagctgggaagtttcagcactgccgcggag gagctggctcggcggcccggtgagctggcacggcgacagatgtcactgactgaaaaaggc gggcagcccacggtgccacggcagaacagtgctggcccccagaggaggatcgaccagcct ccgcccccacccccgccgccacctcctgccccccgcggccggacgccccccaacctgctg agcaccctgcagtacccaagaccctcaagcggaaccctggcgtcggcctcacctgattgg gtgggccccagtacccgcctgaggcagcagtcctcttcctccaagggggacagcccagaa ctgaagccacgggcagtgcacaagcagggcccttcacctgtgagccccaatgccctggac cgcacagccgcttggctcttgaccatgaacgcgcagttgttagaagacgagggcctgggc ccagaccccccccacagggataggctaaggagtaaggacgagctcagccaagcagaaaag gacctggcggtgctgcaggacaagctgcgaatctccaccaagaagctggaggagtatgag accctgttcaagtgccaggaggagacgacgcagaagctggtgctggagtaccaggcacgg ctggaggagggcgaggagcggctgcggcggcagcaggaggacaaggacatccagatgaag ggcatcatcagcaggttgatgtccgtggaggaagaactgaagaaggaccacgcagagatg caagcggctgtggactccaaacagaagatcattgatgcccaggtatacacagccctaagg agcctgtcccatgacccccgctcacatccccattgtccacaggagaagcgcattgcctcg ttggatgccgccaatgcccgcctcatgagtgccctgacccagctgaaagagaggaaagtg tcatccgtgctgctcacagcgggcacagctgccaagccttccctgctcatcctccacatc 
agcactggtcctcagaccgaccagcccaagaaacatctcaccaatttcaaatccgcctgt gtcctcaagaacttaaaacctcttcaactcacacctgacctaaaacctaaacgccttatt ttcttctgcaacaccgcttggccccaatacaaactcgacaatggctctaaatggccagaa aacggcactttcgatttctccatcctacaagacctaaataatttttgtcgaaaaatgggc aaatggtctgagatgcacaggaaagagtttttttctctagcccaatctcatgctgataac cgccggcttcatgagccagacctccaggaaggcattagagcagttccccgagaggatccc caatggaactaccaggcaaattccccaggtatagctaagcaagattacatggtttcctgc gtagttgaagggcttaaaaaagcagcttacaaagctattaattatgacaaacttaaagaa ctacccaagatcgcctcggaagccccctggaccatcacggacgccgagcttcgggtaact ctgacagtggaggacataattcctcagtttggccttcccacctctacacagcagtctgat aacggaccagcctttactagccaaatcacccaagcagttcctcaggctcttggtattcag tggaaccttcatattccttaccatcctcaatcttcaggaaaggagtgtgctgctaggaga aaggaacgggctgaaaacaagaacttggaaggaaagaatgtcattgctgatggtggacaa cagaatgtgtgtgctgaaaagtcaaatcagtga >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_4|176_aa MKARRVTGAGAWVTAAAGEAGLDPSEQGKKGHSGCCIERGNSGNRETSEEAVGASRDDSC SGMCLQAFVEAFFFLAQRKFKMLPLHEQVASLIDLCEYHLSLLDEKRLQGDSRRDIESNT EITEMASISGVNGPGNERGEACGEKGDRVLRTSFVPGASVTAAIMERQVALPPLYS >gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_4|531_bp atgaaggcccgcagggtgacgggagcaggggcgtgggttacagcggcagctggagaggcg gggctcgatccctcagagcagggcaagaaaggtcattctggatgctgtattgaaaggggc aatagtggaaacagggagaccagcgaggaggctgtgggggcttcaagggacgacagctgc tcagggatgtgtctgcaggccttcgtagaagctttctttttcctggctcagaggaagttc aagatgctgccacttcatgagcaggtggcctcactgattgacctttgcgagtaccacctg tccctgctggatgaaaaacgcctgcaaggtgacagcagaagggacatagagagcaacact gagatcactgagatggcctccatctctggggtaaatgggccaggcaatgagcgaggggag gcttgtggggaaaaaggtgaccgtgtgctgaggaccagcttcgtcccaggcgccagcgtc accgcagccatcatggagaggcaggtggcattgccccccttatacagctga >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_5|161_aa MSSTDLGVLSGFSKSQQLEKPFAGKEDALDGELTSAPDCNANPEAHLPSICLKQVFPKYA KQFNYLRLVDRMANLFIRFLGIKGTMKLGPTGFRTFIRSCKLSSSSLSMAAVDILYIDIT RRWNSMTLDQRDSAGSLLMQLPVIDANTSYIYQAPNTGQVL >gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_5|486_bp 
atgagcagtacagacttgggagtcctcagtggttttagcaagtctcagcagcttgaaaaa ccattcgctggaaaggaagatgctttggacggcgagctgaccagtgctccagactgcaac gccaaccccgaagcccacctgccttccatttgcctcaagcaggtgttccccaagtacgca aaacagttcaactacctgcgcctggtggacaggatggcaaatttgtttatccggttcctg ggcatcaaggggacaatgaagttggggccaacaggctttcgtaccttcataaggagctgc aaactcagcagcagcagcctgtccatggctgccgtggacatcctctacattgacatcaca cggaggtggaactccatgaccctggaccagcgggactcagcaggctctctcctaatgcaa ctgccggtgattgacgccaacaccagctacatttatcaagcacctaacacaggccaggtg ctgtaa >gi568815589f:121657023_121882401|GENSCAN_predicted_peptide_6|58_aa MLSEAGFVFNTCTAGIDRCASAGPVGKRDNGEDTRSMLRAGRAENPVGKEAYWEVFVK >gi568815589f:121657023_121882401|GENSCAN_predicted_CDS_6|177_bp atgctgagtgaagctggttttgtgttcaacacctgtaccgcgggaattgaccgatgtgcc agcgcaggcccagtgggcaagagggacaatggcgaagacacgcgctccatgctcagggcc ggcagggcagagaaccctgttggaaaagaagcttattgggaggtctttgtaaaatga