GENSCAN 1.0 Date run: 21-Sep-117 Time: 11:33:50 Sequence gi568815594r:15879395_16175906 : 296512 bp : 43.15% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1066 1105 40 -1.36 1.01 Init + 4521 4602 82 2 1 58 65 88 0.273 4.63 1.02 Intr + 13143 13220 78 1 0 40 116 57 0.328 3.22 1.03 Term + 16225 16364 140 2 2 23 44 91 0.235 -3.67 1.04 PlyA + 16703 16708 6 1.05 2.03 PlyA - 16829 16824 6 1.05 2.02 Term - 21888 21689 200 1 2 30 55 195 0.752 7.96 2.01 Init - 22385 22274 112 2 1 42 116 44 0.881 2.98 2.00 Prom - 23945 23906 40 -0.16 3.00 Prom + 24178 24217 40 -3.16 3.01 Init + 26698 26770 73 0 1 42 65 98 0.630 2.13 3.02 Intr + 26809 26873 65 2 2 119 94 20 0.404 4.14 3.03 Term + 33710 33781 72 1 0 61 45 76 0.127 -1.49 3.04 PlyA + 34266 34271 6 1.05 4.03 PlyA - 35340 35335 6 1.05 4.02 Term - 35780 35632 149 0 2 84 36 122 0.528 4.66 4.01 Init - 44683 44596 88 1 1 60 89 77 0.647 5.82 4.00 Prom - 45013 44974 40 -3.56 5.02 PlyA - 45558 45553 6 1.05 5.01 Sngl - 57238 56534 705 1 0 103 39 535 0.949 44.32 5.00 Prom - 57944 57905 40 -9.36 6.00 Prom + 58136 58175 40 -2.26 6.01 Init + 58768 58823 56 0 2 76 90 11 0.398 0.01 6.02 Intr + 60958 61036 79 1 1 120 78 7 0.460 2.45 6.03 Term + 67529 67648 120 1 0 47 48 137 0.599 4.07 6.04 PlyA + 68729 68734 6 1.05 7.02 PlyA - 70549 70544 6 1.05 7.01 Sngl - 83735 83064 672 2 0 90 48 378 0.988 28.18 7.00 Prom - 91202 91163 40 -4.66 8.00 Prom + 91870 91909 40 -7.06 8.01 Init + 92920 92974 55 0 1 22 81 86 0.536 2.75 8.02 Intr + 93577 93675 99 2 0 79 94 53 0.898 5.08 8.03 Term + 97339 97556 218 2 2 102 42 105 0.853 4.61 8.04 PlyA + 98795 98800 6 1.05 9.19 PlyA - 99583 99578 6 1.05 9.18 Term - 101143 101024 120 1 0 92 42 104 0.636 4.67 9.17 Intr - 104961 104869 93 0 0 93 91 91 0.998 10.06 9.16 Intr - 106434 106366 69 0 0 117 84 23 0.947 4.18 9.15 Intr - 106643 106563 81 2 0 107 83 16 0.876 2.83 9.14 Intr - 112997 112854 144 2 0 104 80 24 0.252 3.68 9.13 Intr - 121225 121102 124 0 1 68 66 83 0.065 4.69 9.12 Intr - 127296 127144 153 2 0 108 59 148 0.622 13.09 9.11 Intr - 129714 129555 160 1 1 72 105 84 0.966 7.55 9.10 Intr - 131915 131847 69 0 0 65 100 21 0.494 0.15 9.09 Intr - 133944 133881 64 0 1 64 94 36 0.769 0.09 9.08 Intr - 136846 136772 75 1 0 64 85 74 0.949 4.41 9.07 Intr - 139146 138929 218 1 2 48 99 152 0.911 10.52 9.06 Intr - 144021 143932 90 1 0 85 115 57 0.984 8.07 9.05 Intr - 144964 144901 64 1 1 37 79 59 0.866 -1.81 9.04 Intr - 145918 145798 121 0 1 106 83 76 0.830 9.40 9.03 Intr - 161754 161617 138 2 0 65 38 78 0.059 0.08 9.02 Intr - 169690 169462 229 2 1 90 103 58 0.033 4.43 9.01 Init - 196512 196293 220 0 1 67 115 155 0.963 12.92 9.00 Prom - 200124 200085 40 -5.96 10.00 Prom + 202013 202052 40 -4.46 10.01 Init + 202832 203051 220 1 1 60 86 71 0.341 2.89 10.02 Term + 203791 204017 227 2 2 66 42 125 0.255 2.64 10.03 PlyA + 204942 204947 6 -0.45 11.00 Prom + 207817 207856 40 -4.86 11.01 Init + 210612 210621 10 2 1 113 119 8 0.918 6.78 11.02 Intr + 233763 233842 80 1 2 61 94 36 0.056 0.77 11.03 Intr + 236764 236884 121 0 1 96 121 39 0.963 8.07 11.04 Term + 241380 241573 194 1 2 36 47 111 0.364 -0.62 11.05 PlyA + 245566 245571 6 1.05 12.07 PlyA - 245974 245969 6 1.05 12.06 Term - 255166 254973 194 2 2 106 32 122 0.934 5.98 12.05 Intr - 256586 256477 110 1 2 31 74 75 0.491 0.33 12.04 Intr - 257578 257537 42 0 0 84 98 27 0.268 0.66 12.03 Intr - 280772 280740 33 1 0 156 50 21 0.019 2.44 12.02 Intr - 284143 283945 199 2 1 76 25 83 0.131 -0.79 12.01 Intr - 287399 287239 161 1 2 97 100 140 0.988 15.73 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_1|99_aa MKALSTQAPSTGQNHFMVQDFLPEENAGRHILLVDVMSGGVAATLRRCGDKHEDNSPPLH PSNYCEHHKRREQVFSISTLSRCRGQSLDLTDVQSLLVE >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_1|300_bp atgaaagcactcagcacgcaggctccatcaactggccagaaccacttcatggtccaggat ttcttaccagaagagaatgctggcagacacatactgctagtggatgtgatgtctggaggt gtagcagccaccttgagacgatgtggtgataagcatgaagacaactcacccccactacat ccatctaattactgtgagcaccacaagaggagggagcaggtcttctccatttccacactc tcccggtgccggggccagagcctggacttaacagatgtccaatccctgcttgttgagtga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_2|103_aa MKLEAEYCSVPNKGVHVLIPGTYIYVIFHDQRKFPYQAWATGQNPVSKKRRRSRSRREEE VEVEVEMEEEALAAAEEGEGEGEEEEEKKEEKKEEAADTAWQD >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_2|312_bp atgaagctggaggcagaatactgttcagtccccaacaagggtgtccatgttctaattcct ggaacctacatatatgttatcttccacgaccaaagaaaatttccatatcaagcctgggcg acaggacaaaaccctgtctctaaaaaaagaagaagaagcagaagcagaagagaagaagaa gtggaagtggaagtggaaatggaagaggaagcattagcagcagcagaagaaggagaagga gaaggagaggaagaagaagagaagaaggaggagaagaaggaagaagcagcagacactgca tggcaggattga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_3|69_aa MGRHQAAGGPGRGWPLAWGATALLGVIRQACEHPAAGCQVGDSLCQRVREGMRLAHVHRI HEVIGWNVN >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_3|210_bp atggggcgacatcaggcggcgggcgggcccgggcggggctggcctctggcctggggcgcc actgctctgctgggtgtgattaggcaagcttgcgagcacccagctgctggctgccaagtg ggtgactcactctgccagagggttagagaagggatgcggctggcccacgttcacaggatt catgaggtcattggctggaatgtgaactaa >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_4|78_aa MAKPIQKEEGSAEAVGGGQPSRSEKDEKGEKEPALSCILNNIPLMSGLYFPEDVERLRML YGTLHSKMRYQFLSLHCG >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_4|237_bp atggccaaacccatccagaaggaagaaggctcagctgaggcagtaggtggaggtcagcct tccaggtcggagaaggacgagaaaggtgagaaagaaccagcactgagctgcatcctcaac aacatcccgttaatgagtggtctatattttcctgaagatgtggagagactgagaatgcta tatggcacactgcacagcaaaatgcgttaccagttcctgtcccttcactgtggatag >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_5|234_aa MKICSLTLLSFLLLAAQVLLVEGKKKVKNGLHSKVVSEQKDTLGNTQIKQKSRPGNKGKF VTKDQANCRWAATEQEEGISLKVECTQLDHEFSCVFAGNPTSCLKLKDERVYWKQVARNL RSQKDICRYSKTAVKTRVCRKDFPESSLKLVSSTLFGNTKPRKEKTEMSPREHIKGKETT PSSLAVTQTMATKAPECVEDPDMANQRKTALEFCGETWSSLCTFFLSIVQDTSC >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_5|705_bp atgaagatctgtagcctcaccctgctctccttcctcctactggctgctcaggtgctcctg gtggaggggaaaaaaaaagtgaagaatggacttcacagcaaagtggtctcagaacaaaag gacactctgggcaacacccagattaagcagaaaagcaggcccgggaacaaaggcaagttt gtcaccaaagaccaagccaactgcagatgggctgctactgagcaggaggagggcatctct ctcaaggttgagtgcactcaattggaccatgaattttcctgtgtctttgctggcaatcca acctcatgcctaaagctcaaggatgagagagtctattggaaacaagttgcccggaatctg cgctcacagaaagacatctgtagatattccaagacagctgtgaaaaccagagtgtgcaga aaggattttccagaatccagtcttaagctagtcagctccactctatttgggaacacaaag cccaggaaggagaaaacagagatgtcccccagggagcacatcaaaggcaaagagaccacc ccctctagcctagcagtgacccagaccatggccaccaaagctcccgagtgtgtggaggac ccagatatggcaaaccagaggaagactgccctggagttctgtggagagacttggagctct ctctgcacattcttcctcagcatagtgcaggacacgtcatgctaa >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_6|84_aa MECRPKELQALGGVRSGSRLSSTASEQAVGQSPPWLILPFQPHFQHNDIYQVTENEQSYN PIEIEYAQGPSIGSILSKMPSVAV >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_6|255_bp atggaatgcaggcccaaggaactccaggcccttggtggagtgagaagtggcagcagactc agcagcacagcatcagagcaggctgtggggcagtcacctccttggctcatcctgcccttc caacctcatttccagcacaatgacatctaccaggtgactgagaatgaacaaagctacaac cctatagagatagaatatgcccaaggacccagtattggctccatcctgtctaagatgccc tcagtagctgtgtga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_7|223_aa MKFVPCLLLVTLSCLGTLGQAPRQKQGSTGEEFHFQTGGRDSCTMRPSSLGQGAGEVWLR VDCRNTDQTYWCEYRGQPSMCQAFAADPKPYWNQALQELRRLHHACQGAPVLRPSVCREA GPQAHMQQVTSSLKGSPEPNQQPEAGTPSLRPKATVKLTEATQLGKDSMEELGKAKPTTR PTAKPTQPGPRPGGNEEAKKKAWEHCWKPFQALCAFLISFFRG >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_7|672_bp atgaagttcgtcccctgcctcctgctggtgaccttgtcctgcctggggactttgggtcag gccccgaggcaaaagcaaggaagcactggggaggaattccatttccagactggagggaga gattcctgcactatgcgtcccagcagcttggggcaaggtgctggagaagtctggcttcgt gtcgactgccgcaacacagaccagacctactggtgtgagtacagggggcagcccagcatg tgccaggctttcgctgctgaccccaaaccttactggaatcaagccctgcaggagctgagg cgccttcaccatgcgtgccagggggccccggtgcttaggccatccgtgtgcagggaggct ggaccccaggcccatatgcagcaggtgacttccagcctcaagggcagcccagagcccaac cagcagcctgaggctgggacgccatctctgaggcccaaggccacagtgaaactcacagaa gcaacacagctgggaaaggactcgatggaagagctgggaaaagccaaacccaccacccga cccacagccaaacctacccagcctggacccaggcccggagggaatgaggaagcaaagaag aaggcctgggaacattgttggaaacccttccaggccctgtgcgcctttctcatcagcttc ttccgagggtga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_8|123_aa MFFTVLKATKSKNLVLASEKGRSWPPKNLKHHTREGAPSTLYKTYAARTYTGGLIGCSTF FQAYQCICFSGSRGVVPSRFPFRNDLPPAGGSMVSREPPAVRAYHSDSRAYSSMAVLLQP WEQ >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_8|372_bp atgtttttcacagttctgaaggccacgaagtccaagaacctggtgctggcatctgaaaaa ggaaggtcctggcccccgaagaacctgaaacatcacacaagggaaggggctccaagcacc ctctacaagacctatgcagccagaacctacacaggaggacttattggatgcagcacattc ttccaggcctaccagtgcatctgcttctcagggagccgtggagttgtgccttctagattc ccttttaggaatgacttgcctccagctgggggcagcatggtcagcagagagcctccagct gtcagagcctaccactctgacagcagagcctacagctctatggcagtgctgctccagccc tgggagcagtga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_9|743_aa MALVLGSLLLLGLCGNSFSGGQPSSTDAPKAWNYELPATNYETQDSHKAGPIGILFELVH IFLYVVQPRDFPEGAAAQEPGPWPIALSGSPSPAAPSSGWLHCNGQDIRLQEIPGASLHP VQPEYSLPLKELEMIIFPVTGWAVPHATSSANVNTTKKAINILCFDLMDTLNMSWGFPQP MLGDPLVQTREDLPLSIGIFYGFVANHQVRTRIKRSRKLADSNFKDLRTLLNETPEQIKY ILAQYNTTKDKAFTDLNSINSVLGGGILDRLRPNIIPVLDEIKSMATAIKETKEALENMN STLKSLHQQSTQLSSSLTSVKTSLRSSLNDPLCLVHPSSETCNSIRLSLSQLNSNPELRQ LPPVDAELDNVNNVLRTDLDGLVQQGYQSLNDIPDRVQRQTTTVVAASHVYTDLLSHSGT MTRFVSGKQGIKRVLNSIGSDIDNVTQRLPIQDILSAFSVYVNNTESYIHRNLPTLEEYD SYWWLGGLVICSLLTLIVIFYYLGLLCGVCGYDRHATPTTRGCVSNTGGVFLMVGVGLSF LFCWILMIIVVLTFVFGANVEKLICEPYTSKELFRHTGSISSELESLKVNLNIFLLGAAG RKNLQDFAACGIDRMNYDSYLAQERVTRILASLDFAQNFITNNTSSVIIEETKKYGRTII GYFEHYLQWIEFSISEKVASCKPVATALDTAVDVFLCSYIIDPLNLFWFGIGKATVFLLP ALIFAVKLAKYYRRMDSEDVYDE >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_9|2232_bp atggccctcgtactcggctccctgttgctgctggggctgtgcgggaactccttttcagga gggcagccttcatccacagatgctcctaaggcttggaattatgaattgcctgcaacaaat tatgagacccaagactcccataaagctggacccattggcattctctttgaactagtgcat atctttctctatgtggtacagccgcgtgatttcccagaaggggcggcagcacaggaaccc ggcccatggcccatcgcactgtcggggagccccagcccagccgccccatcatcagggtgg ctgcactgcaatggtcaggatatcaggctccaggagatccctggagcgtcccttcaccct gtgcaacctgagtactcgcttcctctaaaggaactggaaatgataatcttcccagttacg ggctgggctgtaccacatgctacatccagtgcaaatgtcaacacaacaaaaaaagcaatt aacatcttgtgttttgacctcatggacaccctgaacatgtcttggggattcccacagccc atgcttggagacccgctggttcagacccgtgaggatctacctttgagcattggcatcttc tatggttttgtggcaaatcaccaggtaagaacccggatcaaaaggagtcggaaactggca gatagcaatttcaaggacttgcgaactctcttgaatgaaactccagagcaaatcaaatat atattggcccagtacaacactaccaaggacaaggcgttcacagatctgaacagtatcaat tcagtgctaggaggcggaattcttgaccgactgagacccaacatcatccctgttcttgat gagattaagtccatggcaacagcgatcaaggagaccaaagaggcgttggagaacatgaac agcaccttgaagagcttgcaccaacaaagtacacagcttagcagcagtctgaccagcgtg aaaactagcctgcggtcatctctcaatgaccctctgtgcttggtgcatccatcaagtgaa acctgcaacagcatcagattgtctctaagccagctgaatagcaaccctgaactgaggcag cttccacccgtggatgcagaacttgacaacgttaataacgttcttaggacagatttggat ggcctggtccaacagggctatcaatcccttaatgatatacctgacagagtacaacgccaa accacgactgtcgtagcagccagccatgtgtacacagacttgttgtcacactctggcacc atgaccaggtttgtgtcagggaaacaaggtatcaaaagggtcttgaattccattggttca gatatcgacaatgtaactcagcgtcttcctattcaggatatactctcagcattctctgtt tatgttaataacactgaaagttacatccacagaaatttacctacattggaagagtatgat tcatactggtggctgggtggcctggtcatctgctctctgctgaccctcatcgtgattttt tactacctgggcttactgtgtggcgtgtgcggctatgacaggcatgccaccccgaccacc cgaggctgtgtctccaacaccggaggcgtcttcctcatggttggagttggattaagtttc ctcttttgctggatattgatgatcattgtggttcttacctttgtctttggtgcaaatgtg gaaaaactgatctgtgaaccttacacgagcaaggaattattccggcatactggaagcata agcagtgaattggaaagtctgaaggtaaatcttaatatctttctgttgggtgcagcagga agaaaaaaccttcaggattttgctgcttgtggaatagacagaatgaattatgacagctac ttggctcaggagagagtaactaggattctagcttctctggattttgctcagaacttcatc acaaacaatacttcctctgttattattgaggaaactaagaagtatgggagaacaataata ggatattttgaacattatctgcagtggatcgagttctctatcagtgagaaagtggcatcg tgcaaacctgtggccaccgctctagatactgctgttgatgtctttctgtgtagctacatt atcgaccccttgaatttgttttggtttggcataggaaaagctactgtatttttacttccg gctctaatttttgcggtaaaactggctaagtactatcgtcgaatggattcggaggacgtg tacgatgagtaa >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_10|148_aa MSNITQVESYEVDDIHIFHPKNRQFLMIQPNISQRLFVSHESRGALKNYDRALAFLTKNS CRPQELQQTFLQKEFPLPQTRSQDPGESHLGHRSPRERVRSLWGPGPVRTLTSDSRSGTT AAGYPHPRPHPQWMERRPKRGVAGHHSD >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_10|447_bp atgagtaacatcacacaagttgaatcatatgaggttgacgatattcacatcttccatcca aaaaaccgacaatttcttatgattcaacccaacatatcgcagcggttgttcgtgtcgcat gagtctcggggggcactcaagaactacgatcgtgcacttgctttcttaacaaagaacagc tgccgccctcaggaactccagcaaacctttctccagaaagagtttcctttaccgcagact cgttcccaagaccctggcgagtctcacctcggtcaccgttctccccgagagcgagtccga agtctctggggccccggtccagttcgcacactcacctccgactcccgctccggaaccacg gccgccggctacccccacccccgcccccatccccagtggatggaaagaagacccaagcgg ggcgtcgcgggccaccactcagactaa >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_11|134_aa MGPVFHLTLMNHTMMEASSHLPDEKGEVLEAFLDCLPLADRAAQLSFGALTPVITYTHLP NSFLNACPLEAFSRAVATASITPITWHPSSLCLGTPGVQIHEKGAHREESSNFYQPKAHL QATSCPSILPFLPD >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_11|405_bp atgggccctgtatttcatctaaccctcatgaaccacacaatgatggaggcatcgtcccat ttgccagatgagaaaggtgaggttctagaggcctttctcgactgcctgccactcgctgac cgagcagctcagctttcctttggtgcactcacacctgtcatcacttacacgcatttgccc aattctttcctgaatgcttgcccattagaagccttctccagggccgtggccaccgcctcc atcacccccatcacctggcacccctcctctctatgcctgggaactcctggggtccagatc cacgagaaaggagcccacagagaagaaagcagcaacttctaccagcctaaggcccacctc caagctacttcctgtcccagcattctacccttcctgcctgattga >gi568815594r:15879395_16175906|GENSCAN_predicted_peptide_12|246_aa XLISLKVLNSIVLLGKSCQYVKEAKMEEKLSNPPATCTPGKPSSKSQNKCKPSQGLSTEE NLSASITKQPIHQKENIIPLLVTSNSDQFLTTPDGDEKDITQDNSELKHRSSKKDLLEID RWNPEKVVTLSQAPGSRWEQLAVGLGKNSGHQGSGELHWLAILVLLHISIANIVLPHIIV ERDMLDFSCAFKQYFSAEASKVAVLTSCGCTTWISQQVHTYYHQQGHIYLPHGPPDSTYV TISLPS >gi568815594r:15879395_16175906|GENSCAN_predicted_CDS_12|741_bp nngttgatatccctgaaagtacttaatagcatcgtgctgttggggaaatcgtgccagtat gtgaaggaagccaaaatggaagagaagctgtcgaatcctcccgcaacctgcactccaggc aagccgtccagtaaatcacagaacaaatgtaaaccctctcaaggcctttccacagaagaa aacctgtctgcctccatcaccaaacaacctattcatcaaaaggaaaatatcataccatta cttgtgacaagcaattctgatcagtttttgacaactccagatggtgacgagaaggacata acgcaggacaattctgaattaaaacacagatcctcaaagaaagatttgttagagatagac aggtggaacccagagaaggtagtgaccttgtcccaggctcctggcagcagatgggagcag ctggcggtagggctgggcaaaaactctggacaccaaggctctggtgagcttcactggttg gcaatacttgtgttgctacacatcagcattgccaacattgtgttaccacacatcattgtt gaaagagacatgctggatttctcttgtgccttcaagcagtacttttcagctgaggcttct aaggtggctgttctgaccagctgtggctgcaccacgtggatatcgcagcaagtccacacc tactatcatcagcagggccacatctatctaccccatggacctcctgattctacctatgtg accatcagtcttccatcttaa