GENSCAN 1.0	Date run:  7-Nov-116	Time: 03:50:11

Sequence gi568815595f:151780558_151981545 : 200988 bp : 37.24% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    681    676    6                               1.05
 1.03 Term -   4757   4641  117  2  0   79   43    60 0.865  -1.84
 1.02 Intr -   7397   7263  135  2  0   77   99    83 0.813   8.14
 1.01 Init -   9426   9370   57  0  0   69   81    51 0.901   3.86
 1.00 Prom -  10231  10192   40                              -8.25

 2.00 Prom +  11354  11393   40                              -5.15
 2.01 Init +  16575  16622   48  2  0   95  123    22 0.571   7.40
 2.02 Intr +  25686  25763   78  2  0   83   84    65 0.216   4.43
 2.03 Intr +  26570  27416  847  1  1   59   38   520 0.150  34.11
 2.04 Intr +  29606  30615 1010  0  2   56   99   275 0.130  14.90
 2.05 Intr +  33582  33743  162  2  0   69  111   113 0.977  10.95
 2.06 Intr +  36860  37031  172  1  1   61  119    92 0.636   8.19
 2.07 Intr +  39826  39895   70  2  1   94   71   105 0.856   6.72
 2.08 Intr +  44106  44277  172  0  1   91   91    57 0.825   5.32
 2.09 Term +  47019  47615  597  2  0   63   39   299 0.834  16.04
 2.10 PlyA +  47645  47650    6                               1.05

 3.04 PlyA -  47875  47870    6                               1.05
 3.03 Term -  53856  53838   19  2  1  111   45    19 0.005  -3.29
 3.02 Intr -  66031  65909  123  0  0  106   87    65 0.377   6.98
 3.01 Init -  67138  66978  161  1  2   45   34   152 0.436   4.85
 3.00 Prom -  73377  73338   40                              -4.95

 4.00 Prom +  77948  77987   40                              -0.75
 4.01 Init +  93064  93149   86  0  2   73  103    85 0.347   8.84
 4.02 Intr +  99295  99370   76  1  1  107    2    42 0.109  -3.90
 4.03 Term + 100002 100991  990  2  0  112   55   341 0.992  24.28
 4.04 PlyA + 101596 101601    6                               1.05

 5.04 PlyA - 101814 101809    6                               1.05
 5.03 Term - 104848 104741  108  1  0   79   47    77 0.321   0.23
 5.02 Intr - 137184 137058  127  2  1   74   88     0 0.091  -1.64
 5.01 Init - 138657 138392  266  0  2   60   64   179 0.260   9.23
 5.00 Prom - 139772 139733   40                              -4.85

 6.05 PlyA - 139794 139789    6                               1.05
 6.04 Term - 140648 140473  176  0  2   60   48   171 0.779   7.24
 6.03 Intr - 148067 147824  244  2  1   41  110   133 0.281   6.95
 6.02 Intr - 148923 148711  213  0  0   37  101    92 0.227   3.39
 6.01 Init - 156772 156710   63  1  0   75   77    12 0.125   0.00
 6.00 Prom - 161865 161826   40                              -4.55

 7.02 PlyA - 161991 161986    6                               1.05
 7.01 Sngl - 164454 164248  207  0  0   83   50   156 0.854   6.24
 7.00 Prom - 178019 177980   40                              -5.55

 8.02 PlyA - 178712 178707    6                               1.05
 8.01 Term - 184964 184774  191  0  2  111   45   101 0.144   4.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_1|102_aa
MGTREKDGKGEGVETFRDKAMRLNGISENKLQIEKRPEFQDQKHPTQNRSENNRYSNMRA
SDSELSRKIKKAWFPVAVRKFYAGPQLPSSMQLQNDQTLHTI

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_1|309_bp
atgggaaccagggaaaaagatggtaaaggggagggagtggagacctttagagacaaggcc
atgagactgaatggcatctcagagaataaactgcagatagagaagaggcctgagttccaa
gatcagaagcaccctactcaaaatagatcagaaaacaatagatatagtaacatgagagcc
agtgactcagagctgagcaggaaaataaagaaagcttggtttccagtggctgtgaggaaa
ttctatgctggccctcagctgccttcctcaatgcagctccaaaacgatcaaaccttacat
accatatga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_2|1051_aa
MENTAAKCCYGKSSYSNVLETQRISYSFPLLADTSNTRATSGHAQQPAPILPLREVAGAE
DIIRVHVPFSLSDLSQIAKRLGSFSSDPDTYIKEFKYLTQSYELTWHDLYIILSSTLLPE
KKERVWLAAQAHANDLHRQDPTKPIGAAAVPLEEPPWKYQPTDPGRASRNHMITCLIAGL
NKAAHKAVNFEKLKEISQRADENPAEFLSRFTEALQKYTRVDPTSREETIVLNNHFISQS
APNIQHKLKKAEDGPQTPQQDLLNLTFKVFNNREEQIKLDKAQRDCAKYQLLAVAIHQPS
HSTQGHKKPNGSNPPGPCFKCSKEVTYLGVQLSPGAQAMTPAQATLINSLPLPSSKNEIL
SFLRLEGFFRIWIPNFALLAQPLYEAAKGPLNEPLSPIHNILPSFCKLQTALITAPALSL
PDLSQPFVLYTTKNQGIALGVLGQQKGNPPSFDPVAYLCKQLDNTVKGQPTCLKASSAVA
VLPLESKKLTFGQSTTIHSPHNLQDLLSSWALSSLSPSQIQSLYALFIKNPEFSLAKSAP
LNLASLLPISSSPPTHSCTDILDHLQPQFPNISSKPLTNPDDQLFIDDSSSRAPGSPKIV
GYAVVTLNHVIEAKPLPPETSSQKAELSSHKSPNPLQGQTGQHIHRLQVCLPHSSFSCRH
LRPRSGTFTMGRKSLYLLIVGILIAYYIYTPLPDNVEEPWRMMWINAHLKTIQNLVVGSF
DEVPPTSDENVTVTETKFNNILVRVYVPKRKSEALRRGLFYIHGGGWCVGSAALSGYDLL
SRWTADRLDAVVVSTNYRLAPKYHFPIQFEDVYNALRWFLRKKVLAKYGVNPERIGISGD
SAGGNLAAAVTQQLLDDPDVKIKLKIQSLIYPALQPLDVDLPSYQENSNFLFLSKSLMVR
FWSEYFTTDRSLEKAMLSRQHVPVESSHLFKFVNWSSLLPERFIKGHVYNNPNYGSSELA
KKYPGFLDVRAAPLLADDNKLRGLPLTYVITCQYDLLRDDGLMYVTRLRNTGVQVTHNHV
EDGFHGAFSFLGLKISHRLINQYIEWLKENL

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_2|3156_bp
atggaaaatactgctgctaaatgctgttatggaaagtcgtcatattcgaacgtgctggag
acccagaggatttcctactcctttccattgctggcagacacatccaacaccagggccacc
tcagggcatgcccaacaaccagcccccatacttcccctccgagaggtggctggagccgaa
gacatcattcgagtccacgttcccttctccctctctgacctctcccaaattgcaaaacgt
ctcgggtcgttttcctctgatcccgacacttatatcaaagaatttaagtaccttacccaa
tcttatgaactcacttggcatgatctctacattatcctctcttctaccctccttccagaa
aagaaggaaagagtgtggcttgcagcacaggcacatgccaacgatcttcatcggcaagac
cctactaagcccataggggctgctgcagttcccctggaggaacccccctggaagtaccaa
cccacagaccctggccgggcatctcgtaaccatatgattacttgcctcatcgcaggactt
aacaaagcagcccataaggccgtaaattttgaaaagctcaaagaaatctcccaaagagcc
gatgaaaatcctgctgaatttctttctcgttttacagaggccctccaaaaatatactcgt
gtagaccccacctcccgggaagaaactatcgttcttaataaccatttcatctctcagtct
gctcctaacatacagcacaaactgaaaaaggccgaagatggccctcaaactccacaacaa
gatctccttaacctgactttcaaagtcttcaataacagggaggagcagattaaattagac
aaggcccaaagagattgtgctaaataccagcttctggcagtggctatccatcaacctagc
catagtacccaagggcacaaaaaacccaatggcagtaaccctcctgggccttgttttaag
tgcagcaaagaagtaacatacttaggagtccaactctcccctggggcccaagccatgacc
ccagcacaagcaaccttaataaacagcttgcctctgccttcctcaaaaaatgaaattctc
tctttcttaagactagaaggtttctttagaatatggattcccaactttgccctcctggct
caacccctctacgaagcagccaaaggccccctcaatgaacccctaagccccatacacaac
atacttcccagtttctgtaaactccaaactgctctcatcactgcacctgccctgtcctta
cccgacctctcccaaccctttgttctctataccaccaaaaatcaaggaatagctcttggg
gtcttagggcaacaaaagggaaatcctccttcctttgaccctgtagcatatctctgtaaa
caactagacaacactgtcaaagggcagccaacctgtcttaaagcatcatcagcagtggcc
gttttgcctctggaaagcaaaaaactaacatttggccaaagcaccaccattcacagccct
cacaacttacaggatctcctctcctcctgggcattaagctccctctctccttcccaaatt
cagtcgctctacgccctctttatcaaaaatcctgaattcagccttgccaaaagtgccccc
ctcaacctggcatccctacttcccatatcctcttcccctcctactcattcttgcactgac
attctggatcacttgcagccacaattccctaacatctcctccaagcctctcactaatcca
gatgaccaactatttatagatgactcctcttccagagcccccggctctcccaaaattgtt
gggtatgcagtagttaccttaaaccatgtaattgaggctaaacccctacccccagaaacc
tcctcccagaaagcagaactcagctctcacaagagccctaaccctctccaaggacaaaca
ggtcaacatatacacagactccaagtatgcctaccacattcttcattctcatgccgccat
ctgagaccaagaagcgggacgttcaccatgggaagaaaatcgctgtaccttctgattgtg
gggatcctcatagcatattatatttatacgcctctcccagataacgttgaggagccatgg
agaatgatgtggataaacgcacatctgaaaactatacaaaatttggttgtcgggagcttt
gatgaagtcccaccaacctcagatgaaaatgtcactgtgactgagacaaaattcaacaac
attcttgttcgggtatatgtgccaaagagaaagtctgaagcactaagaagggggttgttt
tacatccatggtggaggctggtgcgtgggaagtgctgctctaagtggttatgacttgctg
tcaagatggacagcagacagacttgatgctgtcgtcgtatcaaccaactacagattagca
cctaagtatcatttcccaattcaatttgaagatgtatataatgccttaaggtggttctta
cgtaaaaaagttcttgcaaaatatggtgtgaaccctgagagaatcggtatttctggagat
agtgcaggagggaatttagctgcagcagtgactcaacagctccttgatgacccagatgtc
aagatcaaactcaagatccagtctttaatttatcctgcccttcagcctcttgatgtagat
ttaccgtcatatcaagaaaattcaaattttctatttctatccaaatcactcatggtcaga
ttctggagtgaatattttaccactgatagatcacttgaaaaagccatgctttccagacaa
catgtacctgtggaatcaagtcatctcttcaaatttgttaattggagttccctgctccct
gagaggtttataaaaggacatgtttataacaatccaaattatggcagttctgagctggct
aaaaaatatccagggttcctagatgtgagggcagcccctttgttggctgatgacaacaaa
ttacgtggcttacccctgacctatgtcatcacctgtcaatatgatctcttaagagatgat
ggactcatgtatgtcacccgacttcgcaacactggggttcaggtgactcataaccatgtt
gaggatggattccatggagcattttcatttctgggacttaaaattagtcacagacttata
aatcagtatattgagtggctaaaggaaaatctatag

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_3|100_aa
MLPDFKLYYKATVTKTTWYCYRNRHIDQWNTTETSEITPHIYNRLIFDKPDKNKDKDEAR
SHHPQQTNTGTENQTPHVLTHKWESNNENTWTQGRKDVKG

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_3|303_bp
atgctaccagacttcaaactgtactacaaggctacagtaaccaaaacaacatggtactgc
taccgaaacagacatatagaccaatggaacacaacagagacctcagaaataacaccacac
atctacaaccgtctgatcttcgacaaacctgacaagaacaaggacaaggatgaagctaga
agccatcatcctcagcaaactaacacaggaacagaaaaccaaacaccacacgttctcact
cataagtgggagtcgaacaatgagaacacatggacacagggaaggaaagatgtcaaagga
tga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_4|383_aa
MSSGTTLGQKLYPAAGRVPVKGSSLPTEWLWFNSAEFVEQLRHAGDHGMFRDTKAWNATC
KNWLAAEAALEKYYLSIFYGIEFVVGVLGNTIVVYGYIFSLKNWNSSNIYLFNLSVSDLA
FLCTLPMLIRSYANGNWIYGDVLCISNRYVLHANLYTSILFLTFISIDRYLIIKYPFREH
LLQKKEFAILISLAIWVLVTLELLPILPLINPVITDNGTTCNDFASSGDPNYNLIYSMCL
TLLGFLIPLFVMCFFYYKIALFLKQRNRQVATALPLEKPLNLVIMAVVIFSVLFTPYHVM
RNVRIASRLGSWKQYQCTQVVINSFYIVTRPLAFLNSVINPVFYFLLGDHFRDMLMNQLR
HNFKSLTSFSRWAHELLLSFREK

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_4|1152_bp
atgtccagtggcacaacactgggtcagaagctatatccagctgctggcagagttcctgtc
aagggatcaagtcttccaacagaatggttatggtttaactcagcagaatttgttgaacaa
ctacgacatgctggggatcatggtatgtttagagacacaaaagcatggaatgcaacttgc
aaaaactggctggcagcagaggctgccctggaaaagtactacctttccattttttatggg
attgagttcgttgtgggagtccttggaaataccattgttgtttacggctacatcttctct
ctgaagaactggaacagcagtaatatttatctctttaacctctctgtctctgacttagct
tttctgtgcaccctccccatgctgataaggagttatgccaatggaaactggatatatgga
gacgtgctctgcataagcaaccgatatgtgcttcatgccaacctctataccagcattctc
tttctcacttttatcagcatagatcgatacttgataattaagtatcctttccgagaacac
cttctgcaaaagaaagagtttgctattttaatctccttggccatttgggttttagtaacc
ttagagttactacccatacttccccttataaatcctgttataactgacaatggcaccacc
tgtaatgattttgcaagttctggagaccccaactacaacctcatttacagcatgtgtcta
acactgttggggttccttattcctctttttgtgatgtgtttcttttattacaagattgct
ctcttcctaaagcagaggaataggcaggttgctactgctctgccccttgaaaagcctctc
aacttggtcatcatggcagtggtaatcttctctgtgctttttacaccctatcacgtcatg
cggaatgtgaggatcgcttcacgcctggggagttggaagcagtatcagtgcactcaggtc
gtcatcaactccttttacattgtgacacggcctttggcctttctgaacagtgtcatcaac
cctgtcttctattttcttttgggagatcacttcagggacatgctgatgaatcaactgaga
cacaacttcaaatcccttacatcctttagcagatgggctcatgaactcctactttcattc
agagaaaagtga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_5|166_aa
MPYIVAGKRAYAGDLPFIKPSDFVRLIHYHKNSMGETALMIQLSPSGPTFDTWSLLQVKV
RFGWRHSQTISQGVGSIRKRQGGAKAATRSLQWGEVVRWSHTFWSMSLPSFYSSSHKAAS
DLRVQEKKRKEGEFQASGSWLAGLRAGGICRARPLGSLASAAFPGA

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_5|501_bp
atgccttacatagtggcaggtaaaagagcttatgcaggggatctcccatttataaaacca
tcagatttcgtgagacttattcattaccacaagaacagtatgggtgaaactgccctcatg
attcaattatctccatctggccccacctttgacacgtggagtttattacaagtcaaggtg
agatttgggtggagacacagccaaaccatatcacaaggggttggcagcatccgaaagcga
caaggaggggccaaggcagcaacacgaagtctgcagtggggagaagtggtgaggtggtca
cacacattctggtccatgtccctaccaagtttctactcttcttcccacaaggctgcctct
gacttgagggttcaagagaagaagaggaaagagggagaatttcaagccagtggatcctgg
cttgctgggctccgtgctggtgggatctgccgagcaagaccacttggctccctggcttca
gccgcctttccaggggcatga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_6|231_aa
MQYYLSDYIQAGWKQKTLMQHCGHNVSGIGGFLVSLTSRIKPRTLAVSVTALQVAHLRLE
FVPSDVRMCSEFLPSGGFCGLAGSGVKLQTFATQEPSWLHPVDPAPELQVELPASPAPCA
RTPQPLGGRWDWAPWSRGWHLSGRLGAAQEPMEGVGGSGMAGCRSRALPCGKAVYRRHPL
TDTLPPHQQDCNIVTLDYIGKSCKMQTLCEEEYLGNPKVKKEDKSKETRRT

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_6|696_bp
atgcagtattatttatcagactacatccaagcagggtggaagcaaaaaacacttatgcag
cattgcggccataatgtttccggaattggtgggttcttggtctcactgacttcgagaata
aagccgcggaccctcgcggtgagtgttacagctcttcaggtggcgcatctgcgtctggag
tttgttccttctgatgttcggatgtgttcggagtttcttccttctggtgggttctgtggt
ctcgctggctcaggagtgaagctgcagaccttcgcgactcaggagcccagctggcttcac
ccagtggatcccgcaccggagctgcaggtggagctgcctgccagtcccgcgccgtgcgct
cgcactcctcagcccttgggtggtcgatgggactgggcgccgtggagcagggggtggcac
ttgtctgggaggctcggggccgcacaggagcccatggagggggtgggaggctcaggcatg
gcgggctgccggtcccgagccctgccctgcgggaaggcagtatatagaaggcatcccctc
actgacactttaccaccacaccagcaggactgcaatatagtaacactggattacatcgga
aagagctgcaagatgcagactctctgtgaggaggagtacttagggaatcctaaagtcaag
aaagaagacaaaagcaaggagactagaagaacatga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_7|68_aa
MAEGKEEQVTSYVDSSRQRESLFEETYPYKTIRSCETYSPSLEQHGKDLPHDSITSYWVP
PTTCGNSR

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_7|207_bp
atggcagaaggcaaggaggagcaagtcacatcttatgtggatagcagcaggcaaagagag
agcttgttcgaagaaacttacccttataaaaccatcagatcttgtgagacttattcacca
tcattagaacagcacgggaaagacctgccccatgattcaattacttcctactgggtccct
cctacaacatgtgggaattcaagatga

>gi568815595f:151780558_151981545|GENSCAN_predicted_peptide_8|63_aa
XMGLPDSLTMVIVFAFLDLATQWIYWALGLVMGSICKESCDMIHLQVLWPWIPAPALVKL
AGE

>gi568815595f:151780558_151981545|GENSCAN_predicted_CDS_8|192_bp
ngaatggggcttcctgacagcctaactatggtgattgtttttgcttttctggatctagcc
acccagtggatctactgggctctggggctggtaatggggagtatctgcaaagaatcctgt
gatatgatccatcttcaggtcttgtggccatggataccagcacctgccttggtgaagtta
gcaggggagtga