GENSCAN 1.0    Date run: 4-Nov-116    Time: 16:20:07

Sequence gi568815575r:72201726_72402869 : 201144 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    166    161    6                               1.05
 1.02 Term -   6973   3289 3685  0  1  110   54  1573 0.992 141.54
 1.01 Init -  11917  11814  104  1  2   89  101    67 0.968   7.81
 1.00 Prom -  19036  18997   40                              -6.26

 2.00 Prom +  25215  25254   40                              -5.56
 2.01 Init +  33224  33262   39  1  0   95   71    64 0.547   5.59
 2.02 Term +  39674  39778  105  1  0   90   38   104 0.694   4.01
 2.03 PlyA +  41595  41600    6                               1.05

 3.09 PlyA -  42346  42341    6                               1.05
 3.08 Term -  60785  60679  107  0  2   33   49   150 0.290   4.17
 3.07 Intr -  71047  70954   94  1  1  126   23    57 0.385   2.54
 3.06 Intr -  71664  71507  158  1  2   72  115   160 0.995  16.83
 3.05 Intr -  72247  72076  172  1  1   60  107   220 0.901  20.62
 3.04 Intr -  73425  73328   98  1  2   44  113   130 0.920  10.83
 3.03 Intr -  73999  73819  181  1  1   70   86    78 0.928   5.24
 3.02 Intr -  74509  74432   78  1  0   42   78    59 0.491   0.05
 3.01 Init -  75470  75468    3  2  0   91  103     0 0.670   1.80
 3.00 Prom -  82398  82359   40                              -3.46

 4.07 PlyA -  83941  83936    6                               1.05
 4.06 Term - 100519  99998  522  1  0  113   38   393 0.999  31.08
 4.05 Intr - 101209 101085  125  2  2  109   76    61 0.986   7.30
 4.04 Intr - 103475 103322  154  2  1   97   37    78 0.663   3.25
 4.03 Intr - 104919 104752  168  0  0   32   99    81 0.309   3.74
 4.02 Intr - 116097 115885  213  0  0   49   52   127 0.059   4.11
 4.01 Init - 117988 117947   42  1  0  104   70    30 0.057   3.22
 4.00 Prom - 120750 120711   40                              -7.16

 5.04 PlyA - 121419 121414    6                               1.05
 5.03 Term - 123386 123223  164  0  2   70   50   136 0.978   6.10
 5.02 Intr - 150113 150008  106  2  1  104  105   127 0.926  15.79
 5.01 Init - 174385 174383    3  1  0  108   81     0 0.349   1.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:72201726_72402869|GENSCAN_predicted_peptide_1|1262_aa
MGALLYQKRSGHPAISGGVEVSDGSATVANSSGGRYVKEAKEATKNGDLEEAFKLFNLAK
DIFPNEKVLSRIQKIQEALEELAEQGDDEFTDVCNSGLLLYRELHNQLFEHQKEGIAFLY
SLYRDGRKGGILADDMGLGKTVQIIAFLSGMFDASLVNHVLLIMPTNLINTWVKEFIKWT
PGMRVKTFHGPSKDERTRNLNRIQQRNGVIITTYQMLINNWQQLSSFRGQEFVWDYVILD
EAHKIKTSSTKSAICARAIPASNRLLLTGTPIQNNLQELWSLFDFACQGSLLGTLKTFKM
EYENPITRAREKDATPGEKALGFKISENLMAIIKPYFLRRTKEDVQKKKSSNPEARLNEK
NPDVDAICEMPSLSRKNDLIIWIRLVPLQEEIYRKFVSLDHIKELLMETRSPLAELGVLK
KLCDHPRLLSARACCLLNLGTFSAQDGNEGEDSPDVDHIDQVTDDTLMEESGKMIFLMDL
LKRLRDEGHQTLVFSQSRQILNIIERLLKNRHFKTLRIDGTVTHLLEREKRINLFQQNKD
YSVFLLTTQVGGVGLTLTAATRVVIFDPSWNPATDAQAVDRVYRIGQKENVVVYRLITCG
TVEEKIYRRQVFKDSLIRQTTGEKKNPFRYFSKQELRELFTIEDLQNSVTQLQLQSLHAA
QRKSDIKLDEHIAYLQSLGIAGISDHDLMYTCDLSVKEELDVVEESHYIQQRVQKAQFLV
EFESQNKEFLMEQQRTRNEGAWLREPVFPSSTKKKCPKLNKPQPQPSPLLSTHHTQEEDI
SSKMASVVIDDLPKEGEKQDLSSIKVNVTTLQDGKGTGSADSIATLPKGFGSVEELCTNS
SLGMEKSFATKNEAVQKETLQEGPKQEALQEDPLESFNYVLSKSTKADIGPNLDQLKDDE
ILRHCNPWPIISITNESQNAESNVSIIEIADDLSASHSALQDAQASEAKLEEEPSASSPQ
YACDFNLFLEDSADNRQNFSSQSLEHVEKENSLCGSAPNSRAGFVHSKTCLSWEFSEKDD
EPEEVVVKAKIRSKARRIVSDGEDEDDSFKDTSSINPFNTSLFQFSSVKQFDASTPKNDI
SPPGRFFSSQIPSSVNKSMNSRRSLASRRSLINMVLDHVEDMEERLDDSSEAKGPEDYPE
EGVEESSGEASKYTEEDPSGETLSSENKSSWLMTSKPSALAQETSLGAPEPLSGEQLVGS
PQDKAAEATNDYETLVKRGKELKECGKIQEALNCLVKALDIKSADPEVMLLTLSLYKQLN
NN

>gi568815575r:72201726_72402869|GENSCAN_predicted_CDS_1|3789_bp
atgggcgccttgctatatcagaaacgcagcggacaccctgccatatctggaggggtggaa
gtcagcgatgggtctgcaacggtggcaaacagcagtggtggacgatatgtgaaagaggcc
aaagaagcaactaagaatggagacctggaagaagcatttaaacttttcaatttggcaaag
gacatttttcccaatgaaaaagtgctgagcagaatccaaaaaatacaggaagccttggag
gagttggcagaacagggagatgatgaatttacagatgtgtgcaactctggcttgctactt
tatcgagaactgcacaaccaactctttgagcaccagaaggaaggcatagctttcctctat
agcctgtatagggatggaagaaaaggtggtatattggctgatgatatgggattagggaag
actgttcaaatcattgctttcctttccggtatgtttgatgcatcacttgtgaatcatgtg
ctgctgatcatgccaaccaatcttattaacacatgggtaaaagaattcatcaagtggact
ccaggaatgagagtcaaaacctttcatggtcctagcaaggatgaacggaccagaaacctc
aatcggattcagcaaaggaatggtgttattatcactacataccaaatgttaatcaataac
tggcagcaactttcaagctttaggggccaagagtttgtgtgggactatgtcatcctcgat
gaagcacataaaataaaaacctcatctactaagtcagcaatatgtgctcgtgctattcct
gcaagtaatcgcctcctcctcacaggaaccccaatccagaataatttacaagaactatgg
tccctatttgattttgcttgtcaagggtccctgctgggaacattaaaaacttttaagatg
gagtatgaaaatcctattactagagcaagagagaaggatgctaccccaggagaaaaagcc
ttgggatttaaaatatctgaaaacttaatggcaatcataaaaccctattttctcaggagg
actaaagaagacgtacagaagaaaaagtcaagcaacccagaggccagacttaatgaaaag
aatccagatgttgatgccatttgtgaaatgccttccctttccaggaaaaatgatttaatt
atttggatacgacttgtgcctttacaagaagaaatatacaggaaatttgtgtctttagat
catatcaaggagttgctaatggagacgcgctcacctttggctgagctaggtgtcttaaag
aagctgtgtgatcatcctaggctgctgtctgcacgggcttgttgtttgctaaatcttggg
acattctctgctcaagatggaaatgagggggaagattccccagatgtggaccatattgat
caagtaactgatgacacattgatggaagaatctggaaaaatgatattcctaatggaccta
cttaagaggctgcgagatgagggacatcaaactctggtgttttctcaatcgaggcaaatt
ctaaacatcattgaacgcctcttaaagaataggcactttaagacattgcgaatcgatggg
acagttactcatcttttggaacgagaaaaaagaattaacttattccagcaaaataaagat
tactctgtttttctgcttaccactcaagtaggtggtgtcggtttaacattaactgcagca
actagagtggtcatttttgaccctagctggaatcctgcaactgatgctcaagctgtggat
agagtttaccgaattggacaaaaagagaatgttgtggtttataggctaatcacttgtggg
actgtagaggaaaaaatatacagaagacaggttttcaaggactcattaataagacaaact
actggtgaaaaaaagaaccctttccgatattttagtaaacaagaattaagagagctcttt
acaatcgaggatcttcagaactctgtaacccagctgcagcttcagtctttgcatgctgct
cagaggaaatctgatataaaactagatgaacatattgcctacctgcagtctttggggata
gctggaatctcagaccatgatttgatgtacacatgtgatctgtctgttaaagaagagctt
gatgtggtagaagaatctcactatattcaacaaagggttcagaaagctcaattcctcgtt
gaattcgagtctcaaaataaagagttcctgatggaacaacaaagaactagaaatgagggg
gcctggctaagagaacctgtatttccttcttcaacaaagaagaaatgccctaaattgaat
aaaccacagcctcagccttcacctcttctaagtactcatcatactcaggaagaagatatc
agttccaaaatggcaagtgtagtcattgatgatctgcccaaagagggtgagaaacaagat
ctctccagtataaaggtgaatgttaccaccttgcaagatggtaaaggtacaggtagtgct
gactctatagctactttaccaaaggggtttggaagtgtagaagaactttgtactaactct
tcattgggaatggaaaaaagctttgcaactaaaaatgaagctgtacaaaaagagacatta
caagaggggcctaagcaagaggcactgcaagaggatcctctggaaagttttaattatgta
cttagcaaatcaaccaaagctgatattgggccaaatttagatcaactaaaggatgatgag
attttacgtcattgcaatccttggcccattatttccataacaaatgaaagtcaaaatgca
gaatcaaatgtatccattattgaaatagctgatgacctttcagcatcccatagtgcactg
caggatgctcaagcaagtgaggccaagttggaagaggaaccttcagcatcttcaccacag
tatgcatgtgatttcaatcttttcttggaagactcagcagacaacagacaaaatttttcc
agtcagtctttagagcatgttgagaaagaaaatagcttgtgtggctctgcacctaattcc
agagcagggtttgtgcatagcaaaacatgtctcagttgggagttttctgagaaagacgat
gaaccagaagaagtagtagttaaagcaaaaatcagaagtaaagctagaaggattgtttca
gatggcgaagatgaagatgattcttttaaagatacctcaagcataaatccattcaacaca
tctctctttcaattctcatctgtgaaacaatttgatgcttcaactcccaaaaatgacatc
agtccaccaggaaggttcttttcatctcaaatacccagtagtgtaaataagtctatgaac
tctagaagatctctggcttctaggaggtctcttattaatatggttttagaccacgtggag
gacatggaggaaagacttgacgacagcagtgaagcaaagggtcctgaagattatccagaa
gaaggggtggaggaaagcagtggcgaagcctccaagtatacagaagaggatccttccgga
gaaacactgtcttcagaaaacaagtccagctggttaatgacgtctaagcctagtgctcta
gctcaagagacctctcttggtgcccctgagcctttgtctggtgaacagttggttggttct
ccccaggataaggcggcagaggctacaaatgactatgagactcttgtaaagcgtggaaaa
gaactaaaagagtgtggaaaaatccaggaggccctaaactgcttagttaaagcgcttgac
ataaaaagtgcagatcctgaagttatgctcttgactttaagtttgtataagcaacttaat
aacaattga

>gi568815575r:72201726_72402869|GENSCAN_predicted_peptide_2|47_aa
MKGLLDIDIDGFQRMQQQGTILEAESSPHQIPILLAPSSWTLQPPEP

>gi568815575r:72201726_72402869|GENSCAN_predicted_CDS_2|144_bp
atgaagggcctgcttgacatcgatattgatggatttcaaaggatgcagcaacaaggcacc
atcttggaagcagagagcagccctcaccagataccaatcctgctggcaccttcatcttgg
actcttcagcctccagaaccataa

>gi568815575r:72201726_72402869|GENSCAN_predicted_peptide_3|296_aa
MARGPKKHLKRVAAPKHWMLDKLTGVFAPRPSTGPHKLRECLPLIIFLRNRLKYALTGDE
VKKICMQRFIKIDGKVRTDITYPAGFMDVISIDKTGENFRLIYDTKGRFAVHRITPEEAK
YKLCKVRKIFVGTKGIPHLVTHDARTIRYPDPLIKVNDTIQIDLETGKITDFIKFDTGNL
CMVTGGANLGRIGVITNRERHPGSFDVVHVKDANGNSFATRLSNIFVIGKGNKPWISLPR
GKGIRLTIAEERDKRLAAKQSMNFHDPEKLGLFAILLVKKNAHTFTKPTGSEKVMN

>gi568815575r:72201726_72402869|GENSCAN_predicted_CDS_3|891_bp
atggctcgtggtcccaagaagcatctgaagcgggtggcagctccaaagcattggatgctg
gataaattgaccggtgtgtttgctcctcgtccatccaccggtccccacaagttgagagag
tgtctccccctcatcattttcctgaggaacagacttaagtatgccctgacaggagatgaa
gtaaagaagatttgcatgcagcggttcattaaaatcgatggcaaggtccgaactgatata
acctaccctgctggattcatggatgtcatcagcattgacaagacgggagagaatttccgt
ctgatctatgacaccaagggtcgctttgctgtacatcgtattacacctgaggaggccaag
tacaagttgtgcaaagtgagaaagatctttgtgggcacaaaaggaatccctcatctggtg
actcatgatgcccgcaccatccgctaccccgatcccctcatcaaggtgaatgataccatt
cagattgatttggagactggcaagattactgatttcatcaagttcgacactggtaacctg
tgtatggtgactggaggtgctaacctaggaagaattggtgtgatcaccaacagagagagg
caccctggatcttttgacgtggttcacgtgaaagatgccaatggcaacagctttgccact
cgactttccaacatttttgttattggcaagggcaacaaaccatggatttctcttccccga
ggaaagggtatccgcctcaccattgctgaagagagagacaaaagactggcggccaaacag
agcatgaatttccacgaccctgaaaaattgggcttatttgccattttactggtgaagaaa
aatgcccacaccttcactaagcccacaggaagtgagaaggttatgaactga

>gi568815575r:72201726_72402869|GENSCAN_predicted_peptide_4|407_aa
MEGMGLHKLKVKIKTPRLSAHIIFSDHNIKAEFLNVASKSGSCNLIDPILCCFPAPISTF
AHAVPNTWRMLILSSDVTYLPLHFKAPRGRVANGAAAGAERDRFAVAVAAACVPGAGGSA
ASLAAHRAPRFPLSRAGGFSEAGGGCSPKGRPEAKSGQRDWELVAGGPPGISRREGTCCS
RFPSRLSQPFRSAQQLQLAASLPANLSNFCQGSEMPTTSRPALDVKGGTSPAKEDANQEM
SSVAYSNLAVKDRKAVAILHYPGVASNGTKASGAPTSSSGSPIGSPTTTPPTKPPSFNLH
PAPHLLASMHLQKLNSQYQGMAAATPGQPGEAGPLQNWDFGAQAGGAESLSPSAGAQSPA
IIDSDPVDEEVLMSLVVELGLDRANELPELWLGQNEFDFTADFPSSC

>gi568815575r:72201726_72402869|GENSCAN_predicted_CDS_4|1224_bp
atggagggcatgggtttgcacaagctgaaagtgaagataaagactccacgcttatctgcc
cacatcatctttagtgaccacaatataaaggctgaatttcttaacgtggcatccaagtct
ggctcctgcaacctcatagatcccatcctgtgctgtttcccagctcccatcagcaccttt
gcccatgctgtccccaatacctggagaatgctgatcctcagctcagatgtcacctatctc
ccacttcattttaaggcgccccggggtcgggtggccaacggcgcggccgcgggcgctgag
cgcgaccggttcgcggtagcggtggcggcggcgtgcgtgccaggggctgggggctccgcc
gcctctcttgcggctcaccgagctccgcgcttccctctctccagggcaggcggcttctca
gaggcgggagggggctgctcgcccaagggaaggcctgaagcgaagagcgggcagcgggac
tgggaactggtggccggggggccgccaggcatcagccggcgagaaggcacttgttgctct
aggtttcccagccgcctgtcgcagccgtttcgcagtgcacaacagctccagctggcagca
tcacttcccgccaatttatccaacttctgccaaggctctgaaatgccaacaacgtcgagg
cctgcacttgatgtcaagggtggcacctcacctgcgaaggaggatgccaaccaagagatg
agctccgtggcctactccaaccttgcggtgaaagatcgcaaagcagtggccattctgcac
taccctggggtagcctcaaatggaaccaaggccagtggggctcccactagttcctcggga
tctccaataggctctcctacaaccacccctcccactaaacccccatccttcaacctgcac
cccgcccctcacttgctggctagtatgcacctgcagaaacttaatagccagtatcagggg
atggctgctgccactccaggccaacccggggaggcaggacccctgcaaaactgggacttt
ggggcccaggcgggaggggcagaatcactctctccttctgctggtgcccagagccctgct
atcatcgattcggacccagtggatgaggaagtgctgatgtcgctggtggtggaactgggg
ttggaccgagccaatgagcttccggagctgtggctggggcagaatgagtttgacttcact
gcggactttccatctagctgctaa

>gi568815575r:72201726_72402869|GENSCAN_predicted_peptide_5|90_aa
MFFTAYGPDYVLEITPSCRPDRNEPHRIQQILNYIKGIINTEEETSNLDWGNHEGYKQAV
TLKLNLGPGMGGGIPGKGTGIHEGKKAQEI

>gi568815575r:72201726_72402869|GENSCAN_predicted_CDS_5|273_bp
atgtttttcacagcatatggtcctgattatgtgctggaaatcacgccaagctgccggcca
gaccgcaatgagccccaccgaatccaacaaatcctcaactacatcaaaggtattattaac
acagaggaggagacaagcaatctagactggggcaaccatgaaggctacaagcaggcggtg
acactcaagctgaatcttgggccaggcatgggaggaggcattccaggcaaagggactgga
atacatgaaggcaagaaggcacaggaaatttag