GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:22:27

Sequence gi568815582r:11737848_11951512 : 213665 bp : 44.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -    606    504  103  2  1  118   75    54 0.399   7.28
 1.13 Intr -   4983   4630  354  2  0  -26   94   479 0.337  31.50
 1.12 Intr -  14985  14822  164  0  2   74   77   236 0.961  19.87
 1.11 Intr -  18523  18390  134  2  2   46   79    74 0.853   2.66
 1.10 Intr -  25812  25631  182  2  2   79   96    97 0.845   9.11
 1.09 Intr -  27306  27206  101  0  2  110   44     7 0.863  -2.59
 1.08 Intr -  27838  27642  197  2  2  110   85   102 0.994  11.23
 1.07 Intr -  33140  32936  205  2  1  131   94   -32 0.756   0.47
 1.06 Intr -  36672  36389  284  1  2   89  115   164 0.945  16.34
 1.05 Intr -  37166  37133   34  2  1  102   91    32 0.999   2.70
 1.04 Intr -  38685  38605   81  0  0   97   99    -1 0.769   1.73
 1.03 Intr -  39062  38904  159  2  0   80   80   118 0.999  10.38
 1.02 Intr -  41516  41319  198  2  0  101  101   118 0.999  13.95
 1.01 Init -  59483  59277  207  2  0   51   91   186 0.948  12.02
 1.00 Prom -  60452  60413   40                              -6.56

 2.12 PlyA -  60637  60632    6                               1.05
 2.11 Term -  61951  61745  207  1  0  101   42    79 0.125   1.94
 2.10 Intr -  66233  66136   98  0  2   29  105    48 0.046   0.33
 2.09 Intr -  80150  80015  136  2  1   63   77    53 0.059   1.84
 2.08 Intr - 100266 100006  261  0  0   89   28   122 0.463   3.98
 2.07 Intr - 102138 101848  291  0  0   58   57   187 0.847   9.93
 2.06 Intr - 103973 103848  126  2  0   75   30    74 0.688   1.18
 2.05 Intr - 104153 104060   94  1  1   48   77   106 0.748   5.47
 2.04 Intr - 108996 108848  149  0  2   85  100    17 0.604   1.63
 2.03 Intr - 109959 109821  139  2  1   83   91    32 0.325   3.47
 2.02 Intr - 112571 112432  140  2  2   98   87   -36 0.269  -3.44
 2.01 Init - 113665 113561  105  1  0   84  109   110 0.549  12.51
 2.00 Prom - 120276 120237   40                              -4.16

 3.14 PlyA - 121417 121412    6                               1.05
 3.13 Term - 126485 126414   72  2  0   98   37    39 0.137  -2.29
 3.12 Intr - 138328 138239   90  1  0   50   79    70 0.816   2.49
 3.11 Intr - 139733 139560  174  2  0   61   97   161 0.999  14.44
 3.10 Intr - 145329 145168  162  0  0    8  116   118 0.658   6.77
 3.09 Intr - 147427 147334   94  0  1   79   75    64 0.996   4.17
 3.08 Intr - 148764 148624  141  2  0   58  103    64 0.883   4.37
 3.07 Intr - 149084 148930  155  2  2   44   83    78 0.994   1.77
 3.06 Intr - 149883 149723  161  1  2   43   99   116 0.268   7.91
 3.05 Intr - 158938 158711  228  2  0   29   87   157 0.183   7.54
 3.04 Intr - 160034 159993   42  0  0   77   91    37 0.208   1.11
 3.03 Intr - 177859 177522  338  0  2   19   70   670 0.311  53.46
 3.02 Intr - 178533 178291  243  2  0   77   26   197 0.572   9.01
 3.01 Init - 183753 183665   89  0  2   72   90    40 0.785   2.74
 3.00 Prom - 184958 184919   40                              -5.06

 4.00 Prom + 186038 186077   40                              -2.66
 4.01 Init + 187424 187464   41  1  2   80   81    16 0.949  -0.22
 4.02 Intr + 188351 188457  107  2  2   93  101   162 0.984  17.86
 4.03 Term + 193386 193546  161  1  2   40   43   131 0.235   1.90
 4.04 PlyA + 193937 193942    6                              -3.44

 5.05 PlyA - 194400 194395    6                              -0.45
 5.04 Term - 195814 195666  149  2  2   47   44   184 0.965   7.96
 5.03 Intr - 196077 195978  100  0  1   69   86    64 0.973   3.98
 5.02 Intr - 199821 199693  129  0  0   82   92    32 0.931   3.89
 5.01 Init - 203396 203322   75  2  0   70   56   113 0.985   5.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:11737848_11951512|GENSCAN_predicted_peptide_1|801_aa MAASPGRQARPPAARGRRRGRNRAAAAAANRGRAASEHPGPAAAAATAARLPPPASACLL PPAGASSPQVYLRALVRNLFNEGNDVYREHDWNNSISQYTEALNIADYAKSEEILIPKEI IEKLYINRIACYSNMGFHDKVLEDCNIVLSLNASNCKALYRKSKALSDLGRYKKAYDAVA KCSLAVPQDEHVIKLTQELAQKLGFKIRKAYVRAEALNHSVEDIEPDLLTPRQEAVPVVS LPAPSFSHEVGSELASVPVMPLTSILPLQVEESALPSAVLANGGKMPFTMPEAFLDDGDM VLGDELDDLLDSAPETNETVMPSALVRGPLQTASVSPSMPFSASLLGTLPIGARYAPPPS FSEFYPPLTSSLEDFCSSLNSFSMSESKRDVAAEEECRYSGHCTFAYCQEEIDVWTLERK GAFSREAFFGGNGKINLTVFKLLQEHLGEFIFLCEKCFDHKPRMISKRNKDNSTACSHPV TKHEFEDNKCLVHILRETTVKYSKIRSFHGQCQLDLCRHEVRYGCLREDECFYAHSLVEL KVWIMQNETVQDMEQFYELWLKSQKNEKSEDIASQSNKENGKQIHMPTDYAEVTVDFHCW MCGKNCNSEKQWQGHISSEKHKEKVFHTEDDQYCWQHRFPTGYFSICDSGCGELRDWGAT GRAGPSGPRRRRRYKAGDWEHVMSECGGRGGGSSSSEDAEDEGGGGGGPAGSDCLSSSPT LATASSAGRLRRGLRGAFLMARQRPELLCGAVALGCALLLALKFTCRKWSPRQHAVLAAA ALVLVPKAQSQLLSKEFLNNC >gi568815582r:11737848_11951512|GENSCAN_predicted_CDS_1|2403_bp atggcggcgtctcccgggcggcaagcgcggcctcccgccgctagagggcgccgccgcggg cgcaaccgagccgctgccgccgccgctaaccgagggagagctgcgagcgagcacccaggc cccgccgccgccgccgccaccgctgcccgcctgccgccgcccgcctccgcctgccttctg ccgccggccggcgcgtcgtccccacaggtatatttgcgtgctctcgtgagaaatcttttt aatgaaggaaatgacgtttatcgggaacatgattggaacaactcgataagccagtacacg gaagccttgaatatagctgattatgcaaaatctgaagaaattttaatccccaaagaaata attgaaaaactatatataaatcgtattgcctgctattctaatatgggtttccatgataaa gttttggaggactgcaatatagtcctcagtttaaatgccagtaactgcaaagctctgtat cggaaatctaaggctttaagtgatttaggaagatacaaaaaggcttacgatgctgtagca aagtgctccttagcagtgcctcaggatgagcatgtaataaaactaactcaagaactagct cagaaattgggatttaaaataagaaaagcgtatgtcagagctgaggctttgaaccattct gtggaagatattgagccagatttattaactccaaggcaagaagcagttcctgttgtctct ttaccggcacccagtttttctcatgaagttggaagtgagctggcctcagttcctgttatg cccttaacttctattttgccactacaagtggaagagagcgctctgccatctgcagtgctg gcaaatggaggaaagatgcccttcactatgccagaagcttttctagatgatggagatatg 
gtccttggagatgaactagatgacctgcttgattctgcacctgaaactaatgaaactgtt atgccgtcagctttagtcagaggaccccttcagacagccagtgtctctcctagcatgccc ttttcggcatcgctgttaggaaccttacccattggtgcgaggtatgctcctccaccctcc ttctcagaattttatccacctttgacttcatccttagaagatttttgttcttctttaaat tcattttcaatgagtgaatccaaacgagatgttgctgctgaggaggaatgtagatattca ggccactgcacgtttgcttattgccaagaggagatagatgtgtggacactggagcggaaa ggagcattcagccgggaggctttctttggcggcaatggaaagattaaccttactgtgttc aaacttctccaggagcatcttggggaatttatattcctttgtgagaaatgttttgatcat aagcctagaatgataagtaaaagaaataaagataattctactgcttgttctcacccggtt acaaagcatgagtttgaagacaataagtgccttgtccacattttgcgagagacaacagta aaatactccaaaatacgttcttttcatggtcagtgtcagcttgatttatgtcgacatgaa gttcggtatggctgtttaagggaagatgagtgcttttatgcccatagtcttgtggaactg aaagtctggataatgcaaaatgaaacagtacaagatatggagcaattttacgaactatgg ctcaagagtcaaaaaaatgaaaaaagtgaagacatagccagtcagtcaaacaaggaaaat ggaaaacaaattcacatgccaacagattatgctgaagttacagtggactttcactgctgg atgtgtgggaaaaactgcaacagtgagaagcagtggcagggccacatctcctccgagaag cacaaagagaaggttttccacaccgaggacgaccagtactgctggcagcaccgcttccca acaggctatttcagtatttgtgatagcgggtgcggcgagctgcgggattggggagcaacg ggccgggccgggccttcgggcccgaggcggcggcggcggtataaagccggcgactgggag catgtaatgtcggaatgcggaggccgcggcggcggcagcagcagcagcgaggacgccgag gacgagggagggggcggcggcggccccgcgggctcagactgcctcagctcgagcccgacc ctggccacagcgtcctcggcgggccggctccgtcgcgggctgcgtggcgccttcctcatg gcgcgccagcggccggagctgctctgcggggccgtggcgctcggctgcgcgctgctcctc gccctcaagttcacctgcagaaagtggtctcctcgacaacacgcagtgctagctgccgca gccttagtgcttgtgcccaaagcacagtcacagctgctcagtaaagaatttttgaataat tgt >gi568815582r:11737848_11951512|GENSCAN_predicted_peptide_2|581_aa MEDSASASLSSAAATGTSTSTPAAPTARKQLDKEQVRKAVDALLTHCKSRKNNYGLLLNE NESLFLMVVLWKIPSKELRVRLTLPHSIRSDSEDICLFTKDEPNSTPEKTEQFYRKLLNK HGIKTVSQIISLQTLKKEYKSYEAKLRLLSSFDFFLTDARIRRLLPSLIGRHFYQRKNAI RIGHVGMQIEHIIENIVAVTKGLSEKLPEKWESVKLLFVKTEKSAALPIFSSFVSNWDEA TKRSLLNKKKKEARRKRRERNFEKQKERKKKRQQARKTASVLSKDDVAPESGDTTVKKPE SKKEQTPEHGKKKRGRGKAQVKATNESEDEIPQLVPIGKKTPANEKVEIQKHATGKKSPA 
KSPNPSTPRGKKRKALPASETPKAAESETPGKSPEKKPKIKEEAVKEKSPSLGKKDARQT PKKPEAKFFTTPSKSIMAFVSSHVSVWLFVTTIQCLRAQGSGNFQRAEHGEVPGGWLTQE AERKTKATGNNPIILTPNTAENRYNMQRGGGFKVSRTWKLAGAGVSPVSRDLLSLLATGL DEVISDHDDCLEDQIGRCVSEQLLDGLSVKASPREATVVEA >gi568815582r:11737848_11951512|GENSCAN_predicted_CDS_2|1746_bp atggaggattcggcctcggcctcgctgtcttctgcagccgctactggaacctccacctcg actccagcggccccgacagcacggaagcagctggataaagaacaggttagaaaggcagtg gacgctctcttgacgcattgcaagtccaggaaaaacaattatgggttgcttttgaatgag aatgaaagtttatttttaatggtggtattatggaaaattccaagtaaagaactgagggtc agattgaccttgcctcatagtattcgatcagattcagaagatatctgtttatttacgaag gatgaacccaattcaactcctgaaaagacagaacagttttatagaaagcttttaaacaag catggaattaaaaccgtttctcagattatctccctccaaactctaaagaaggaatataaa tcctatgaagccaagctccgccttctgagcagttttgatttcttccttactgatgccaga attaggcggctcttaccctcactcattgggagacatttctatcaaagaaagaatgctata cgtattggtcacgttggaatgcaaattgagcacatcattgaaaacattgttgctgtcacc aaaggactttcagaaaaattgccagagaagtgggagagcgtgaaactcctgtttgtgaaa actgagaaatcggctgcacttcccatcttttcctcgtttgtcagcaattgggatgaagcc accaaaagatctttgcttaataagaagaaaaaagaggcaaggagaaaacgaagagaaaga aattttgaaaaacaaaaggagaggaagaagaagaggcagcaggctaggaagactgcatca gttcttagtaaagatgatgtggcacctgaaagtggtgatactacagtgaagaaacctgaa tcaaagaaggaacagaccccagagcatgggaagaaaaaacgtggcagaggaaaagcccaa gttaaagcaacaaatgaatccgaagacgaaatcccacagctggtaccaataggaaagaag actccagctaatgaaaaagtagagattcaaaaacatgccacaggaaagaagtctccagca aagagtcctaatcccagcacacctcgtgggaagaaaagaaaggctttgccagcatctgag accccaaaagctgcagagtctgagaccccagggaaaagcccagagaagaagccaaaaatc aaagaagaggcagtgaaggaaaaaagtccttcgctggggaaaaaagatgcgagacagact ccaaaaaagccagaggccaagtttttcaccactcctagtaaatctatcatggcctttgtg tccagtcatgtttctgtctggttgttcgtcacaactatccagtgcctaagagcacagggt tcagggaacttccagagagctgaacacggggaggttcctggagggtggctgacccaagaa gctgaaaggaaaaccaaggcaaccggaaataatcctattatactgactcctaatacagct gagaacaggtacaacatgcagaggggtggaggattcaaggtgtcaagaacctggaaactg gctggggctggggtgtcaccagtatccagagacctcctgagccttctggcaacaggacta gacgaggtcatttcagatcatgatgactgcttggaagatcagatagggcgctgtgttagt 
gagcaacttttagatgggctgtcagtgaaggcctctccaagagaggctactgtagttgag gcctga >gi568815582r:11737848_11951512|GENSCAN_predicted_peptide_3|662_aa MSHSAQLAIISLTHLQAFLIRILNREGFSSLCPAGQSGPLLWAKILSVFGPAQSCTNQGL PLPPHQVTGGAAERPPDLRLLALPAGSRSRFRLRPVPELRLPSAPGKTAAGGGGGGGGGG GGSSSGSSSSDSAPDCWDQADMEAPGPGPCGGGGSLAAAAEAQRENLSAAFSRQLNVNAK PFVPNVHAAEFVPSFLRGPAAPPPPVGGAANNHGAGSGAGGRAGSNSAVSMELSEPIVEN GETEMSPEESWEHKEEISEAEPGGGSLGDGRPPEESAHEMMEEEEEIPKPKSVVAPPGAP KKEHVNVVFIGHVDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVLINKMDDPTVNWSNERY EECKEKLVPFLKKVGFNPKKDIHFMPCSGLTGANLKEQSDFCPWYIGLPFIPYLDNLPNF NRSVDGPIRLPIVDKYKCRISLWAIELLYSTKKSETGFYFTFDQDMGTVVLGKLESGSIC KGQQLVMMPNKHNVEVLGILSDDVETDTVAPGENLKIRLKGIEEEEILPGFILCDPNNLC HSGRTFDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITIYSILCMNNFSKSFYRSLIDD VA >gi568815582r:11737848_11951512|GENSCAN_predicted_CDS_3|1989_bp atgagccacagcgcccagctggctattatatctttaacacatctccaagctttcctcatc cgcattttgaacagggagggcttcagcagcctgtgtccagcaggccagagtggacccctc ctatgggctaaaatactctcggtcttcggcccagcacagagctgcaccaaccagggcctg ccgctgcccccccaccaggtgacaggtggggccgccgagcggccaccagacctccgcctc ctcgcgttgcctgccgggagccggagccgcttccggttgcgccctgtccctgaacttcga ctcccgtcggcccccgggaaaaccgcagcgggtggcggcggcggcggcggcggcggcggc ggcgggagcagcagcggcagcagcagcagcgactcggcgcctgactgctgggaccaggcg gacatggaagcccccgggccgggcccttgcggcggcggcggctccctggcggcggcggcc gaggcccagcgggagaacctcagcgcggccttcagccggcaactcaacgtcaacgccaag cccttcgtgcccaacgtccacgccgccgagttcgtgccgtccttcctgcggggcccggca gcgccgccacccccagttggcggcgccgccaataaccacggagccggcagcggcgcggga ggccgtgcgggttcaaattcagctgttagcatggaactttcagaacctattgtagaaaat ggagagacagaaatgtctccagaagaatcatgggagcacaaagaagaaataagtgaagca gagccagggggtggttccttgggagatggaaggccgccagaggaaagtgcccatgaaatg atggaggaggaagaggaaatcccaaaacctaagtctgtggttgcaccgccaggtgctcct aagaaagagcatgtaaatgtagtattcattgggcacgtagacacaaatcaggaagaacga gacaagggtaaaacagtagaagtgggtcgtgcctattttgaaaccgaaaagaagcatttc acaattctagatgcccctggccacaagagttttgtcccaaatatgattggtggtgcctct 
caagctgatttggctgtgctggtaatctcagccaggaaaggagagtttgaaactggattt gaaaaaggaggacagacaagagaacatgcaatgttggcaaagacagcaggtgtaaaacac ctaattgtgctaattaataagatggatgatccaacagtaaattggagcaatgagagatat gaagaatgtaaggagaaactagtgccatttttgaaaaaagttggcttcaatcccaaaaag gacattcactttatgccctgctcaggacttactggagcaaatctcaaagagcagtcggat ttctgtccttggtacattggattaccgtttattccatatctggataatttgccgaacttc aatagatcagttgatggaccaatcaggctgccaattgtggataagtacaagtgtagaatt tcactttgggctattgagctgttatacagcaccaagaagtctgaaactggattttatttt acatttgatcaggatatgggcactgtggtcctgggaaagctggaatcaggatctatttgt aaaggccagcagcttgtgatgatgccaaacaagcacaacgtggaagttcttggaatactt tccgatgatgtagagactgataccgtagccccaggtgaaaacctcaaaatcagactgaaa ggaattgaagaagaggagattcttccagggtttatactttgtgatcctaataatctttgt cattctggacgcacatttgatgcccagatagtgattatagagcacaaatccatcatctgc ccaggctataatgcggtgctgcatattcatacctgtattgaggaggtggaaataacaatt tattcaattctgtgcatgaataacttctctaagtccttttatagatccctgattgatgac gtagcataa >gi568815582r:11737848_11951512|GENSCAN_predicted_peptide_4|102_aa MKGGNPILREGVAWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGDNGKPSTLVL TDFNKVQFTEIEGRKRKRPKKGKFAVLPSHHLIHGPDPKDLT >gi568815582r:11737848_11951512|GENSCAN_predicted_CDS_4|309_bp atgaagggagggaatccaattttacgagagggtgtagcctggtgctgccatggccacgac tgttgttacactcgagctgaggaggccggctgcagccccaagacagagcgctactcctgg cagtgcgtcaatcagagcgtcctgtgcggtgacaacggcaaaccatccaccctggtgttg actgactttaacaaggttcagttcacagagattgagggcagaaaaaggaaacggcctaaa aagggtaagtttgctgtgttgccctcacaccacttgattcatggtcctgatcctaaggat ctcacctga >gi568815582r:11737848_11951512|GENSCAN_predicted_peptide_5|150_aa MRVLWMLFWLLFWLLLEFISHQSTCVINTLADHHHRGTDFGGSPWLRIIIAFPRSYKVVL TLWTVYLWLSFLKTIFQSENGHDVSTDVQQRARRSNRRRQEGNKIGLKDVITLRRHVETK GRAKIRKMKVTTKINHHDKINGKRKTAKKQ >gi568815582r:11737848_11951512|GENSCAN_predicted_CDS_5|453_bp atgcgggtgctgtggatgctcttttggctcctcttttggctcctgctggaatttatcagc catcagtccacctgtgttatcaatactctggctgaccatcatcatcgtgggactgacttt ggtggaagtccttggttacgtatcattattgcatttccgagaagttataaagttgtcctt 
accctctggacagtttacctttggttgtctttcctgaagactatcttccagtctgaaaat ggacatgatgtatccacagatgtacagcagagagccaggaggtccaaccgccgtagacag gaaggaaataaaattggcctgaaagacgtcattactctacggagacatgtggaaacaaaa ggtagagctaaaatccgtaagatgaaggtgacaacgaaaatcaaccatcatgacaaaatc aatggaaagaggaagaccgccaaaaaacagtaa