GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:59:21 Sequence gi568815597r:27995745_28196422 : 200678 bp : 42.91% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 146 141 6 -0.45 1.12 Term - 1634 1572 63 2 0 99 44 69 0.692 0.51 1.11 Intr - 4305 4216 90 0 0 106 99 51 0.984 7.27 1.10 Intr - 8675 8592 84 2 0 88 86 91 0.975 8.00 1.09 Intr - 15342 15203 140 1 2 117 10 154 0.121 9.76 1.08 Intr - 17550 17367 184 0 1 89 107 148 0.988 15.24 1.07 Intr - 21495 21410 86 1 2 58 80 75 0.949 2.32 1.06 Intr - 39936 39800 137 2 2 42 87 149 0.783 9.49 1.05 Intr - 43161 43095 67 1 1 82 106 19 0.705 0.14 1.04 Intr - 43885 43742 144 2 0 36 -14 161 0.403 0.03 1.03 Intr - 46906 46827 80 0 2 59 106 83 0.972 5.48 1.02 Intr - 92971 92780 192 0 0 40 92 167 0.320 9.99 1.01 Init - 100678 100002 677 1 2 101 2 744 0.144 61.78 1.00 Prom - 108961 108922 40 -6.15 2.04 PlyA - 110348 110343 6 1.05 2.03 Term - 117379 117264 116 2 2 80 29 172 0.944 8.25 2.02 Intr - 119791 119628 164 0 2 58 10 85 0.569 -3.60 2.01 Init - 125577 125393 185 0 2 94 92 233 0.774 20.94 2.00 Prom - 130995 130956 40 -3.45 3.03 PlyA - 131607 131602 6 1.05 3.02 Term - 133805 133526 280 1 1 109 52 192 0.926 11.63 3.01 Init - 146884 146766 119 1 2 29 67 122 0.561 3.92 3.00 Prom - 151313 151274 40 -3.85 4.02 PlyA - 151446 151441 6 1.05 4.01 Sngl - 155277 154249 1029 0 0 85 37 1003 0.859 91.73 4.00 Prom - 160589 160550 40 -6.15 5.10 PlyA - 160812 160807 6 1.05 5.09 Term - 167739 167608 132 0 0 59 37 154 0.662 4.51 5.08 Intr - 168720 168592 129 0 0 69 21 115 0.574 2.87 5.07 Intr - 178379 178353 27 2 0 96 70 43 0.208 0.69 5.06 Intr - 179857 179724 134 2 2 103 54 65 0.232 4.04 5.05 Intr - 181124 181022 103 2 1 110 25 71 0.330 1.83 5.04 Intr - 183128 183008 121 1 1 61 109 42 0.361 3.18 5.03 Intr - 184884 184715 170 0 2 92 42 102 0.201 3.82 5.02 Intr - 198098 197978 121 1 1 1 110 79 0.655 0.98 5.01 Intr - 198492 198165 328 1 1 103 70 103 0.512 3.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 9787 9710 78 1 0 44 64 80 0.840 0.31 S.002 Term - 15342 15197 146 1 2 117 54 169 0.879 13.49 S.003 Sngl - 100678 99998 681 1 0 101 42 743 0.855 66.93 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:27995745_28196422|GENSCAN_predicted_peptide_1|647_aa MAAAAVQGERSGGSGGCSGAGGASNCGTGSGRSGLLDKWKIDDKPVKIDKWDGSAVKNSL DDSAKKVLLEKYKYVENFGLIDGRLTICTISCFFAIVALIWDYMHPFPESKPVLALCVIS YFVMMGILTIYTSYKEKSIFLVAHRKDPTGMDPDDIWQLSSSLKGFDDKYTLKLTFISGR TKQQREAEFTKSIAKFFDHSGTLVMDAYEPEISRLHDSLAIERKIKLNTCAQRFLVERGV DCEVTLAAYFRFLAMRIRVTLTPRFPPLDWFYCGSGLISMSCCGAFTAFGQVSNPDVSDQ KPETSSLASNLPMSEENIYSVHTRSSNHESQAFIAAIPDEVMGVKRTIAGKVFESSSEYH ISSSLMTCTDYIPRSSNDYTSQMYSAKPYAHILSVPVSETAYPGQTQYQTLQQTQPYAVY PQATQTYGLPPFASSTNASLISTSSTIANIPAAAVASISNQDYPTYTILGQNQYQACYPS SSFGVTGQTNSDAESTTLAATTYQSEKPSVMAPAPAAQRLSSGDPSTSPSLSQTTPSKDT DDQSRKNMTSKNRGKRKADATSSQDSELERVFLWDLDETIIIFHSLLTGSYAQKYGKDPT VVIGSGLTMEEMIFEVADTHLFFNDLEECDQVHVEDVASDDNGQDLR >gi568815597r:27995745_28196422|GENSCAN_predicted_CDS_1|1944_bp atggcggcggcagctgtacagggcgagagaagcggtggtagcggaggctgtagtggggct ggtggtgcttccaactgcgggacagggagtggccgtagcggcttgttggataagtggaag atagatgataagcctgtaaaaattgacaagtgggatggatcagctgtgaaaaactctttg gatgattctgccaaaaaggtacttctggaaaaatacaaatatgtggagaattttggtcta attgatggtcgcctcaccatctgtacaatctcctgtttctttgccatagtggctttgatt tgggattatatgcacccctttccagagtccaaacccgttttggctttgtgtgtcatatcc tattttgtgatgatggggattctgaccatttatacctcatataaggagaagagcatcttt ctcgtggcccacaggaaagatcctacaggaatggatcctgatgatatttggcagctgtcc tccagtcttaaagggtttgatgacaaatacaccttgaagctgaccttcatcagtgggaga acaaagcagcagcgggaagccgagttcacaaagtccattgctaagttttttgaccacagt gggacactggtcatggatgcatatgagcctgaaatatccaggctccatgacagtcttgcc atagaaagaaaaataaaactcaacacatgcgcccagaggtttctagtggaacgaggtgtg gattgtgaggtgactctggccgcttacttccggttcctagcgatgcgcatccgggtcacg ctaacgccgcggtttcctccgctcgattggttctactgtgggtctggactgatctccatg tcctgttgtggggcttttacagcctttggtcaagtaagcaatccagatgtcagtgatcag aagcctgaaacatcaagccttgcttcaaaccttcccatgtcagaggaaaatatttacagc gttcataccagaagtagtaatcatgagagtcaagccttcatagcagctattccagatgaa gtcatgggggtcaagaggacaatagctggaaaagtctttgagagttcatcagagtatcac atcagttcctcacttatgacatgcaccgattacatccctcgctcatccaatgattatacc tcacaaatgtattctgcaaaaccttatgcacatattctctcagttcctgtttcggaaact gcttaccctggacagactcaataccagacactacagcagactcaaccctatgctgtctac cctcaggcaacccaaacgtatggactacctccttttgcttcaagcacaaatgccagcctg atatctacttcttctacaattgccaatattccagcagcagcagtagccagcatctcaaac caggattatcccacctatactattcttggtcagaatcagtaccaggcctgctaccccagc tccagctttggagtcacaggtcagactaacagtgatgcagagagcaccacattagcagca accacataccagtcggagaagcctagtgtcatggcgcctgcacctgcagcacagagactt tcctctggagacccttctacaagtccatctttgtcccagactacaccaagtaaagatact gatgatcagtccaggaaaaacatgactagcaagaaccggggcaagaggaaagctgatgcc acttcttcccaagacagtgaattagaacgggtatttctgtgggacttggatgaaaccatc atcatcttccactcacttcttactggatcctatgcccagaaatatggaaaggacccaaca gtagtgattggctcaggtttaacaatggaagaaatgatttttgaagtggctgatactcat ctatttttcaatgacttagaggagtgtgaccaggtacatgtggaagatgtggcttctgat gacaatggccaagacttgaggtga >gi568815597r:27995745_28196422|GENSCAN_predicted_peptide_2|154_aa MLALISRLLDWFRSLFWKEEMELMLVGLQYSGKTTFVNVIASGQFSEDMIPTVGFNMREV TSVFIDKVVQQSPLISPPQKETPYPLAVTSCSPPPPHATINLNKGSSGKASSLSQGILYS GTSALRGEAAAEDLAPDGESCAFSLPNYHFEDIV >gi568815597r:27995745_28196422|GENSCAN_predicted_CDS_2|465_bp atgctggcactcatctcccgcctgctggactggttccgttcgctcttctggaaggaggag atggagctgatgctcgtggggctgcagtactcgggcaagaccacctttgtcaatgtcatc gcgtcaggtcaattcagtgaagatatgatacccacagtgggcttcaacatgagggaagta acaagtgtattcattgacaaagttgtgcaacaatcaccactaatttcaccacctcaaaaa gaaaccccatacccactagcagtcacttcctgttcccccccaccaccgcatgcaaccatt aatcttaacaagggatcttcggggaaagcgtccagtctttcacaaggaattctctattct ggaacttctgccctgagaggagaagctgctgctgaagatcttgctccagatggagagagt tgtgcatttagcttgcccaactaccattttgaggacatcgtctag >gi568815597r:27995745_28196422|GENSCAN_predicted_peptide_3|132_aa MSSSKAHSATLPSPVDIFHDTECEGPYLPSSRQERRASSRIWSSSSTLAGLGLTMGDGAS GVEALSFQGESDLFDKVLRTTHLDTSFCTQESESSILSAFPQEDFTPGCTLGFLVAGNMT QGTVALVGGTEA >gi568815597r:27995745_28196422|GENSCAN_predicted_CDS_3|399_bp atgtccagctccaaagctcattctgccacactcccttcccctgtggacatctttcatgat acagaatgtgaaggtccctatcttccatcttccagacaagaaagacgagcatcaagcaga atctggtcatcttcatccaccttggctgggctggggctgactatgggagatggtgcctct ggggtggaggcactgtctttccaaggtgagtctgatctgtttgacaaggttcttagaacc acacaccttgacacctcattctgcactcaggagagtgaatcatccatcttgagtgccttt ccccaggaagatttcacccctggatgcactctaggttttctcgttgctgggaacatgact cagggcacagtggcccttgtgggtggaacagaagcttga >gi568815597r:27995745_28196422|GENSCAN_predicted_peptide_4|342_aa MEPHDSSHMDSEFRYTLFPIVYSIIFVLGVIANGYVLWVFARLYPCKKFNEIKIFMVNLT MADMLFLITLPLWIVYYQNQGNWILPKFLCNVAGCLFFINTYCSVAFLGVITYNRFQAVT RPIKTAQANTRKRGISLSLVIWVAIVGAASYFLILDSTNTVPDSAGSGNVTRCFEHYEKG SVPVLIIHIFIVFSFFLVFLIILFCNLVIIRTLLMQPVQQQRNAEVKRRALWMVCTVLAV FIICFVPHHVVQLPWTLAELGFQDSKFHQAINDAHQVTLCLLSTNCVLDPVIYCFLTKKF RKHLTEKFYSMRSSRKCSRATTDTVTEVVVPFNQIPGNSLKN >gi568815597r:27995745_28196422|GENSCAN_predicted_CDS_4|1029_bp atggagccacatgactcctcccacatggactctgagttccgatacactctcttcccgatt gtttacagcatcatctttgtgctcggggtcattgctaatggctacgtgctgtgggtcttt gcccgcctgtacccttgcaagaaattcaatgagataaagatcttcatggtgaacctcacc atggcggacatgctcttcttgatcaccctgccactttggattgtctactaccaaaaccag ggcaactggatactccccaaattcctgtgcaacgtggctggctgccttttcttcatcaac acctactgctctgtggccttcctgggcgtcatcacttataaccgcttccaggcagtaact cggcccatcaagactgctcaggccaacacccgcaagcgtggcatctctttgtccttggtc atctgggtggccattgtgggagctgcatcctacttcctcatcctggactccaccaacaca gtgcccgacagtgctggctcaggcaacgtcactcgctgctttgagcattacgagaagggc agcgtgccagtcctcatcatccacatcttcatcgtgttcagcttcttcctggtcttcctc atcatcctcttctgcaacctggtcatcatccgtaccttgctcatgcagccggtgcagcag cagcgcaacgctgaagtcaagcgccgggcgctgtggatggtgtgcacggtcttggcggtg ttcatcatctgcttcgtgccccaccacgtggtgcagctgccctggacccttgctgagctg ggcttccaggacagcaaattccaccaggccattaatgatgcacatcaggtcaccctctgc ctccttagcaccaactgtgtcttagaccctgttatctactgtttcctcaccaagaagttc cgcaagcacctcaccgaaaagttctacagcatgcgcagtagccggaaatgctcccgggcc accacggatacggtcactgaagtggttgtgccattcaaccagatccctggcaattccctc aaaaattag >gi568815597r:27995745_28196422|GENSCAN_predicted_peptide_5|421_aa XWGLLRTGLKPSLGQGSSPPQSKDPVPLSPASPSPALPPRGAQHQRGPAQYCGMSWCAWL DVGIPGDLGVGAPPQELASWPQLPCPPSRARGQAFPNPSSCCCSLEGRADSPRSHPGPPR EAVQETCSQGTPICLSLFVTAWTIVPAVSQTHCGIGDACPEYTPPFFTHLLPRRLPCMDH INGSFVSGWGQPTGGADKMSEDGRRERTWGLGECVRCVSVEQYGRMNSRKEMSLAGVSEG RLGVSVQLLASGAVYTPPHPSLFEKPLCSAQVLAAQLGSSCGGPDMGLCTHSLVKYDRTS RLSPHTSGCSPLALTEYPAVFQVFGVHDESGEKPLSKEYLDPRECAHLECGVPPPSPLLV LALLVNAPSGLRGCKELLYSSPGETVHTSFVPGISEPSSGHGHSSIHSSVPDTGIEEMKK I >gi568815597r:27995745_28196422|GENSCAN_predicted_CDS_5|1266_bp ngctgggggctcctgaggacagggctaaaaccttctctaggtcagggctcatctccccct cagtcaaaggacccagttcctctctcccctgccagcccaagcccagcccttcctcccaga ggggcgcagcatcagaggggcccagcccagtattgtgggatgagctggtgtgcctggctg gatgtgggtattcccggtgaccttggtgttggggccccgccccaggagctggcttcctgg cctcagctgccctgcccacccagcagagcccgggggcaggcctttcctaatccttcatcc tgctgctgctcattggagggtagagcagactccccgagaagtcatccaggacctccccga gaagccgtccaggaaacatgctctcaggggacccccatctgcctcagcctctttgtcact gcctggaccattgtccctgctgtttctcagacccactgtgggattggtgatgcatgccca gagtacactccacccttcttcacccacctcttgccccggaggctgccctgtatggaccac attaatggttcctttgtttctggttggggtcagccaactggcggtgccgataagatgtca gaggatgggagaagagaaaggacatggggattgggagagtgtgtgcggtgcgtgagtgtg gagcagtatggacgcatgaactcgaggaaggagatgagcctggcaggtgtgagtgagggg agactgggcgtgagtgtgcagctcctggcctcgggcgctgtctacaccccaccccatccc tcgctctttgagaagcctctgtgcagtgcccaggtcctggcggcccagcttggaagttcc tgtggaggtccagacatggggctgtgcactcactcacttgtgaaatatgacagaaccagc aggctctctccccacacctctggctgctctcctttagctctgactgaatacccagctgtg ttccaggtgtttggggtgcatgatgagagcggggagaagcctctgagcaaggagtacttg gatccaagagagtgtgcccacttggaatgtggtgttcccccgccttcaccacttctagtc cttgccctgttggtgaatgcaccctcaggcctgcggggctgcaaggaactgctgtacagc tcccctggggagactgtgcacacttcctttgtgccggggatttcggagccatcatcaggg catggccactcatccattcattcctcggtgcccgacactgggattgaagagatgaaaaag atctag