GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:18:03 Sequence gi568815595r:145970720_146260989 : 290270 bp : 36.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 930 978 49 2 1 37 84 77 0.789 3.36 1.02 Term + 5008 5255 248 2 2 75 48 136 0.528 3.27 1.03 PlyA + 5370 5375 6 1.05 2.03 PlyA - 8329 8324 6 1.05 2.02 Term - 26030 25679 352 1 1 88 43 192 0.615 7.67 2.01 Init - 34018 33963 56 1 2 67 102 48 0.804 4.91 2.00 Prom - 55145 55106 40 -2.45 3.00 Prom + 58301 58340 40 -4.05 3.01 Init + 63389 63538 150 1 0 75 -26 127 0.479 0.09 3.02 Term + 65056 65208 153 0 0 -1 47 245 0.559 8.44 3.03 PlyA + 65250 65255 6 1.05 4.18 PlyA - 65455 65450 6 1.05 4.17 Term - 69803 69686 118 1 1 -7 50 149 0.033 -1.37 4.16 Intr - 95966 95716 251 2 2 16 82 129 0.095 0.41 4.15 Intr - 100448 100323 126 2 0 83 91 143 0.995 14.06 4.14 Intr - 100704 100558 147 0 0 85 63 188 0.999 15.51 4.13 Intr - 101946 101842 105 0 0 26 103 105 0.954 5.29 4.12 Intr - 108538 108397 142 0 1 65 95 130 0.990 10.83 4.11 Intr - 111144 111019 126 2 0 114 84 134 0.999 14.47 4.10 Intr - 114554 114450 105 1 0 76 80 79 0.945 4.31 4.09 Intr - 116189 116068 122 2 2 66 48 58 0.188 -2.03 4.08 Intr - 121182 121081 102 0 0 68 95 43 0.173 2.45 4.07 Intr - 132133 132036 98 2 2 90 115 76 0.923 9.31 4.06 Intr - 133623 133560 64 0 1 98 57 39 0.786 -0.83 4.05 Intr - 135925 135813 113 2 2 112 61 129 0.999 11.78 4.04 Intr - 139729 139566 164 0 2 46 105 151 0.995 11.30 4.03 Intr - 150529 150393 137 1 2 83 58 145 0.964 9.45 4.02 Intr - 153510 153419 92 1 2 126 86 43 0.873 6.69 4.01 Init - 170196 170100 97 0 1 68 115 11 0.037 2.32 4.00 Prom - 183832 183793 40 -2.55 5.11 PlyA - 184038 184033 6 1.05 5.10 Term - 189911 189775 137 0 2 93 41 78 0.295 0.80 5.09 Intr - 190358 190162 197 1 2 20 99 164 0.036 8.84 5.08 Intr - 197028 196973 56 0 2 28 103 74 0.008 -0.24 5.07 Intr - 224563 224405 159 1 0 81 61 175 0.351 13.36 5.06 Intr - 226074 225913 162 0 0 104 52 126 0.997 9.85 5.05 Intr - 229320 229094 227 1 2 74 55 127 0.234 4.78 5.04 Intr - 232988 232802 187 2 1 34 57 127 0.286 2.54 5.03 Intr - 236042 235807 236 0 2 88 78 181 0.878 13.58 5.02 Intr - 250206 250096 111 1 0 97 94 72 0.967 8.13 5.01 Init - 254804 254624 181 2 1 72 44 215 0.618 13.01 5.00 Prom - 261267 261228 40 -4.05 6.06 PlyA - 261384 261379 6 1.05 6.05 Term - 271579 271346 234 1 0 22 43 154 0.222 -0.36 6.04 Intr - 279918 279838 81 0 0 77 98 30 0.385 1.82 6.03 Intr - 280296 280241 56 1 2 93 101 71 0.971 6.68 6.02 Intr - 286290 285660 631 0 1 -19 67 322 0.426 10.21 6.01 Init - 286992 286885 108 0 0 50 20 128 0.393 2.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 182283 182476 194 1 2 4 47 234 0.822 7.50 S.002 Init - 190270 190162 109 1 1 71 99 147 0.814 13.18 S.003 Term + 212436 212952 517 0 1 78 54 212 0.803 9.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_1|98_aa MFGDVGDMTFTSLEEQCCAGSVGLANILKNPYLVKSSKIIDLDTNNIPHSISLADSGQFQ LSIEAPKAGESSVAPKNGKKGYNLHLPKLKETPELNGN >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_1|297_bp atgtttggagatgttggagatatgacttttaccagtttagaagaacagtgctgtgctggc tctgtagggctagccaatatccttaaaaatccatatttggtgaaatctagtaaaataatt gatctagatacgaataatattccccacagtatttcactggctgattccggccaattccag ttatcaatagaagctccaaaggctggagagagttctgtggcccccaaaaatgggaagaag gggtataatcttcaccttcccaagctcaaagaaacccctgaactaaatgggaactga >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_2|135_aa MEKLKEEPRALSVAKSIQRHFELFVRHDMAIRNFKNCPADSIVQQSLRAITLDQQFLDLA CIGIICRAVTTQTTDSIESEPSRDSDSIDLEWGLRIYISNKFPDDANTAGPLKITDLRAV LLTYSCTDQLIWSKG >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_2|408_bp atggaaaagctaaaagaggaacctagagctttgtctgtggctaagtccattcaaaggcat tttgagttatttgtaagacatgatatggctatcagaaattttaaaaactgtcctgctgat tctattgtgcaacaaagtttgagagccattactctagaccagcagttccttgacttagcc tgcatcggcatcatttgtagggctgtgacaacacagactactgattctattgaatcagaa ccttccagggattctgattcaatagacctggagtggggtttgagaatttacatttctaac aagttcccagatgatgccaatactgctggtcctttaaaaattactgatcttagagcagtg cttctcacttacagctgtactgatcagttgatctggagtaagggctaa >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_3|100_aa MNAEGDSEGTVAIKRVSVPCKRDSTTEFQLLLPFEYPGTESADLISHEGLPQAGPSEDIP EEVIVIIREDRPMHVIDPEDLPVGQDVEVEDSDIHDYDPV >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_3|303_bp atgaatgctgaaggggactcagagggtactgtggcaatcaagagagtttctgtcccatgt aaaagggacagcaccactgagtttcagttgcttttgccatttgagtacccaggcactgaa tcagcagatctaatttctcatgaggggctgcctcaggcaggtccttcagaagatattcca gaagaagtcattgttatcatacgagaggacagacccatgcatgttatcgaccctgaagac cttccagtgggacaagatgtggaagtggaagacagtgatattcatgattatgaccctgtg tag >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_4|702_aa MWKTFCLGPLIIGINSSVDLDVLGKYPIIGKYDKLLVITVATKESDGFHRFMQSAKYFNY TVKVLGQGEEWRGGDGINSIGGGQKVRLMKEVMEHYADQDDLVVMFTECFDVIFAGGPEE VLKKFQKANHKVVFAADGILWPDKRLADKYPVVHIGKRYLNSGGFIGYAPYVNRIVQQWN LQDNDDDQLFYTKVYIDPLKREAINITLDHKCKIFQTLNGAVDEVVLKFENGKARAKNTF YETLPVAINGNGPTKILLNYFGNYVPNSWTQDNGCTLCEFDTVDLSAVDEVYHEKDIKVF FDKAKHEIKTIKIVGPEENLSQAEARNMGMDFCRQDEKCDYYFSVDADVVLTNPRTLKIL IEQNRKIIAPLVTRHGKLWSNFWGALSPDGYYARSEDYVDIVQGNRVGVWNVPYMANVYL IKGKTLRSEMNERNYFVRDKLDPDMALCRNAREMPCPDVFWFPIFSEKACDELVEEMEHY GKWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGF ALLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQTGNNKCLRGCGEKGPLHIGT TAVENVINYNHYGVQFGGSSKKLKMELPYDTAIPVLDIYPEEKEISVPKQYLHSCVCCST VHNSKGEGFANILAQVEEHFEGHQEVLTNKKLEELVESSTEE >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_4|2109_bp atgtggaaaacattctgtctggggccacttattataggaataaacagctctgtagattta gatgttctggggaagtaccctatcattggaaaatatgataaattattagtcataactgta gcaacaaaagaaagtgatggattccatcgatttatgcagtcagccaaatatttcaattat actgtgaaggtccttggtcaaggagaagaatggagaggtggtgatggaattaatagtatt ggagggggccagaaagtgagattaatgaaagaagtcatggaacactatgctgatcaagat gatctggttgtcatgtttactgaatgctttgatgtcatatttgctggtggtccagaagaa gttctaaaaaaattccaaaaggcaaaccacaaagtggtctttgcagcagatggaattttg tggccagataaaagactagcagacaagtatcctgttgtgcacattgggaaacgctatctg aattcaggaggatttattggctatgctccatatgtcaaccgtatagttcaacaatggaat ctccaggataatgatgatgatcagctcttttacactaaagtttacattgatccactgaaa agggaagctattaacatcacattggatcacaaatgcaaaattttccagaccttaaatgga gctgtagatgaagttgttttaaaatttgaaaatggcaaagccagagctaagaatacattt tatgaaacattaccagtggcaattaatggaaatggacccaccaagattctcctgaattat tttggaaactatgtacccaattcatggacacaggataatggctgcactctttgtgaattc gatacagtcgacttgtctgcagtagatgaagtttatcatgaaaaggacatcaaggtattt tttgataaagctaagcatgaaatcaaaactataaaaatagtaggaccagaagaaaatcta agtcaagcggaagccagaaacatgggaatggacttttgccgtcaggatgaaaagtgtgat tattactttagtgtggatgcagatgttgttttgacaaatccaaggactttaaaaattttg attgaacaaaacagaaagatcattgctcctcttgtaactcgtcatggaaagctgtggtcc aatttctggggagcattgagtcctgatggatactatgcacgatctgaagattatgtggat attgttcaagggaatagagtaggagtatggaatgtcccatatatggctaatgtgtactta attaaaggaaagacactccgatcagagatgaatgaaaggaactattttgttcgtgataaa ctggatcctgatatggctctttgccgaaatgctagagaaatgccctgtccagatgtcttt tggttccccatattttctgaaaaagcctgtgatgaattggtagaagaaatggaacattac ggcaaatggtctgggggaaaacatcatgatagccgtatatctggtggttatgaaaatgtc ccaactgatgatatccacatgaagcaagttgatctggagaatgtatggcttcattttatc cgggagttcattgcaccagttacactgaaggtctttgcaggctattatacgaagggattt gcactactgaattttgtagtaaaatactcccctgaacgacagcgttctcttcgtcctcat catgatgcttctacatttaccataaacattgcacttaataacgtgggagaagactttcag acaggcaataacaaatgcttgagaggatgtggagaaaagggacccctccacattgggacc actgctgttgagaatgtaattaattacaaccactatggagtacagtttggaggttcctca aaaaaactaaaaatggagctgccatatgatacagcaattccagtgcttgatatataccca gaagaaaaagaaatcagtgtgccgaagcaatatctgcactcctgtgtttgttgcagcact gttcacaatagcaaaggagaaggatttgccaacatacttgctcaggtggaagaacatttt gaaggccatcaagaagtattaacaaataagaaattggaagaacttgttgaatcatctaca gaggaatag >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_5|550_aa MGSCAARASPTSTTPCSTAPSPIDHPRAEECKRMARDWQAAPPAALLRDPLGEASCAPES GVVPTAPEQPAGEMENQTKPPDPRPDAPPEYNSHFLPGPPGTAVPPPTGYPGGLPMGYYS PQQPSTFPLYQPVGGIHPVRYQPGKYPMPNQSVPITWMPGPTPMANCPPGLEYLVQAEKG EEATEKKSLKLAEVGSQGLRGEKRHFCNIEVQGEAASPDVEAAVSFSEDPNYNKIIDEVM TCFETNNRYDIKNNSDQMVYIVTEDTDDFTRNAYRTLRPFVLRVTDCMGREIMTMQRPFR CTCCCFCCPSARQELEVQCPPGVTIGFVAEHWNLCRAVYSIQNEKKENVMRVRGPCSTYG CGSDSVFEVKSLDGISNIGSIIRKWNGLLSAMADADHFDIHFPLDLDVKMKAMIFGACFL ILCQNRDDQNQQTKSHGKDRADPGLCVLARSAALRGRGRLSPRGPRGPNMGGCTVKPQLL LLALVLHPWNPCLGADSEKPSSIPTEQVGVSFWTCRSWAPTPFLQLSFPLLSESWHGFIL DGRGSTPSVT >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_5|1653_bp atgggctcctgtgcggcccgagcctccccgacgagcaccaccccctgctccaccgcaccc agtcccatcgaccacccaagggctgaggagtgcaagcgcatggcacgggactggcaggca gctccacctgcagccctgctgcgggatccactaggtgaagccagctgtgctcctgagtct ggtgtggtacccacagcccctgaacagcctgcaggtgaaatggaaaatcaaacaaaacca ccagatccaaggcctgatgctcctcctgaatacaattctcattttttaccaggaccccct ggaacagctgtccctccacctactggctacccaggaggcttgcctatgggatactacagt ccacagcaacccagtaccttccctttgtaccagccagttggtggtatccatcctgtccgg tatcagcctggcaaatatcctatgccaaatcagtctgttccaataacatggatgccaggg ccaactcctatggcaaactgccctcctggtctggaatacttagttcaggctgagaaaggt gaggaagctacagaaaaaaaaagtttgaagctggcagaggttggctcacaaggcttaagg ggggaaaaaaggcacttttgtaacatagaagtgcaaggtgaagcagcaagtcctgatgta gaagctgcagtaagtttttcagaagatcctaactataacaagatcattgatgaagtgatg acatgttttgaaactaataatagatatgatattaaaaacaactcagaccagatggtttac attgtaaccgaagacacagatgactttaccaggaatgcctatcggacactaaggcccttc gtcctccgggtcactgattgtatgggccgagaaatcatgacaatgcagagacccttcaga tgcacctgctgttgcttctgttgcccctctgccagacaagagctggaggtgcagtgtcct cctggtgtcaccattggctttgttgcggaacattggaacctgtgcagggcggtgtacagc atccaaaatgagaagaaagaaaatgtgatgagagttcgtgggccatgctcaacctatggc tgtggttcagattctgtttttgaggtcaaatcccttgatggcatatccaacatcggcagt attatccggaagtggaatggtttgttatcagcaatggcagatgctgaccattttgacatt cacttcccactagacctggatgtgaagatgaaagccatgatttttggagcttgcttcctc attctctgccagaatagggatgatcagaaccagcagactaagagccatgggaaagacagg gcggacccgggtctctgcgttctcgcgagaagcgcggcgctgcggggccgtgggcgcctg agcccgcgcggccctcgagggccgaatatggggggatgcacggtgaagcctcagctgctg ctcctggcgctcgtcctccacccctggaatccctgtctgggtgcggactcggagaagccc tcgagcatccccacagaacaagttggagtttccttttggacttgcaggtcatgggcacca acacctttcctgcagctctcctttcccctcctttccgagagttggcatggatttatcctt gatggacgaggcagtactccttctgtgacctaa >gi568815595r:145970720_146260989|GENSCAN_predicted_peptide_6|369_aa MDSEVQTEEVSDGNKEFIGNRSEGSPRYALAKNLAAYAGPWYASLKKTWQLSTRFQRMYK KAWLPGQEPATGAESSQRISTRTAQRGNVGLEPQHRGPTRALPNGAVGNVGLEPQHRVPT RALPNGAVGRGLPAFRPQNGRATDSLQPHPGTKTGNQLQPMRVAMGATSWNATGVELPKV LGAHYLHQCALDVGHGVKGDFGALRFNDCSVGMQTCVGPIAPFVWPISPFWNGNVYPMPV PPQYLVVQLDPASGVPGAQHSAGQAVSLDWVSHGSLAFLRRGSHGLRPGHSQKNFRQIQQ SLSERKTIHESGSPQNQKSFRELHRATWAGSIHKQRKEVTYINQLIGSSLVFALFGHGVM RHLPSTDMV >gi568815595r:145970720_146260989|GENSCAN_predicted_CDS_6|1110_bp atggacagtgaagtccagactgaggaggtctcagatggaaataaggaatttattgggaac aggagtgaaggtagcccccgttacgccctcgcaaagaacttagctgcatatgcaggccca tggtatgcaagtttgaagaagacttggcagctttccactcgatttcagaggatgtataag aaagcttggctgcccgggcaggagcctgctacaggggcagagtcctcacagagaatctct actaggacagcacagaggggaaatgtggggttggagccccaacacagaggccccacgagg gcactacctaatggagctgtgggaaatgtggggttggagccccaacacagagtccccacg agggcactacctaatggagctgtggggagggggctgccagccttcagaccccagaatggt agagccactgacagcttgcaacctcatcctggaaccaaaacaggcaaccaactccaacct atgagagtagccatgggggctacatcctggaatgccacaggggtggagctgcccaaggtg ttgggagcccactacttgcatcagtgtgccctggatgtgggacatggagtcaaaggagat tttggagctttgagattcaatgactgctctgttgggatgcagacttgtgtggggcctatt gcccctttcgtttggccaatttctcccttttggaatgggaatgtttacccaatgcctgta ccaccacagtatcttgtagtacaactagacccggcgtctggcgtccccggtgcccagcat tctgcggggcaggcggtctcgcttgattgggtttctcatgggtctctggcgtttctacgg cgcggctctcacggactcaggccaggccactcgcagaaaaactttaggcaaattcagcag agtttatctgagcgaaaaacgattcatgaatcaggcagccctcagaaccagaagagtttc agagagctccaccgagcaacatgggcaggcagtattcataaacagaggaaagaagtgacg tacataaatcagttgattggatccagcctggtatttgctttatttggtcatggtgtgatg aggcatttgccttcaacggacatggtctga