GENSCAN 1.0	Date run: 3-Nov-116	Time: 11:08:48

Sequence gi568815588r:29926103_30147812 : 221710 bp : 44.42% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13620  13725  106  2  1   72   78    62 0.087   3.39
 1.02 Intr +  37145  37249  105  0  0   76   74   100 0.155   7.59
 1.03 Term +  43180  43256   77  2  2   62   52    94 0.540   1.20
 1.04 PlyA +  44399  44404    6                               1.05

 2.04 PlyA -  44832  44827    6                               1.05
 2.03 Term -  67162  67046  117  1  0   98   54    49 0.552   1.04
 2.02 Intr -  67423  67329   95  2  2   74   35    78 0.299   0.78
 2.01 Init -  81239  81140  100  2  1   41   72   102 0.297   4.32
 2.00 Prom -  81345  81306   40                              -3.46

 3.04 PlyA -  83510  83505    6                               1.05
 3.03 Term -  91815  91781   35  1  2  126   45    25 0.402  -0.25
 3.02 Intr - 103764 100001 3764  2  2  136   94  1761 0.925 168.90
 3.01 Init - 121710 121430  281  0  2   79   98   181 0.832  14.69
 3.00 Prom - 128230 128191   40                              -3.86

 4.04 PlyA - 130073 130068    6                               1.05
 4.03 Term - 132083 132046   38  0  2  105   46    49 0.817  -0.10
 4.02 Intr - 132978 132845  134  2  2   77   52   107 0.819   6.29
 4.01 Init - 133480 133380  101  1  2   66  123   115 0.944  10.54
 4.00 Prom - 164033 163994   40                              -5.26

 5.06 PlyA - 165338 165333    6                               1.05
 5.05 Term - 166530 166474   57  0  0   94   44   105 0.854   4.39
 5.04 Intr - 176928 176894   35  1  2   89   99    -9 0.001  -1.76
 5.03 Intr - 184153 183997  157  1  1   35   78   127 0.110   5.98
 5.02 Intr - 185874 185830   45  0  0   45  101    45 0.056   0.11
 5.01 Init - 220997 220839  159  2  0   58   75   130 0.957   8.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  18890  18951   62  1  2   79   78    77 0.849   6.52
S.002 Term +  21321  21531  211  0  1   85   53    81 0.806   1.07
S.003 Init +  50045  50052    8  1  2  111  111    11 0.801   5.42
S.004 Init + 171875 171944   70  1  1   74   92    49 0.982   5.11
S.005 Term + 172130 172242  113  0  2  126   48    63 0.984   4.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:29926103_30147812|GENSCAN_predicted_peptide_1|95_aa
VINLTLNLHFHYYDLNVCVSPKFPEILNLKVMVLGGCSHSVLPPCEEDACFSFAFCHDYK
FPEASAAIRNCTLQILLKPSGLDINTVQMRRAESA
>gi568815588r:29926103_30147812|GENSCAN_predicted_CDS_1|288_bp
gtcataaatctaacactcaacttgcatttccactactatgatctgaatgtctgtgtctcc
ccaaaattccctgaaattctaaacctcaaggtgatggtattaggaggttgttctcactct
gtcctgccgccctgtgaagaagatgcctgcttctcctttgccttctgccatgattataag
tttcctgaggcctctgcagcaattcggaactgcacactgcagatcctgctgaagccatct
ggcctggacattaacacagtgcagatgcggcgtgcagagagtgcctga
>gi568815588r:29926103_30147812|GENSCAN_predicted_peptide_2|103_aa
MKDIRGKIVSKEKVHAVQINMAANILKDVDYPLALPRKSARDLASLGLLRFSELAPVGPH
LSDNERPLMVDRPQHGPGQIVRHREVTCHLPTVRTMMADTTRS
>gi568815588r:29926103_30147812|GENSCAN_predicted_CDS_2|312_bp
atgaaagatatcaggggcaaaattgtgtccaaggagaaagttcatgcagtacaaattaac
atggcagcaaatattctgaaggatgtagactatcccctcgccttgcctcgtaaatcagcc
cgcgacctcgcctccctggggctgctgcgcttctctgaactcgctccagtgggtccacat
ctttctgataatgagaggcccttgatggtggacagaccccagcacggccctggacagatt
gtgaggcacagagaagtcacctgccacctccccaccgtcagaacaatgatggcagacacc
accaggtcctga
>gi568815588r:29926103_30147812|GENSCAN_predicted_peptide_3|1359_aa
MYSVEDLLISHGYKLSRDPPASREDNPKGRQAARTGTRAGQGLQNGHEDGPAALAHRKTS
AGKGHVSDSESRRSTPRGHGEPQSTSASRTSEAGFCNQPPSAWSSHPPTGNDQAYRRRGR
QEARSQKPREHENLEARGMAQAHSLPVHVREGPWEVGGRSEHVMKKPVWEEELRMSGPAK
WQNVSLESWNQPRKLGRQMSDGDGERLFQDLYPFIQGEHVLNSQNKGKSRSLPRVLSPES
LSCTEIPIPLNERHSPKMPPYPPTCAPNLDSTRNSEKSGCSAPFPRPKFGRPLKPPSYSS
HQQSRGGADSSDSQDSQQMDAYVPRHELCLSDPGLEPPVYVPPPSYRSPPQNIPNPYLED
TVPINVCGGHSQQQSPTEKAGASGQPPSGPPGTGNEYGVSPRLPQGLPAHPRPVTAYDGF
VQYIPFDDPRLRHFKLAQPQGFCEDIKLDDKSYNSSPVTAQEPAHGGMQPDGAIWNPQSL
IPPSGDERGLVLADSSPRWLWGQPPGDGENSGLPNQRDRCVARGQWPDVRGSQHGHTGRQ
VSSPYSQGESTCETQTKLKKFQTGTRTKKSSKKKMNETIFCLVSIPVKSESHLPDRDMDN
NDLKPSADQKNGSDKSPALQEQSLLSMSSTDLELQALTGSMGGRTEFQKQDLGEPEEDRQ
TNDLSFIHLTKHRELKHSGSWPGHRYRDQQTQTSFSEEPQSSQLLPGAKLGGPSRAALSP
KCSDPAASEAQTHTAFPTGDHKQRPSARNLKGHRSLSPSSNSAFSRTSLSVDQAPTPKAG
RSQPCVDVHGLGAHPGPKREVVKGEPTGPCNSKQLFGQFLLKPVSRRPWDLISQLESFNK
ELQEEEESSSSSSSSSSSSEESEAEPQQENRAHCRQEDVGFRGNSPEMRVEPQPRMWVPE
SPVCRSGRGESKSESWSEELQPGHPRAWPPSPGRFRVEEGGGAPFCSADGSTSAEKRHLE
VSNGMDELAGSPFPVTRMSSRSSDAKPLPASYPAEPREPQESPKITSAFSSVKPSEAVPR
KFDSGGERGAGLPLSLSNKNRGLSAPDLRSVGLTPGQEQGASELEGSLGEASTIEIPPGE
SLQARAARILGIEVAVESLLPGIRRAGQNQPAEPDASACTPESPQEELLSRPAPADVPRV
STDAFYGRRKCGWTKSPLFVGDRDSARRAPQAFEHSDVDGVVTSTDPVPEPEPSPLESKF
FEQKDVETKPPFRSTLFHFVERTPSVAGSEKRLRSPSKVIESLQEKLASPPRRADPDRLM
RMKEVSSVSRMRVLSFRNADSQEDAEELKATTRGQAGLPGGLVSPGSGDRAQRLGHSLSV
SKDSISREEKEHPAAQKEKSMDQDFWCPDSYDPSRVERV
>gi568815588r:29926103_30147812|GENSCAN_predicted_CDS_3|4080_bp
atgtacagtgtagaagacctcctgatctctcatggatacaagctgtcaagagacccccca
gcatcacgcgaggataaccccaaggggcgccaggcagcgaggactgggacacgagcaggc
cagggcctgcagaacgggcatgaggatggccctgcggccctcgcacatcgtaagacgtcc
gcggggaaaggacatgtgagtgactccgaaagccgccgcagcacaccgagaggccacggg
gagccccagagcacttctgcttccagaacctcggaggcggggttttgtaatcaacccccc
tcagcatggtcctctcatcccccgactggtaacgaccaagcctaccggagaagaggacgg
caagaagccaggagccagaagccgagggagcacgaaaacctggaggccagaggaatggcc
caagcccacagcctgcctgtccacgtgagggagggtccatgggaagttggaggaaggtca
gagcatgtgatgaagaagccagtttgggaagaagaattgcgaatgtcaggtcctgccaag
tggcagaacgtcagcctggaaagctggaaccagccaaggaaattagggaggcagatgtct
gatggagatggggagagactgtttcaagacctgtacccattcattcaaggagaacatgtg
ttgaattctcaaaacaaagggaagtctcgctcactgcctagagttctttcccccgagagc
ctgagttgcacggaaattcccattccattaaatgaaagacattcacctaaaatgccaccg
tatcctcccacttgcgcaccaaatttggactccacgaggaattctgagaagagtggctgc
tcagccccatttccccggcctaagtttgggaggcccctcaagcccccatcttacagctcg
caccagcagtctaggggaggagcggacagcagtgactctcaggacagccagcagatggac
gcctatgtccccaggcatgagctctgcctgtcagaccctggattggaacctccagtgtac
gtgcctccgccctcatacagatcgcccccgcagaacatcccaaacccctacttggaagac
acggtgcccataaatgtgtgtggcggtcacagtcaacagcagtctccgaccgagaaggct
ggggccagcggtcagcctccttcaggcccccctggaactgggaatgagtatggtgtgagc
ccccgcttgcctcaggggctccccgcacatccccgacctgtcactgcctatgacggcttc
gttcagtacattccctttgatgatccacggttacgacattttaaactagctcagccccag
ggtttctgtgaagacataaagcttgacgataaatcatataactccagtcctgtcactgct
caagagccggctcatggaggaatgcagcctgatggtgccatttggaatccacagagctta
atacccccgtcgggggatgagagaggcctggtcttggccgattccagcccccggtggctg
tggggccagccccccggggatggggaaaacagtggcctccccaaccagagagaccgctgt
gtggcaaggggacagtggcctgatgtgagaggcagccagcacgggcacactggaagacaa
gtttcctccccttactcacagggcgagagcacctgcgaaactcaaaccaagctcaaaaag
ttccaaactgggactcggaccaagaaaagttcaaagaaaaaaatgaacgagactatattt
tgcttggtttctatcccagtgaaatcagaatcacatctgccagatagagatatggacaac
aatgacttaaagcccagtgctgatcaaaagaatgggtctgataagagcccggctctgcaa
gaacagagtctgctgagcatgtcttccaccgacctggagctgcaggccctcacaggaagc
atgggtgggagaacggagttccaaaaacaagatctaggggaaccagaagaagacagacaa
acaaatgacctcagtttcatccaccttacaaagcacagagaactcaagcattctggctct
tggccagggcaccggtacagagatcagcaaacacaaaccagtttctccgaggagccccaa
agttcgcagctgctccctggtgcaaagctgggagggccgagtcgtgcagcattgagtcca
aaatgttcagaccctgctgcctccgaagctcagacgcacacagcattccctaccggtgat
cacaaacagaggccaagtgcccgtaacctgaaaggtcacaggtccctcagcccatccagc
aacagtgcgttctcaaggacttccttgtccgtggaccaggcaccgacgccaaaagcaggc
cgaagtcagccctgcgtggatgtccacgggcttggagcccaccctgggcctaagcgggag
gtggtgaagggggagcccacgggcccttgcaacagtaaacaactctttgggcagtttctc
ctgaaaccggtcagccgtcgtccctgggatttgatcagtcagttagaaagttttaacaag
gagctccaggaagaggaagaaagcagcagtagcagcagcagcagcagcagcagcagtgag
gagagtgaggcggagccgcagcaggagaaccgtgctcactgcagacaggaggatgtgggc
ttccgcggaaacagcccggaaatgagggttgagccacagccgaggatgtgggtgccggag
agccctgtgtgtaggtcgggaagaggtgagagtaagtctgagagctggagtgaggagctg
cagcctggccacccacgtgcctggcctccatccccgggccgctttcgcgtggaagaaggt
ggcggtgcacctttctgctcagcagatggaagcacgagtgcagagaagagacacctggag
gttagcaacggaatggacgagctggcaggtagcccatttcctgtgacgagaatgtcttca
agatcaagtgacgcaaaaccactgcccgcgtcctatccagctgaacctagggagccccag
gaaagtccgaaaatcaccagtgctttcagctctgtgaaaccaagtgaagcggtccctcgg
aagtttgacagtggtggagagaggggggcagggctcccactgtccctgtctaacaagaac
cgagggctctcagctccagacttacggtctgtggggctcacccctgggcaagaacagggt
gccagtgagctagaggggtctttgggtgaagcaagcacaatagaaatccccccaggtgag
tccttgcaagccagggctgcaaggatcctgggcattgaggtggcggtggagtccctcctg
ccgggcatccggagagcgggacagaaccagcctgctgagcccgatgcaagtgcctgcacc
ccagagtccccccaggaagagttgctatctcgcccagcaccggcagatgtccccagggtg
tccactgatgccttttatggcaggaggaagtgcggctggaccaagagccctctctttgta
ggggacagggacagtgccaggcgggctcctcaggcttttgagcactcagatgtggacggg
gttgtcaccagcacagaccctgtccctgagcctgagcccagccccttggagtccaagttc
ttcgaacaaaaggatgtggaaacaaaaccacccttcaggtccactttattccattttgta
gaaagaaccccaagtgtggcaggctctgaaaagagacttagaagcccttccaaagtgatt
gaaagtttacaagagaaactggcctccccgcctaggagagcagaccctgaccgcctgatg
agaatgaaagaggtgagctcagtgtcacggatgagagtcctgagcttcaggaatgccgac
tcccaggaggacgccgaggaattgaaggccaccacaaggggccaggccgggctcccggga
ggccttgtgtctcctggcagtggggaccgtgcccagagattgggccactcactctctgtg
tccaaggacagcatctccagggaagagaaggagcatccggcagcacaaaaggagaagagc
atggatcaagacttctggtgcccagattcctatgaccctagcagagtggagagggtgtga
>gi568815588r:29926103_30147812|GENSCAN_predicted_peptide_4|90_aa
MLGLGGCRRGGSRWSPAGRSGGGGGARRHEGAARLCPPTPPAPSRPDLADLTPELTAAAL
GPRGPSSHAAPAVWTARDDLQGIDNPEIEG
>gi568815588r:29926103_30147812|GENSCAN_predicted_CDS_4|273_bp
atgctcggcctcggcggctgcaggcggggcgggtcgcgctggtccccggccggccggagc
ggcggcggtggcggcgcgaggcggcatgagggagccgctcgcctgtgtccaccaacccct
cccgcgccctcgcgcccagatctcgcggacctcaccccagagctgacagccgccgcgctg
ggtccccgaggtcccagttcccacgctgccccggccgtctggactgcccgggacgattta
caaggaatagataacccagagattgagggctag
>gi568815588r:29926103_30147812|GENSCAN_predicted_peptide_5|150_aa
MLNITNYQRNANQNHNAIPPYSSKNSPNQKIKKKVNVGVNAVKREHFYTVIGNLFGGIYL
GVELLDHMGSIHQEVHTSAGPYINGPYISRSIHQRVHTSAAHISWSIHQRGVYTSSGRYI
SPIVPMCHGRDLEGESFFVEDSKRLIVWAA
>gi568815588r:29926103_30147812|GENSCAN_predicted_CDS_5|453_bp
atgctcaacatcactaattatcagagaaatgcaaatcaaaaccacaatgcgataccacct
tactccagcaagaacagccctaatcaaaaaatcaaaaagaaagtaaatgttggcgtgaat
gccgtgaaaagggaacacttttacactgttatcgggaatctctttgggggcatctacctg
ggagtggaattgctggatcatatggggtccatacatcaggaggttcacacatcagctgga
ccatacatcaatggtccatacatcagcaggtccatacatcagcgggtccatacatcagct
gcacacatcagctggtctatacatcagcggggagtctatacatcatctggtcgatacata
agtcccatagttcccatgtgtcatgggagggacctggaaggagaaagcttctttgtggaa
gactctaagcgactcatcgtgtgggctgcctga