GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:13:07 Sequence gi568815596r:127748110_127958100 : 209991 bp : 44.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 15050 14953 98 1 2 67 94 58 0.887 3.95 1.04 Intr - 16870 16719 152 1 2 105 15 143 0.996 7.66 1.03 Intr - 17160 17065 96 0 0 106 95 14 0.944 4.11 1.02 Intr - 20892 20824 69 0 0 65 95 43 0.802 2.08 1.01 Init - 22872 22669 204 0 0 100 99 127 0.993 13.85 1.00 Prom - 36602 36563 40 -3.06 2.00 Prom + 43541 43580 40 -3.26 2.01 Init + 46207 46209 3 0 0 98 53 0 0.068 -2.50 2.02 Intr + 57477 57681 205 2 1 32 80 187 0.515 11.07 2.03 Intr + 63019 63150 132 2 0 64 66 82 0.400 4.22 2.04 Intr + 70545 70575 31 1 1 87 68 14 0.002 -3.51 2.05 Intr + 84643 85108 466 1 1 66 44 216 0.151 8.13 2.06 Term + 93740 93859 120 1 0 59 48 124 0.395 3.97 2.07 PlyA + 94207 94212 6 1.05 3.04 PlyA - 95464 95459 6 1.05 3.03 Term - 100076 99998 79 1 1 80 33 146 0.626 5.54 3.02 Intr - 104996 104816 181 0 1 29 95 225 0.789 16.23 3.01 Init - 109991 109919 73 2 1 107 53 35 0.758 3.18 3.00 Prom - 114717 114678 40 -6.86 4.08 PlyA - 115128 115123 6 1.05 4.07 Term - 117096 116985 112 2 1 117 47 81 0.973 4.93 4.06 Intr - 118887 118791 97 1 1 131 115 69 0.999 12.97 4.05 Intr - 119170 119036 135 2 0 66 98 26 0.739 1.94 4.04 Intr - 121435 121345 91 1 1 86 113 99 0.946 11.77 4.03 Intr - 122819 122705 115 1 1 103 78 97 0.999 10.65 4.02 Intr - 123250 123140 111 0 0 51 95 97 0.986 6.19 4.01 Init - 126125 125719 407 2 2 61 75 371 0.999 29.16 4.00 Prom - 150964 150925 40 -1.86 5.03 PlyA - 151461 151456 6 1.05 5.02 Term - 158103 158035 69 0 0 95 42 38 0.269 -2.16 5.01 Init - 167932 167729 204 1 0 41 12 202 0.511 6.75 5.00 Prom - 185275 185236 40 -3.56 6.07 PlyA - 186057 186052 6 1.05 6.06 Term - 194055 193897 159 0 0 118 41 118 0.993 8.04 6.05 Intr - 194428 194315 114 1 0 112 36 38 0.756 1.64 6.04 Intr - 197450 197347 104 0 2 56 82 62 0.993 2.29 6.03 Intr - 201885 201760 126 1 0 89 96 50 0.987 6.55 6.02 Intr - 202299 202051 249 1 0 55 23 244 0.746 12.01 6.01 Intr - 207235 206877 359 0 2 89 119 248 0.960 22.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_1|207_aa MATEIGSPPRFFHMPRFQHQAPRQLFYKRPDFAQQQAMQQLTFDGKRMRKAVNRKTIDYN PSVIKYLENRIWQRDQRDMRAIQPDAGYYNDWTPEGRRLVTGASSGEFTLWNGLTFNFET ILQAHDSPVRAMTWSHNDMWMLTADHGGYVKYWQSNMNNVKMFQAHKEAIREASFSPTDN KFATCSDDGTVRIWDFLRCHEERILRX >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_1|621_bp atggctacagaaattggttctcctcctcgttttttccatatgccaaggttccagcaccag gcacctcgacagctgttttataagcgacctgattttgcacaacagcaagcaatgcaacag cttacttttgatggaaaacgaatgagaaaagctgtgaaccgaaaaaccatagactacaat ccatctgtaattaagtatttggagaacagaatatggcaaagagaccagagagatatgcgg gcaattcagcctgatgcaggttattacaatgattggactccagaaggaagacgcttggtc actggagcttctagtggggagtttaccctgtggaatggactcactttcaattttgaaaca atattacaggctcacgacagcccagtgagggccatgacgtggtcacataatgacatgtgg atgttgacagcagaccacggaggatatgtgaaatattggcagtcgaacatgaacaacgtc aagatgttccaggcacataaggaggcgattagagaggccagtttctcacccacggataat aaatttgctacatgctctgatgacggcactgttagaatctgggactttcttcgttgccat gaggaaagaattctccgagnn >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_2|318_aa MSQEGAAGARSGLKGGKKNPLKLPKKQAKKMNKAFKQKQEKERKKHKELKAKASWKGLRP QVEFKNLAERAAAIFPYGPPNARSGVITHWRDGATSRAPGPRASVRFDLRMCLVFCGCET MNLRLMKACRTGYSHHADDLLNSGMDTFMGNNSWNGTVQTSTLAKGGAYLEQRNTLGTSP LAAELQEVLGPVVLMQDKAVGPVAPPDPGPSPFKEGCPPIPDDAWYTDGSSWGATAPWVA VTVHPSTDTIWFDTRCGQSSQWAELKAVWMVNTKEVTPMNLGDRRLINVPVITAALVGIV HLAEILFDLSALQVIVGS >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_2|957_bp atgagtcaggaaggggcagcaggtgccaggtccggcctcaaaggtggcaagaagaaccct ttgaaactgcccaaaaagcaggccaagaagatgaataaggctttcaagcagaaacaagaa aaggaacgaaagaaacacaaggagctaaaagcgaaagcctcctggaaggggctccggcca caggtggaatttaaaaatctggcagaaagagcagccgccatcttcccctatgggccaccg aacgcacgctctggggtcatcacgcactggcgggacggcgctacgtcacgcgcgcctgga ccgcgggcctctgtgaggttcgatctgcgcatgtgcctagtgttctgtggctgtgagaca atgaatctcaggcttatgaaagcatgcaggacgggctacagtcatcatgcagatgattta ctcaacagtgggatggatacgttcatgggtaacaactcgtggaatgggacagtgcagaca tccactttggcaaaggggggcgcctacttagagcagcggaatacgctgggtacaagtccc ttagcggcagagttgcaagaggtcttgggacctgtagtcctaatgcaagataaggccgtg gggcctgtggcacccccagaccctgggccttcaccatttaaggaagggtgtccccccatt cctgatgatgcatggtatacagatgggtctagctggggtgctactgctccctgggttgct gtcacagtccaccctagtactgacaccatatggtttgataccaggtgtggacaaagtagc caatgggctgaactcaaagccgtgtggatggtgaacaccaaagaggtgacacctatgaat ttgggggatcgacgtttgattaatgttcctgttatcactgctgctcttgtgggcattgtt catcttgctgaaattctctttgatctcagtgccctgcaggtcattgttggtagctaa >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_3|110_aa MAAGGSDPRAGDVEEDASQLIFPKEFETAETLLNSEVHMLLEHRKQQNESAEDEQELSEV FMKTLNYTARFSRFKNRETIASVRSLEGRFEDEELQQILDDIQTKRSFQY >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_3|333_bp atggcggcgggtggcagcgatccgcgggctggcgacgtagaggaggacgcctcacagctc atctttcctaaagagtttgaaacagctgagacacttctaaattcagaagttcatatgctt ctggaacatcgaaagcagcagaatgagagtgcagaggacgaacaggagctctcagaagtc ttcatgaaaacattaaactacacagcccgtttcagtcgtttcaaaaacagagagaccatt gccagtgttcgtagcttggagggacggtttgaagatgaggagctgcagcagattcttgat gatatccagacaaagcgcagctttcagtattaa >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_4|355_aa MGKRRCVPPLEPKLAAGCCGVKKPKLSGSGTHSHGNQSTTVPGSSSGPLQNHQHVDSSSG RENVSDLTLGPGNSPITRMNPASGALSPLPRPNGTANTTKNLVVTAEMCCYCFDVLYCHL YGFPQPRLPRFTNDPYPLFVTWKTGRDKRLRGCIGTFSAMNLHSGLREYTLTSALKDSRF PPLTREELPKLFCSVSLLTNFEDASDYLDWEVGVHGIRIEFINEKGVKRTATYLPEVAKE QDNYTGVTSDRKDAGAGSLGTAIMACVGLSSHVSESPRDWQTDWAPDWDQIQTIDSLLRK GGFKAPITSEFRKTIKLTRYRSEKVTISYAEYIASRQHCFQNGTLHAPPLYNHYS >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_4|1068_bp atgggaaaaagacgttgtgttcctccactcgagcccaagttggcagcaggctgttgtggg gtcaagaagcccaaattatctggaagtggaacgcacagtcacgggaatcagtccacaact gtccccggctctagttcaggacctcttcaaaaccaccagcatgtggacagcagcagtgga cgggagaatgtgtcagacttaactctgggacctggaaactctcccatcacacgaatgaat cccgcatcgggagcgctgagccctcttccccggcctaatggaactgccaacaccactaag aatctggtggtgactgcagagatgtgctgctactgcttcgacgtactctactgtcacctc tatggcttcccacagccacgacttcctagattcaccaatgacccctatccgctctttgtg acgtggaagacagggcgggacaagcggcttcgtggctgcattgggaccttctcagccatg aatcttcattcaggactcagggaatacacgttaaccagtgcacttaaggacagccgattt ccccccctgacccgagaggagctgcctaaacttttctgctctgtctccctccttactaac tttgaggatgccagtgattacctggactgggaggtaggggtccatgggattcgaattgaa ttcattaatgaaaaaggtgtcaaacgcacagccacatatttacctgaggttgctaaggaa caagataattatactggggtaacatctgacaggaaggacgcaggagcagggagcctgggg acagccattatggcatgtgtcgggctatcatctcatgtctcagagtctccacgggactgg cagacagactgggcgccagactgggatcagatccagacaatagactccttgctcaggaaa ggtggctttaaagctccaattaccagtgaattcagaaaaacgatcaaactcaccaggtac cgaagtgagaaggtgacaatcagttacgcagagtatattgcttcccgacagcactgtttc cagaacggcactcttcatgccccgcccctctacaatcattactcctga >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_5|90_aa MGEGQNSNTDRILEVVDFGSSRTLMGDFERFKTSVEKVIADVVEIARELELEVEPENVTE LLQSHYKTDQPHPESSAFSIHSGRFKGAHE >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_5|273_bp atgggagaaggtcaaaatagcaacactgacaggattttggaagtagttgattttggaagt agtagaaccctcatgggtgacttcgagaggttcaagacttcagtggagaaagtaattgct gatgtggtggaaatagcaagagaactagaattagaagtggagccggaaaatgtgactgaa ttgctgcagtctcattataaaactgaccagccccatcctgaatcatctgccttcagcata cactcaggtcgtttcaaaggggcccacgaataa >gi568815596r:127748110_127958100|GENSCAN_predicted_peptide_6|370_aa XAKPKSEIHVSMATPVTVSMETVSNQNNDQPTIAVPPTAQQPPPTIPTMIAAASPPSQPA VALSTIPGAVPITPPITTIAAAPPPSVTVGGSLSSVLGPPVPEIKVKEEVEPMDIMRPVS AVPPLATNTVSPSLALLANNLSMPTSDLPPGASPRKKPRKQQHVISTEEGDMMETNSTDD EKSTAKSLLVKAEKRKSPPKEYIDEEGVRYVPVRPRPPITLLRHYRNPWKAAYHHFQRYS DVRVKEEKKAMLQEIANQKGVSCRAQGWKVHLCAAQLLQLTNLEHDVYERLTNLQEGIIP KKKAATDDDLHRINELIQGNMQRCKLVMDQISEARDSMLKVLDHKDRVLKLLNKNGTVKK VSKLKRKEKV >gi568815596r:127748110_127958100|GENSCAN_predicted_CDS_6|1113_bp nntgccaaacccaagtctgaaatccacgtgtctatggccactccggtcactgtgtccatg gagactgtatccaatcaaaataatgatcagcctaccattgccgtccctccaactgcccag cagcccccaccgaccattccaactatgattgcagcagccagtcccccgtcacaaccagcc gttgccctttcaaccattcctggagcggtccccatcactccacccatcaccaccattgca gctgcaccacctccatcagtcactgtgggtggcagtctttcctccgtcttgggccctccc gttcctgaaattaaagtgaaagaagaagtagaaccaatggatatcatgaggccagtttct gcagttcctccactggctaccaacactgtgtctccatctcttgcattgctggcaaacaac ttgtccatgcctacaagtgacctaccacctggtgcctccccaaggaaaaagcctcgaaag caacagcatgtgatctcaacagaagaaggtgacatgatggagacaaacagcactgatgat gagaagtccactgccaagagtcttctggtgaaggctgagaagcgcaagtctcctcccaag gagtatattgatgaggaaggtgtgagatatgtcccagtgcgtccaagaccccccattact ttgcttcgtcactatcggaacccctggaaagctgcttaccaccactttcagaggtacagt gacgtccgggtcaaagaggagaagaaagctatgctgcaggaaatagctaatcagaaagga gtatcctgtcgtgctcaaggctggaaagtccacctctgtgctgcccagttactacagctg acgaatctagaacatgatgtctatgaaagacttactaacctgcaggaagggattatccca aagaaaaaagcagcaacagatgatgatctccaccgaataaacgaactgatacagggaaat atgcagaggtgtaaacttgtgatggatcaaatcagtgaagccagagactccatgcttaag gttttagatcataaagaccgtgtcctgaagctgcttaacaagaacgggactgtcaaaaaa gtgtccaaattgaagcgaaaggaaaaagtctag